
Runa Eschenhagen
@runame_
Followers: 517
Following: 485
Media: 8
Statuses: 186
PhD student in machine learning @CambridgeMLG and research scientist intern @AIatMeta.
Joined October 2021
RT @ThomasTCKZhang: I’ll be presenting our paper “On The Concurrence of Layer-wise Preconditioning Methods and Provable Feature Learning” a….
RT @kayembruno: You don't need bespoke tools for causal inference. Probabilistic modelling is enough. I'll be making this case (and dodgin….
RT @tmpethick: When comparing optimization methods, we often change *multiple things at once*—geometry, normalization, etc.—possibly withou….
RT @akristiadi7: 📢 [Openings] I'm now an Assistant Prof @WesternU CS dept. Funded PhD & MSc positions available! Topics: large probabilisti….
RT @MarkSchmidtUBC: My former PhD student Fred Kunstner has been awarded the @c_a_i_a_c Best Doctoral Dissertation Award.
RT @aaron_defazio: Why do gradients increase near the end of training? Read the paper to find out! We also propose a simple fix to AdamW t….
RT @orvieto_antonio: Adam is similar to many algorithms, but cannot be effectively replaced by any simpler variant in LMs. The community is….
RT @_katieeverett: 1. We often observe power laws between loss and compute: loss = a * flops ^ b + c. 2. Models are rapidly becoming more ef….
RT @roydanroy: This is a huge development. I want to highlight the theoreticians behind the scene, because this paper represents the reali….
RT @kayembruno: Great to be back from Singapore from #ICLR2025, and super excited to have given my first oral presentation on influence fun….
RT @JonathanWenger5: We have a fantastic lineup of speakers who have made deep contributions to open-source in ML, e.g. @sarahookr, @ChrisR….
RT @zhiyuanli_: Why does Adam outperform SGD in LLMs training? Adaptive step sizes alone don't fully explain this, as Adam also surpasses a….
RT @wormaniec: Ever wondered how the loss landscape of Transformers differs from that of other architectures? Or which Transformer componen….
RT @frankstefansch1: Tired of your open-source ML work not getting the academic recognition it deserves? 🤔 Submit to the first-ever CodeML….
RT @JonathanWenger5: Built a new ML library? Maintain a crucial project? Improved OSS practices? Your work deserves recognition! Submit you….