Olga Golovneva Profile
Olga Golovneva

@OlgaNLP

Followers
1K
Following
1K
Media
13
Statuses
134

Doing research at Meta AI

Joined October 2022
Don't wanna be here? Send us removal request.
@OlgaNLP
Olga Golovneva
5 months
We have been cooking! 👨‍🍳.🧵(1/6).
@jaseweston
Jason Weston
5 months
🚨Multi-Token Attention🚨.📝: Attention is critical for LLMs, but its weights are computed by single query & key vectors, limiting capability. MTA combines query, key & head operations over multiple tokens, improving performance in terms of PPL, std
Tweet media one
8
42
375
@OlgaNLP
Olga Golovneva
5 days
RT @jaseweston: 🪜Introducing: StepWiser🦉.📝: - Reframes stepwise reward modeling as a reasoning task: outputs CoT +….
0
97
0
@grok
Grok
5 days
Join millions who have switched to Grok.
80
150
962
@OlgaNLP
Olga Golovneva
23 days
RT @deedydas: Huge computer science result:. A Tsinghua professor JUST discovered the fastest shortest path algorithm for graphs in 40yrs.….
0
2K
0
@OlgaNLP
Olga Golovneva
24 days
RT @jaseweston: . is today a good day for new paper posts? .🤖Learning to Reason for Factuality 🤖.📝: - New reward f….
0
49
0
@OlgaNLP
Olga Golovneva
29 days
RT @julien_c: BREAKING:. we've partnered with @metaai and @paperswithcode to build a successor to Papers with Code (which was sunsetted yes….
0
221
0
@OlgaNLP
Olga Golovneva
2 months
After almost 10 years in AI, my most cited work is still about *underground* hidden layers
Tweet media one
0
0
10
@OlgaNLP
Olga Golovneva
2 months
RT @adinamwilliams: Our team is hiring a postdoc in (mech) interpretability! The ideal candidate will have research experience in interpret….
0
10
0
@OlgaNLP
Olga Golovneva
2 months
✨MTA was accepted at #COLM2025 ✨ Since our first announcement, we have updated the paper with scaling laws, new baselines, and more evaluations! Code is now available in our repo: @COLM_conf.
@OlgaNLP
Olga Golovneva
5 months
We have been cooking! 👨‍🍳.🧵(1/6).
0
5
19
@OlgaNLP
Olga Golovneva
2 months
RT @jaseweston: Reasoning, Attention & Memory Workshop @ COLM.Submission Deadline: June 23, 2025 -- Today!.
0
14
0
@OlgaNLP
Olga Golovneva
2 months
Don't lose it, reuse it! Instead of filtering out bad data samples, we propose to rewrite low-quality samples for better quality.
@thao_nguyen26
Thao Nguyen
2 months
Web data, the “fossil fuel of AI”, is being exhausted. What’s next?🤔.We propose Recycling the Web to break the data wall of pretraining via grounded synthetic data. It is more effective than standard data filtering methods, even with multi-epoch repeats!.
Tweet media one
0
0
13
@OlgaNLP
Olga Golovneva
3 months
🍿🍿🥤.
@rohanpaul_ai
Rohan Paul
3 months
A follow-up study on Apple's "Illusion of Thinking" Paper is published now. Shows the same models succeed once the format lets them give compressed answers, proving the earlier collapse was a measurement artifact. Token limits, not logic, froze the models. Collapse vanished
Tweet media one
0
0
2
@OlgaNLP
Olga Golovneva
3 months
Our presentations are now available online!.
@Ar_Douillard
Arthur Douillard
3 months
The videos of the workshop are now online!.
0
1
4
@OlgaNLP
Olga Golovneva
3 months
Just look at the list of the invited speakers! Submit your papers by June 23rd, we are waiting for you 🫵🏿.
@jaseweston
Jason Weston
3 months
🚨Announcing RAM 2 workshop @ COLM25 - call for papers🚨 .- 10 years on, we present the sequel to the classic RAM🐏 (Reasoning, Attention, Memory) workshop that took place in 2015 at the cusp of major change in the area. Now in 2025 we reflect on what's happened and discuss the
Tweet media one
Tweet media two
0
1
7
@OlgaNLP
Olga Golovneva
4 months
RT @polkirichenko: We are hiring a PhD research intern at FAIR w/ @marksibrahim @kamalikac to start this summer or Fall!.Potential topics:….
0
36
0
@OlgaNLP
Olga Golovneva
4 months
RT @ArchikiPrasad: 🎉 Excited to share that my internship work, ScPO, on self-training LLMs to improve reasoning without human labels, has b….
0
35
0
@OlgaNLP
Olga Golovneva
4 months
RT @johannes_hage: panel on decentralized training @ ICLR
Tweet media one
0
5
0
@OlgaNLP
Olga Golovneva
4 months
Thanks to all the organizers, it was a pleasure to attend and learn from great speakers!.
@Ar_Douillard
Arthur Douillard
4 months
@ahmetustun89 @PluralisHQ @PierreAlbin @snehaark Collaborative and modular training by @OlgaNLP
Tweet media one
2
3
36
@OlgaNLP
Olga Golovneva
4 months
There are still some spots in the room, but they are running low fast!.
@Ar_Douillard
Arthur Douillard
4 months
starting soon, in hall 4 #3. see you all there and follow that thread :)
Tweet media one
0
3
12
@OlgaNLP
Olga Golovneva
4 months
RT @Ar_Douillard: the workshop is tomorrow!. and it will be livestreamed at for those not in Singapore.
0
6
0
@OlgaNLP
Olga Golovneva
5 months
RT @jaseweston: Google friends & ex-colleagues -- Google scholar seems pretty broken😔. Our most cited paper from last year "Self-Rewarding….
0
10
0
@OlgaNLP
Olga Golovneva
5 months
RT @tesatory: Ten years ago in 2015 we published a paper called End-to-End Memory Networks (. Looking back, this pa….
0
120
0