Olga Golovneva @OlgaNLP X Profile

Olga Golovneva

@OlgaNLP

Followers

1K

Following

1K

Media

13

Statuses

134

Doing research at Meta AI

Joined October 2022

Don't wanna be here? Send us removal request.

Olga Golovneva

@OlgaNLP

5 months

We have been cooking! 👨‍🍳.🧵(1/6).

Jason Weston

@jaseweston

5 months

🚨Multi-Token Attention🚨.📝: Attention is critical for LLMs, but its weights are computed by single query & key vectors, limiting capability. MTA combines query, key & head operations over multiple tokens, improving performance in terms of PPL, std

8

42

375

Olga Golovneva

@OlgaNLP

5 days

RT @jaseweston: 🪜Introducing: StepWiser🦉.📝: - Reframes stepwise reward modeling as a reasoning task: outputs CoT +….

0

97

0

Grok

@grok

5 days

Join millions who have switched to Grok.

80

150

962

Olga Golovneva

@OlgaNLP

23 days

RT @deedydas: Huge computer science result:. A Tsinghua professor JUST discovered the fastest shortest path algorithm for graphs in 40yrs.….

0

2K

0

Olga Golovneva

@OlgaNLP

24 days

RT @jaseweston: . is today a good day for new paper posts? .🤖Learning to Reason for Factuality 🤖.📝: - New reward f….

0

49

0

Olga Golovneva

@OlgaNLP

29 days

RT @julien_c: BREAKING:. we've partnered with @metaai and @paperswithcode to build a successor to Papers with Code (which was sunsetted yes….

0

221

0

Olga Golovneva

@OlgaNLP

2 months

After almost 10 years in AI, my most cited work is still about *underground* hidden layers

0

10

Olga Golovneva

@OlgaNLP

2 months

RT @adinamwilliams: Our team is hiring a postdoc in (mech) interpretability! The ideal candidate will have research experience in interpret….

0

10

0

Olga Golovneva

@OlgaNLP

2 months

✨MTA was accepted at #COLM2025 ✨ Since our first announcement, we have updated the paper with scaling laws, new baselines, and more evaluations! Code is now available in our repo: @COLM_conf.

Olga Golovneva

@OlgaNLP

5 months

We have been cooking! 👨‍🍳.🧵(1/6).

0

5

19

Olga Golovneva

@OlgaNLP

2 months

RT @jaseweston: Reasoning, Attention & Memory Workshop @ COLM.Submission Deadline: June 23, 2025 -- Today!.

0

14

0

Olga Golovneva

@OlgaNLP

2 months

Don't lose it, reuse it! Instead of filtering out bad data samples, we propose to rewrite low-quality samples for better quality.

Thao Nguyen

@thao_nguyen26

2 months

Web data, the “fossil fuel of AI”, is being exhausted. What’s next?🤔.We propose Recycling the Web to break the data wall of pretraining via grounded synthetic data. It is more effective than standard data filtering methods, even with multi-epoch repeats!.

0

13

Olga Golovneva

@OlgaNLP

3 months

🍿🍿🥤.

Rohan Paul

@rohanpaul_ai

3 months

A follow-up study on Apple's "Illusion of Thinking" Paper is published now. Shows the same models succeed once the format lets them give compressed answers, proving the earlier collapse was a measurement artifact. Token limits, not logic, froze the models. Collapse vanished

0

2

Olga Golovneva

@OlgaNLP

3 months

Our presentations are now available online!.

Arthur Douillard

@Ar_Douillard

3 months

The videos of the workshop are now online!.

0

1

4

Olga Golovneva

@OlgaNLP

3 months

Just look at the list of the invited speakers! Submit your papers by June 23rd, we are waiting for you 🫵🏿.

Jason Weston

@jaseweston

3 months

🚨Announcing RAM 2 workshop @ COLM25 - call for papers🚨 .- 10 years on, we present the sequel to the classic RAM🐏 (Reasoning, Attention, Memory) workshop that took place in 2015 at the cusp of major change in the area. Now in 2025 we reflect on what's happened and discuss the

0

1

7

Olga Golovneva

@OlgaNLP

4 months

RT @polkirichenko: We are hiring a PhD research intern at FAIR w/ @marksibrahim @kamalikac to start this summer or Fall!.Potential topics:….

0

36

0

Olga Golovneva

@OlgaNLP

4 months

RT @ArchikiPrasad: 🎉 Excited to share that my internship work, ScPO, on self-training LLMs to improve reasoning without human labels, has b….

0

35

0

Olga Golovneva

@OlgaNLP

4 months

RT @johannes_hage: panel on decentralized training @ ICLR

0

5

0

Olga Golovneva

@OlgaNLP

4 months

Thanks to all the organizers, it was a pleasure to attend and learn from great speakers!.

Arthur Douillard

@Ar_Douillard

4 months

@ahmetustun89 @PluralisHQ @PierreAlbin @snehaark Collaborative and modular training by @OlgaNLP

2

3

36

Olga Golovneva

@OlgaNLP

4 months

There are still some spots in the room, but they are running low fast!.

Arthur Douillard

@Ar_Douillard

4 months

starting soon, in hall 4 #3. see you all there and follow that thread :)

0

3

12

Olga Golovneva

@OlgaNLP

4 months

RT @Ar_Douillard: the workshop is tomorrow!. and it will be livestreamed at for those not in Singapore.

0

6

0

Olga Golovneva

@OlgaNLP

5 months

RT @jaseweston: Google friends & ex-colleagues -- Google scholar seems pretty broken😔. Our most cited paper from last year "Self-Rewarding….

0

10

0

Olga Golovneva

@OlgaNLP

5 months

RT @tesatory: Ten years ago in 2015 we published a paper called End-to-End Memory Networks (. Looking back, this pa….

0

120

0