Apoorv Khandelwal
@apoorvkh
639 Followers · 3K Following · 19 Media · 286 Statuses
cs phd student at brown
Providence, RI
Joined April 2019
In our new paper, we ask whether language models solve compositional tasks using compositional mechanisms. 🧵
4
27
184
i hate ML conference reviewers. i take back everything bad i ever said about ACL. every ACL reviewer i ever got was at least literate
15
20
479
Would highly recommend working with Noah and Hadar!
This is a great program for folks interested in postdocs @Cornell. For vision/graphics folks interested in NYC, @ElorHadar, @andrewhowens, and I are potentially recruiting a joint postdoc. Please apply!
0
0
7
How is memorized data stored in a model? We disentangle MLP weights in LMs and ViTs into rank-1 components based on their curvature in the loss, and find representational signatures of both generalizing structure and memorized training data
9
62
492
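The rank-1 decomposition mentioned above can be illustrated in a few lines. Note the paper selects components by their curvature in the loss; as a simpler stand-in, this sketch uses plain SVD (all matrices here are synthetic stand-ins, not the paper's method or data) just to show that any weight matrix splits exactly into a sum of rank-1 pieces:

```python
import numpy as np

# A weight matrix is a sum of rank-1 components. The paper chooses the
# basis via loss curvature; plain SVD below only illustrates the
# decomposition itself on a random stand-in for an MLP weight matrix.
rng = np.random.default_rng(0)
W = rng.normal(size=(6, 4))

U, s, Vt = np.linalg.svd(W, full_matrices=False)
components = [s[i] * np.outer(U[:, i], Vt[i]) for i in range(len(s))]

# The rank-1 pieces sum back to the original matrix exactly (up to
# floating-point error).
W_rebuilt = np.sum(components, axis=0)
print(np.allclose(W, W_rebuilt))
```

Each `components[i]` has rank 1, so analyses like the paper's can ask which individual components carry generalizing structure versus memorized examples.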
🚨 Reasoning models can "self-jailbreak": they recognize a request is harmful, invent a reason why it's fine, then help with it. We found that after training on benign math/code reasoning, models emergently start to reason themselves out of safety alignment. 🧵
1
6
17
We discovered that language models leave a natural "signature" on their API outputs that's extremely hard to fake. Here's how it works: https://t.co/Yc7mnhZS96 1/
arxiv.org
The ubiquity of closed-weight language models with public-facing APIs has generated interest in forensic methods, both for extracting hidden model details (e.g., parameters) and for identifying...
3
22
116
Can LLMs reason like a student? For educational tools like AI tutors, modeling how students make mistakes is crucial. But current LLMs are much worse at simulating student errors ❌ than performing correct ✅ reasoning. We try to fix that with our method MISTAKE.
11
55
337
What does your favorite language model know about the real world? Can it distinguish between possible and impossible events? We find that LM representations not only encode these distinctions, but that they predict human judgments of event plausibility!
1
7
22
Excited to present our new paper as a spotlight talk at the Pragmatic Reasoning in LMs workshop at #COLM2025 this Friday! Come by room 520B @ 11:30am tomorrow to learn more about how LLMs' pluralistic values evolve over reasoning budgets and alignment 🧵
1
5
28
My thesis, "A theory of the computational power and limitations of language modeling architectures", is now online:
8
46
386
Which mechanism is used seems tied to the geometry of embedding space: models tend to employ the direct mechanism when there's a linear mapping from x to y in the embedding spaces. (Each dot represents a compositional task; see paper for details.) [End of 🧵]
0
0
9
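The linearity test described above can be sketched as a least-squares fit: regress the y-embeddings on the x-embeddings and check how much variance a single linear map explains. This uses synthetic data (not the paper's embeddings or metric), purely to show the shape of the check:

```python
import numpy as np

# Sketch with synthetic data: are y-embeddings (approximately) a linear
# function of x-embeddings? High R^2 would suggest the kind of linear
# mapping the thread associates with the direct mechanism.
rng = np.random.default_rng(0)
d = 16
X = rng.normal(size=(100, d))                      # embeddings of inputs x
A_true = rng.normal(size=(d, d))
Y = X @ A_true + 0.01 * rng.normal(size=(100, d))  # nearly-linear targets

A_hat, *_ = np.linalg.lstsq(X, Y, rcond=None)      # best linear map X -> Y

# Fraction of variance in Y explained by the fitted linear map.
r2 = 1 - np.sum((X @ A_hat - Y) ** 2) / np.sum((Y - Y.mean(0)) ** 2)
print(round(r2, 3))  # near 1.0 here, since Y was built almost linearly
```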
We use logit lens to identify two processing mechanisms in Llama 3 (3B) as it solves compositions. Left: a compositional mechanism, which computes z = f(x) along the way to y = g(f(x)). Right: a direct mechanism, which doesn't.
1
1
7
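The logit-lens idea above has a simple core: project an intermediate hidden state straight through the unembedding matrix and read off which token that layer "currently" predicts. A toy numpy sketch (random matrices standing in for a real LM's residual stream and unembedding; not the paper's code):

```python
import numpy as np

# Toy stand-ins: in practice, `hidden` would be residual-stream states
# from each layer of a real LM, and W_U its unembedding matrix.
rng = np.random.default_rng(0)
d_model, vocab_size, n_layers = 8, 10, 3
W_U = rng.normal(size=(d_model, vocab_size))
hidden = [rng.normal(size=d_model) for _ in range(n_layers)]

def logit_lens(h, W_U):
    """Decode an intermediate hidden state through the unembedding to
    see which token it would predict if decoding stopped at that layer."""
    logits = h @ W_U
    return int(np.argmax(logits))

# One "current prediction" per layer; in the compositional mechanism,
# the intermediate answer z = f(x) shows up at some middle layer.
intermediate_preds = [logit_lens(h, W_U) for h in hidden]
print(intermediate_preds)
```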
Paper: https://t.co/rVtBeqMb7r Code: https://t.co/u0XjYzqPca Brief highlights below!
1
0
7
Our "academic pre-training" paper was accepted to COLM! I'll be presenting at the Tuesday (11 AM) poster session!
Wondering how long it takes to train a 1B-param LM from scratch on your GPUs? 🧵 See our paper to learn about the current state of academic compute and how to efficiently train models! Use our code to test your own models/GPUs! https://t.co/hvrjwlApN8
https://t.co/1JnEe2CCLr
0
3
19
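A first-order version of this estimate uses the common ~6·N·D training-FLOPs heuristic (N parameters, D tokens) divided by achieved throughput. The GPU count, throughput, and MFU below are illustrative assumptions, not the paper's measurements:

```python
# Back-of-envelope training-time estimate via the ~6*N*D FLOPs heuristic.
def train_days(n_params, n_tokens, gpus, flops_per_gpu, mfu=0.4):
    total_flops = 6 * n_params * n_tokens        # approx. training FLOPs
    effective = gpus * flops_per_gpu * mfu       # achieved FLOP/s at given MFU
    return total_flops / effective / 86_400      # seconds -> days

# Hypothetical example: 1B params on 20B tokens with 4 A100s
# (~312 TFLOP/s bf16 peak each) at 40% model FLOPs utilization.
days = train_days(1e9, 20e9, gpus=4, flops_per_gpu=312e12, mfu=0.4)
print(round(days, 1))  # ~2.8 days under these assumptions
```

Real runs deviate from this (data loading, sequence length, parallelism overheads), which is exactly what the paper measures empirically.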
1/ While the world focuses on cognitive intelligence and digital agents, AI deployments in the real world have been stagnant. We are excited to announce our first milestone in building a Physical AI system - Isaac 0.1 - establishing the efficient frontier for perception.
1/ Introducing Isaac 0.1: our first perceptive-language model. 2B params, open weights. Matches or beats significantly larger models on core perception. We are pushing the efficient frontier for physical AI. https://t.co/dJ1Wjh2ARK
7
7
31
Super excited that the Computer Use survey I've been working on w/ @anmarasovic for a while now is ready! Originally we were planning a more traditional survey paper, but as more surveys came out, we decided on an interactive website survey.
2
9
24
13/ If we fix it, we can scale good science as quickly as we scale bigger models. That's the future worth betting on. https://t.co/yqOdUjAAyM
amplifypartners.com
What if the biggest bottleneck in AI research isn't more ideas, but actually testing them?
3
6
23
Thanks to all for making NEMI 2025 a wonderful event. Fascinating talks, inspiring posters, important discussions. You surfaced the questions animating our growing field. I learned many things and hope you did too! Looking forward to what the next year will bring.
2
15
99
Pulling this opportunity on research agent evaluation up one more time! The official title of the position will be "Senior research technician". Feel free to email either @sebschu or me directly if you have any questions. Link for more detailed info and where to apply in 🧵
2
7
18