Apoorv Khandelwal Profile
Apoorv Khandelwal

@apoorvkh

Followers
639
Following
3K
Media
19
Statuses
286

cs phd student at brown

Providence, RI
Joined April 2019
Don't wanna be here? Send us removal request.
@apoorvkh
Apoorv Khandelwal
1 month
In our new paper, we ask whether language models solve compositional tasks using compositional mechanisms. ๐Ÿงต
4
27
184
@aryaman2020
Aryaman Arora
6 days
i hate ML conference reviewers. i take back everything bad i ever said about ACL. every ACL reviewer i ever got was at least literate
15
20
479
@apoorvkh
Apoorv Khandelwal
9 days
Would highly recommend working with Noah and Hadar!
@Jimantha
Noah Snavely
10 days
This is a great program for folks interested in postdocs @Cornell. For vision/graphics folks interested in NYC, @ElorHadar, @andrewhowens, and I are potentially recruiting a joint postdoc. Please apply!
0
0
7
@jack_merullo_
Jack Merullo
11 days
How is memorized data stored in a model? We disentangle MLP weights in LMs and ViTs into rank-1 components based on their curvature in the loss, and find representational signatures of both generalizing structure and memorized training data
9
62
492
@yong_zhengxin
Yong Zheng-Xin (Yong)
20 days
๐Ÿšจ Reasoning models can โ€œself-jailbreakโ€: they recognize a request is harmful, invent a reason why itโ€™s fine, then help with it. We found that after training on benign math/code reasoning, models emergently start to reason themselves out of safety alignment. ๐Ÿงต๐Ÿ‘‡
1
6
17
@mattf1n
Matthew Finlayson
1 month
We discovered that language models leave a natural "signature" on their API outputs that's extremely hard to fake. Here's how it works ๐Ÿ” ๐Ÿ“„ https://t.co/Yc7mnhZS96 1/
Tweet card summary image
arxiv.org
The ubiquity of closed-weight language models with public-facing APIs has generated interest in forensic methods, both for extracting hidden model details (e.g., parameters) and for identifying...
3
22
116
@alexisjross
Alexis Ross
1 month
Can LLMs reason like a student? ๐Ÿ‘ฉ๐Ÿปโ€๐ŸŽ“๐Ÿ“šโœ๏ธ For educational tools like AI tutors, modeling how students make mistakes is crucial. But current LLMs are much worse at simulating student errors โŒ than performing correct โœ… reasoning. We try to fix that with our method MISTAKE ๐Ÿคญ๐Ÿ‘‡
11
55
337
@Michael_Lepori
Michael Lepori
1 month
What does your favorite language model know about the real world? ๐ŸŒŽ Can it distinguish between possible and impossible events? We find that LM representations not only encode these distinctions, but that they predict human judgments of event plausibility!
1
7
22
@soniakmurthy
Sonia Murthy
1 month
Excited to present our new paper as a spotlight talk ๐ŸŒŸ at the Pragmatic Reasoning in LMs workshop at #COLM2025 this Friday! ๐Ÿ Come by room 520B @ 11:30am tomorrow to learn more about how LLMs' pluralistic values evolve over reasoning budgets and alignment ๐Ÿงต
1
5
28
@lambdaviking
William Merrill
1 month
My thesis, ๐˜ˆ ๐˜ต๐˜ฉ๐˜ฆ๐˜ฐ๐˜ณ๐˜บ ๐˜ฐ๐˜ง ๐˜ต๐˜ฉ๐˜ฆ ๐˜ค๐˜ฐ๐˜ฎ๐˜ฑ๐˜ถ๐˜ต๐˜ข๐˜ต๐˜ช๐˜ฐ๐˜ฏ๐˜ข๐˜ญ ๐˜ฑ๐˜ฐ๐˜ธ๐˜ฆ๐˜ณ ๐˜ข๐˜ฏ๐˜ฅ ๐˜ญ๐˜ช๐˜ฎ๐˜ช๐˜ต๐˜ข๐˜ต๐˜ช๐˜ฐ๐˜ฏ๐˜ด ๐˜ฐ๐˜ง ๐˜ญ๐˜ข๐˜ฏ๐˜จ๐˜ถ๐˜ข๐˜จ๐˜ฆ ๐˜ฎ๐˜ฐ๐˜ฅ๐˜ฆ๐˜ญ๐˜ช๐˜ฏ๐˜จ ๐˜ข๐˜ณ๐˜ค๐˜ฉ๐˜ช๐˜ต๐˜ฆ๐˜ค๐˜ต๐˜ถ๐˜ณ๐˜ฆ๐˜ด, is now online:
8
46
386
@apoorvkh
Apoorv Khandelwal
1 month
Which mechanism is used seems tied to the geometry of embedding space. Models tend to employ the direct mechanism if there's a linear mapping from x to y in the embedding spaces. (Each dot represents a compositional task, see paper for details.) [End of ๐Ÿงต]
0
0
9
@apoorvkh
Apoorv Khandelwal
1 month
We use logit lens to identify two processing mechanisms in Llama 3 (3B) as it solves compositions. Left: a compositional mechanism, which computes z = f(x) along the way to y = g(f(x)). Right: a direct mechanism, which doesn't.
1
1
7
@apoorvkh
Apoorv Khandelwal
1 month
Paper: https://t.co/rVtBeqMb7r Code: https://t.co/u0XjYzqPca Brief hightlights below!
1
0
7
@apoorvkh
Apoorv Khandelwal
2 months
Our โ€œacademic pre-trainingโ€ paper was accepted to COLM! Iโ€™ll be presenting at the Tuesday (11 AM) poster session!
@apoorvkh
Apoorv Khandelwal
1 year
Wondering how long it takes to train a 1B-param LM from scratch on your GPUs? ๐Ÿงต See our paper to learn about the current state of academic compute and how to efficiently train models! Use our code to test your own models/GPUs! https://t.co/hvrjwlApN8 https://t.co/1JnEe2CCLr
0
3
19
@AkshatS07
Akshat Shrivastava
2 months
1/ While the world focuses on cognitive intelligence and digital agents, AI deployments in the real world have been stagnant. We are excited to announce our first milestone in building a Physical AI system - Isaac 0.1 - establishing the efficient frontier for perception.
@perceptroninc
Perceptron AI
2 months
1/ Introducing Isaac 0.1 โ€” our first perceptive-language model. 2B params, open weights. Matches or beats models significantly larger on core perception. We are pushing the efficient frontier for physical AI. https://t.co/dJ1Wjh2ARK
7
7
31
@ezyang
Edward Z. Yang
3 months
Without further ado, The Parallelism Mesh Zoo
8
55
512
@Kenneth_Marino
Kenneth Marino
3 months
Super excited that the Computer Use survey I've been working on w/ @anmarasovic for a while now is ready! Originally we were planning on a more traditional survey paper but as more surveys came out we decided on an interactive website survey.
2
9
24
@sarahcat21
Sarah Catanzaro
3 months
13/ If we fix it, we can scale good science as quickly as we scale bigger models. Thatโ€™s the future worth betting on. https://t.co/yqOdUjAAyM
amplifypartners.com
What if the biggest bottleneck in AI research isn't more ideas โ€“ it's actually testing them?
3
6
23
@davidbau
David Bau
3 months
Thanks to all for making NEMI 2025 a wonderful event. Fascinating talks, inspiring posters, important discussions. You surfaced the questions animating our growing field. I learned many things and hope you did too! Looking forward to what the next year will bring.
2
15
99
@najoungkim
Najoung Kim ๐Ÿซ 
3 months
Pulling this opportunity on research agent evaluation up one more time! The official title of the position will be "Senior research technician". Feel free to email either @sebschu or me directly if you have any questions. Link for more detailed info and where to apply in ๐Ÿงต
@najoungkim
Najoung Kim ๐Ÿซ 
4 months
๐Ÿ‘พ Full-time research assistant position (1 year) with @sebschu and me! ๐Ÿ‘พ We're looking for someone to join the research agent evaluation team, starting Fall 2025. Application link to be available soon, but feel free to send us your CV and/or come talk to us at #ACL2025. ๐Ÿงต
2
7
18