Apoorv Khandelwal Profile
Apoorv Khandelwal

@apoorvkh

Followers
548
Following
3K
Media
16
Statuses
256

cs phd student at brown

Providence, RI
Joined April 2019
Don't wanna be here? Send us removal request.
@apoorvkh
Apoorv Khandelwal
8 months
Wondering how long it takes to train a 1B-param LM from scratch on your GPUs? 🧵. See our paper to learn about the current state of academic compute and how to efficiently train models! Use our code to test your own models/GPUs!.
10
98
656
@apoorvkh
Apoorv Khandelwal
23 hours
RT @kpal_koyena: 🚨 Registration is live! 🚨. The New England Mechanistic Interpretability (NEMI) Workshop is happening August 22nd 2025 at N….
0
25
0
@apoorvkh
Apoorv Khandelwal
24 hours
RT @beenwrekt: The NeurIPS paper checklist corroborates the bureaucratic theory of statistics.
0
25
0
@apoorvkh
Apoorv Khandelwal
12 days
Is there a clear choice or difference between Cursor, VS Code + Copilot, or something else? They both seem quite similar to me (VS Code-based, chat, tab complete, same downstream LLMs, etc). Thoughts?.
2
0
1
@apoorvkh
Apoorv Khandelwal
22 days
RT @mattdeitke: Molmo won the Best Paper Honorable Mention award @CVPR!. This work was a long journey over 1.5 years, from failing to get s….
0
16
0
@apoorvkh
Apoorv Khandelwal
1 month
RT @ruochenz_: 🤔Ever wonder why LLMs give inconsistent answers in different languages?. In our paper, we identify two failure points in the….
0
15
0
@apoorvkh
Apoorv Khandelwal
2 months
RT @jxmnop: excited to finally share on arxiv what we've known for a while now:. All Embedding Models Learn The Same Thing. embeddings fro….
0
626
0
@apoorvkh
Apoorv Khandelwal
2 months
RT @lilianweng: Giving your models more time to think before prediction, like via smart decoding, chain-of-thoughts reasoning, latent thoug….
0
432
0
@apoorvkh
Apoorv Khandelwal
2 months
RT @NickATomlin: The long-term goal of AI is to build models that can handle arbitrary tasks, not just ones they’ve been trained on. We hop….
0
29
0
@apoorvkh
Apoorv Khandelwal
2 months
RT @charliermarsh: Today, we’re announcing the preview release of ty, an extremely fast type checker and language server for Python, writte….
0
514
0
@apoorvkh
Apoorv Khandelwal
2 months
RT @amasad: Can’t believe ChatGPT delved the em dash. What a loss.
0
30
0
@apoorvkh
Apoorv Khandelwal
2 months
RT @yong_zhengxin: 📣 New paper!. We observe that reasoning language models finetuned only on English data are capable of zero-shot cross-li….
0
42
0
@apoorvkh
Apoorv Khandelwal
2 months
RT @lambdaviking: Excited to announce I'll be starting as an assistant professor at @TTIC_Connect for fall 2026!. In the meantime, I'll be….
0
24
0
@apoorvkh
Apoorv Khandelwal
2 months
RT @Vercept_ai: Today we're excited to introduce Vy, our AI that sees and acts on your computer. At Vercept, our mission is to reinvent ho….
0
40
0
@apoorvkh
Apoorv Khandelwal
3 months
RT @pycoders: 14 Advanced Python Features
0
4
0
@apoorvkh
Apoorv Khandelwal
3 months
RT @Harvard: The university will not surrender its independence or relinquish its constitutional rights. Neither Harvard nor any other priv….
0
15K
0
@apoorvkh
Apoorv Khandelwal
3 months
RT @jack_merullo_: I joined @GoodfireAI a little over a month ago to do interpretability! I am really excited to extend my work beyond just….
0
5
0
@apoorvkh
Apoorv Khandelwal
3 months
RT @deviparikh: Introducing API. A new era of agentic computer use begins today.
0
110
0
@apoorvkh
Apoorv Khandelwal
4 months
RT @davidbau: Why is interpretability the key to dominance in AI?. Not winning the scaling race, or banning China. Our answer to OSTP/NSF,….
0
69
0
@apoorvkh
Apoorv Khandelwal
4 months
RT @HanSineng: We’re thrilled to welcome you to New England NLP 2025 at Yale University on April 11th in New Haven, CT 🎉 .
0
24
0
@apoorvkh
Apoorv Khandelwal
4 months
RT @StasBekman: Here is a new way of launching your multi-node pytorch trainings - instead of CLI it's a python library that automatically….
0
2
0