Sumeet Motwani

@sumeetrm

Followers
1K
Following
4K
Media
43
Statuses
295

Research Intern@Microsoft Phi | ML PhD at Oxford, Previously CS at UC Berkeley

Redmond, WA
Joined February 2024
@sumeetrm
Sumeet Motwani
6 months
Introducing MALT: Improving Reasoning with Multi-Agent LLM Training🫔. We present a new multi-agent post-training method that uses credit assigned synthetic data to improve the reasoning capabilities and self-correction rates of a generator, critic, and refinement model working
13
52
308
@sumeetrm
Sumeet Motwani
4 hours
I'm not sure why people use the human mind as an example of intelligence not requiring a ton of energy. Evolution took plenty, and so does/will model training. Am I missing something?
1
0
8
@sumeetrm
Sumeet Motwani
2 days
RT @ryan_kidd44: MATS 9.0 applications are open! Launch your career in AI alignment, governance, and security with our 12-week research pro…
0
53
0
@sumeetrm
Sumeet Motwani
4 days
cs 189.
@polynoamial
Noam Brown
4 days
To all undergrads interested in learning about AI: be wary of taking “Intro to AI” as your first AI course. In many programs, the class you actually want first is “Intro to Machine Learning”. AI technology has exploded in the past 15 years thanks to deep neural networks. Yet at
0
0
4
@sumeetrm
Sumeet Motwani
7 days
Very interesting paper
0
0
1
@sumeetrm
Sumeet Motwani
14 days
RT @pratyushmaini: 1/Pretraining is hitting a data wall; scaling raw web data alone leads to diminishing returns. Today @datologyai shares…
0
125
0
@sumeetrm
Sumeet Motwani
17 days
Glad to see the focus on Information Theory and Cryptography. Probably one of the most important (yet understudied) areas in AI security
@robertwiblin
Rob Wiblin
18 days
New £15,000,000 available for technical AI alignment and security work. International coalition includes UK AISI, Canadian AISI, Schmidt, AWS, UK ARIA. Likely more £ coming in future. 🚨🚨 Please help make sure all potential good applicants know & apply by 10 Sept. 🚨🚨
0
0
7
@sumeetrm
Sumeet Motwani
17 days
0:07.
@TheHumanoidHub
The Humanoid Hub
17 days
Unitree wins the gold medal for the 1500m run at the World Humanoid Robot Games, setting a world record time of 6 minutes and 34 seconds. The current men's world record is 3:26.
1
0
4
@sumeetrm
Sumeet Motwani
17 days
RT @VaishShrivas: Test-time scaling w/ GRPO boosts accuracy, but also adds “filler tokens” increasing length w/o real progress. We present…
0
48
0
@sumeetrm
Sumeet Motwani
18 days
RT @winglian: I was excited to try out the Dynamic Fine-Tuning proposed in this paper, but as all things that seem too good to be true, it…
0
9
0
@sumeetrm
Sumeet Motwani
24 days
RT @valentina__py: 🔈For the SoLaR workshop @COLM_conf we are soliciting opinion abstracts to encourage new perspectives and opinions on res…
0
12
0
@sumeetrm
Sumeet Motwani
25 days
New figure on their blog, looks more reasonable
0
0
0
@sumeetrm
Sumeet Motwani
25 days
Ironic deception eval
1
0
12
@sumeetrm
Sumeet Motwani
1 month
RT @kuchaev: Everything about Llama-Nemotron-Super-V1.5 post-training is now open: Synthetic data: Human data: http…
github.com
Scalable toolkit for efficient model reinforcement - NVIDIA-NeMo/RL
0
49
0
@sumeetrm
Sumeet Motwani
1 month
RT @guohao_li: Introducing Eigent — the first multi-agent workforce on your desktop. Eigent is a team of AI agents collaborating to comple…
0
138
0
@sumeetrm
Sumeet Motwani
1 month
RT @prfsanjeevarora: Completely misses the point. Nobody is suggesting that solving IMO problems is useful for math research. The point is…
0
38
0
@sumeetrm
Sumeet Motwani
1 month
Given the recent IMO results, OAI seems to have figured out reasoning *reliably* with at least 4 Million tokens.
@polynoamial
Noam Brown
1 month
Also this model thinks for a *long* time. o1 thought for seconds. Deep Research for minutes. This one thinks for hours. Importantly, it’s also more efficient with its thinking. And there’s a lot of room to push the test-time compute and efficiency further.
1
0
12
@sumeetrm
Sumeet Motwani
2 months
RT @DulhanJay: Come and find me today at #ICML2025 and let's talk about speech 💬 decoding from the brain and scaling brain-computer interfa…
0
3
0
@sumeetrm
Sumeet Motwani
2 months
RT @JamesAlcorn94: Plenty of brittle + narrow tooling in this wild west era of codegen can be characterized—not unfairly, and with just a h…
0
5
0
@sumeetrm
Sumeet Motwani
2 months
🤔
@ebbyamir
Ebby Amir
2 months
literally no one asked us to launch waifus, but we did so anyway. update your Grok app now.
1
0
8