
Manasi Sharma
@ManasiSharma_
Followers
344
Following
114
Media
13
Statuses
44
research engineer @scale_AI, working on reasoning for frontier models, agents, rl | prev @stanford, @StanfordAILab, @mitll, @Columbia
Stanford, CA
Joined July 2020
Happy CS Commencement Day! .S/o to Prof. @chrmanning for making CS 224N (Natural Language Processing) one of the most fun and rewarding courses that I have ever TA'd for😊.@stanfordnlp @StanfordEng @Stanford @StanfordAILab
2
5
61
new work on rlvr + natural language guidance to enhance a model's ability to reason in its own words.
What will the learning environments of the future look like that train artificial super intelligence? In recent work at @scale_AI , we show that training systems that combine verifiable rewards with multi-agent interaction accelerate learning.
1
1
6
first work since joining Scale is out!.
Adaptive Guidance Accelerates Reinforcement Learning of Reasoning Models. "Guide adaptively incorporates hints into the model’s context on problems for which all rollouts were initially incorrect and adjusts the importance sampling ratio for the "off-policy" trajectories in order
2
0
4
excited to share that i have joined the frontier data research team at @scale_AI!🎉 i will be working on research in the domains of LLM reasoning, agents and rl.
0
0
6
RT @askalphaxiv: DeepSeek-R1 now offers an open source model with the ability to explain its intermediate reasoning. But how do explanation….
0
2
0
Amazing initiative by @rajpalleti314 and team!.
Goodreads for arXiv papers💡. What if instead of arbitrary algorithms and tweets, arXiv papers were curated by your research community?. Introducing communities on alphaXiv: bridging papers, discussions, and people in one space.
1
2
6
s/o @shaneguML for a very fun chat on multilinguality (and for taking the best pictures at the conference)
1
0
2
Been a while since I taught 224n, thank you for the kind words :).
Day 20 of NLP: .- finished cs224n lecture 19 .- it was purely python tutorial .- did a recap on numpy on:. > broadcasting . > matrix multiplication . > hardamard nproduct . @ManasiSharma_ is good instructor.
0
0
6