Suuraj Profile Banner
Suraj Srinivas Profile
Suraj Srinivas

@Suuraj

Followers
1K
Following
8K
Media
38
Statuses
959

ml researcher / trying to understand why deep learning works

Sunnyvale, CA
Joined June 2009
Don't wanna be here? Send us removal request.
@Suuraj
Suraj Srinivas
1 day
RT @jxmnop: first i thought scaling laws originated in OpenAI (2020). then i thought they came from Baidu (2017). now i am enlightened:.Sca….
0
121
0
@Suuraj
Suraj Srinivas
6 days
RT @fchollet: LLM adoption among US workers is closing in on 50%. Meanwhile labor productivity growth is lower than in 2020. Many counter-….
0
588
0
@grok
Grok
8 hours
Join millions who have switched to Grok.
37
69
503
@Suuraj
Suraj Srinivas
10 days
RT @DimitrisPapail: Thinking about model generalization is quite painful. We observe empirically that models trained with SGD on cross-en….
0
57
0
@Suuraj
Suraj Srinivas
1 month
RT @Michael_J_Black: Here's how my recent papers & reviews are going:. * To solve a vision problem today, the sensible thing is to leverage….
0
58
0
@Suuraj
Suraj Srinivas
1 month
RT @alex_oesterling: ‼️🕚New paper alert with @ushabhalla_: Leveraging the Sequential Nature of Language for Interpretability ( https://t.co/….
0
8
0
@Suuraj
Suraj Srinivas
2 months
Also, I'll be at ICML next week presenting this. Come say hi if you're around!.
0
0
3
@Suuraj
Suraj Srinivas
2 months
It turns out that you can train your LLM by injecting benchmark eval data into your train data, and still have no effect on benchmark evals! . Accepted at @icmlconf. Joint work with @sbordt @valentynepii and Ulrike von Luxburg.
@sbordt
Sebastian Bordt@ICML
2 months
Have you ever wondered whether a few times of data contamination really lead to benchmark overfitting?🤔 Then our latest paper about the effect of data contamination on LLM evals might be for you!🚀. "How Much Can We Forget about Data Contamination?" (accepted at #ICML2025) shows
1
1
7
@Suuraj
Suraj Srinivas
2 months
RT @sbordt: Have you ever wondered whether a few times of data contamination really lead to benchmark overfitting?🤔 Then our latest paper a….
0
6
0
@Suuraj
Suraj Srinivas
3 months
RT @jxmnop: ## The case for more ambition. i wrote about how AI researchers should ask bigger and simpler questions, and publish fewer pap….
0
96
0
@Suuraj
Suraj Srinivas
3 months
RT @ML_Theorist: Why does Chain of Thought prompting actually work?.@bohang_zhang will be talking about it today. Join us!. @Suuraj @tverven.
0
2
0
@Suuraj
Suraj Srinivas
3 months
RT @GoodfireAI: We created a canvas that plugs into an image model’s brain. You can use it to generate images in real-time by painting wit….
0
98
0
@Suuraj
Suraj Srinivas
3 months
we live in a world where "verification is easier than generation" is no longer true.
@gson_AI
arlo_son
3 months
#NLProc.AI Co-Scientists 🤖 can generate ideas, but can they spot mistakes? (not yet! 🚫). In my recent paper, we introduce SPOT, a dataset of STEM manuscripts (math, materials science, chemistry, physics, etc), annotated with real errors. SOTA models like o3, gemini-2.5-pro
Tweet media one
0
0
6
@Suuraj
Suraj Srinivas
3 months
RT @gson_AI: #NLProc.AI Co-Scientists 🤖 can generate ideas, but can they spot mistakes? (not yet! 🚫). In my recent paper, we introduce SPOT….
0
38
0
@Suuraj
Suraj Srinivas
3 months
RT @ML_Theorist: ⏰⏰ Theory of Interpretable AI Seminar ⏰⏰.Chain-of-Thought: Why does explaining to LLMs using CoT prompting work?. Join us….
0
2
0
@Suuraj
Suraj Srinivas
3 months
RT @norabelrose: data attribution is the most neglected thing in interpretability and people should join me in working on it.
0
4
0
@Suuraj
Suraj Srinivas
4 months
RT @ML_Theorist: Curious about feature attribution? . SHAP & LIME treat features independently—but features interact!.Come hear how to "Dis….
0
1
0
@Suuraj
Suraj Srinivas
4 months
RT @ML_Theorist: In April 2024, we launched the Theory of Interpretable XAI seminar, aiming to build a community—unsure if we’d even have e….
0
3
0
@Suuraj
Suraj Srinivas
4 months
RT @ML_Theorist: ⏰⏰Theory of Interpretable AI Seminar ⏰⏰. Interested in Feature Attribution Explanations?. In two weeks, May 6, Gunnar Köni….
0
1
0
@Suuraj
Suraj Srinivas
5 months
RT @ML_Theorist: Today in **two hours** @mirco_mutti will talk about interpretable bandits. Zoom link: @Suuraj @t….
0
1
0
@Suuraj
Suraj Srinivas
5 months
RT @orthonormalist: DID I CRACK IT?. I think I figured out at least a chunk of the math. It's trade deficit divided by their exports. EU:….
0
2K
0