Suraj Srinivas @Suuraj X Profile

Suraj Srinivas

@Suuraj

Followers

1K

Following

8K

Media

38

Statuses

959

ml researcher / trying to understand why deep learning works

Sunnyvale, CA

Joined June 2009

Suraj Srinivas

@Suuraj

1 day

RT @jxmnop: first i thought scaling laws originated in OpenAI (2020). then i thought they came from Baidu (2017). now i am enlightened:.Sca….

0

121

0

Suraj Srinivas

@Suuraj

6 days

RT @fchollet: LLM adoption among US workers is closing in on 50%. Meanwhile labor productivity growth is lower than in 2020. Many counter-….

0

588

0

Suraj Srinivas

@Suuraj

10 days

RT @DimitrisPapail: Thinking about model generalization is quite painful. We observe empirically that models trained with SGD on cross-en….

0

57

0

Suraj Srinivas

@Suuraj

1 month

RT @Michael_J_Black: Here's how my recent papers & reviews are going:. * To solve a vision problem today, the sensible thing is to leverage….

0

58

0

Suraj Srinivas

@Suuraj

1 month

RT @alex_oesterling: ‼️🕚New paper alert with @ushabhalla_: Leveraging the Sequential Nature of Language for Interpretability ( https://t.co/….

0

8

0

Suraj Srinivas

@Suuraj

2 months

Also, I'll be at ICML next week presenting this. Come say hi if you're around!

0

3

Suraj Srinivas

@Suuraj

2 months

It turns out that you can train your LLM by injecting benchmark eval data into your train data, and still have no effect on benchmark evals! Accepted at @icmlconf. Joint work with @sbordt @valentynepii and Ulrike von Luxburg.

Sebastian Bordt@ICML

@sbordt

2 months

Have you ever wondered whether a few times of data contamination really lead to benchmark overfitting?🤔 Then our latest paper about the effect of data contamination on LLM evals might be for you!🚀 "How Much Can We Forget about Data Contamination?" (accepted at #ICML2025) shows

1

7

Suraj Srinivas

@Suuraj

2 months

RT @sbordt: Have you ever wondered whether a few times of data contamination really lead to benchmark overfitting?🤔 Then our latest paper a….

0

6

0

Suraj Srinivas

@Suuraj

3 months

RT @jxmnop: ## The case for more ambition. i wrote about how AI researchers should ask bigger and simpler questions, and publish fewer pap….

0

96

0

Suraj Srinivas

@Suuraj

3 months

RT @ML_Theorist: Why does Chain of Thought prompting actually work?.@bohang_zhang will be talking about it today. Join us!. @Suuraj @tverven.

0

2

0

Suraj Srinivas

@Suuraj

3 months

RT @GoodfireAI: We created a canvas that plugs into an image model’s brain. You can use it to generate images in real-time by painting wit….

0

98

0

Suraj Srinivas

@Suuraj

3 months

we live in a world where "verification is easier than generation" is no longer true.

arlo_son

@gson_AI

3 months

#NLProc AI Co-Scientists 🤖 can generate ideas, but can they spot mistakes? (not yet! 🚫) In my recent paper, we introduce SPOT, a dataset of STEM manuscripts (math, materials science, chemistry, physics, etc.), annotated with real errors. SOTA models like o3, gemini-2.5-pro

0

6

Suraj Srinivas

@Suuraj

3 months

RT @gson_AI: #NLProc.AI Co-Scientists 🤖 can generate ideas, but can they spot mistakes? (not yet! 🚫). In my recent paper, we introduce SPOT….

0

38

0

Suraj Srinivas

@Suuraj

3 months

RT @ML_Theorist: ⏰⏰ Theory of Interpretable AI Seminar ⏰⏰.Chain-of-Thought: Why does explaining to LLMs using CoT prompting work?. Join us….

0

2

0

Suraj Srinivas

@Suuraj

3 months

RT @norabelrose: data attribution is the most neglected thing in interpretability and people should join me in working on it.

0

4

0

Suraj Srinivas

@Suuraj

4 months

RT @ML_Theorist: Curious about feature attribution? . SHAP & LIME treat features independently—but features interact!.Come hear how to "Dis….

0

1

0

Suraj Srinivas

@Suuraj

4 months

RT @ML_Theorist: In April 2024, we launched the Theory of Interpretable XAI seminar, aiming to build a community—unsure if we’d even have e….

0

3

0

Suraj Srinivas

@Suuraj

4 months

RT @ML_Theorist: ⏰⏰Theory of Interpretable AI Seminar ⏰⏰. Interested in Feature Attribution Explanations?. In two weeks, May 6, Gunnar Köni….

0

1

0

Suraj Srinivas

@Suuraj

5 months

RT @ML_Theorist: Today in **two hours** @mirco_mutti will talk about interpretable bandits. Zoom link: @Suuraj @t….

0

1

0

Suraj Srinivas

@Suuraj

5 months

RT @orthonormalist: DID I CRACK IT?. I think I figured out at least a chunk of the math. It's trade deficit divided by their exports. EU:….

0

2K

0