
Ekdeep Singh
@EkdeepL
Followers 2K · Following 3K · Media 126 · Statuses 553
Member of Technical Staff @GoodfireAI; Previously: Postdoc / PhD at Center for Brain Science, Harvard and University of Michigan
San Francisco, CA
Joined December 2017
🚨New paper! We know models learn distinct in-context learning strategies, but *why*? Why generalize instead of memorizing to lower loss? And why is generalization transient? Our work explains this & *predicts Transformer behavior throughout training* without its weights! 🧵 1/
9 · 64 · 347
A beauty of a result <3. Adds to the narrative that models learning the data distribution will represent its factors as meaningful representations: in this case, a hierarchical space with a nicely endowed metric!
Arc Institute trained their foundation model Evo 2 on DNA from all domains of life. What has it learned about the natural world? Our new research finds that it represents the tree of life, spanning thousands of species, as a curved manifold in its neuronal activations. (1/8)
1 · 2 · 55
RT @GoodfireAI: Adversarial examples - a vulnerability of every AI model, and a “mystery” of deep learning - may simply come from models cr…
0 · 24 · 0
RT @CogInterp: Due to popular demand, we are extending the CogInterp submission deadline again! Submit by 8/27 (midnight AoE).
0 · 2 · 0
Hehe, @_vaishnavh is too generous 💙, but in my totally unbiased opinion people should certainly check out the paper :)
@ArvidFrydenlund @roydanroy The last one (Towards an Understanding of Stepwise Inference in Transformers, by Khona et al.) is one of my favorite papers and is way ahead of its time for various reasons (I only know one author on it, @EkdeepL).
1 · 0 · 9
RT @lightspeedvp: 🗓️ Mark your calendars for August 26 and join us for a #GenSF meetup covering mechanistic interpretability in modern AI m…
0 · 8 · 0
I am not leaving academia though! I'll be applying for academic jobs this fall. My primary motivation to join @GoodfireAI in the interim was to demonstrate that science and interpretability, when done right, can help us develop safe, reliable, and helpful models.
0 · 1 · 58
Super excited to be joining @GoodfireAI! I'll be scaling up the line of work our group started at Harvard: making predictive accounts of model representations by assuming a model behaves optimally (i.e., good old rational analysis from cogsci!).
Thrilled to welcome @EkdeepL to the team! Ekdeep is working on a new research agenda on “cognitive interpretability”, aimed at adapting and improving theories of human cognition to design tools for explaining model cognition.
41 · 18 · 329
RT @CogInterp: 📆 The deadline for submission to CogInterp has officially been extended to 8/22 (midnight AoE). We look forward to seeing wh…
0 · 2 · 0
Exciting labs starting everywhere :)
Excited to announce that I will be joining @UTAustin with a joint position between @OdenInstitute for Computational Science and the Department of Neuroscience in Fall 2026! I plan on recruiting PhD students and postdocs interested in the mathematics of neural computation (more details to come).
0 · 0 · 11
RT @AmirZur2000: 1/6 🦉Did you know that telling an LLM that it loves the number 087 also makes it love owls? In our new blogpost, It's Owl…
owls.baulab.info
Entangled tokens help explain subliminal learning.
0 · 72 · 0
Tübingen just got ultra-exciting :D
🚨 Incredibly excited to share that I'm starting my research group focusing on AI safety and alignment at the ELLIS Institute Tübingen and Max Planck Institute for Intelligent Systems in September 2025! 🚨 Hiring: I'm looking for multiple PhD students, both those able to start…
0 · 2 · 18
RT @GoodfireAI: New research with coauthors at @Anthropic, @GoogleDeepMind, @AiEleuther, and @decode_research! We expand on and open-source…
0 · 22 · 0
RT @victoria_r_li: Charts and graphs help people analyze data, but can they also help AI? In a new paper, we provide initial evidence that…
0 · 3 · 0
RT @SaxeLab: Excited to share new work @icmlconf by Loek van Rossem exploring the development of computational algorithms in recurrent neur…
openreview.net
Even when massively overparameterized, deep neural networks show a remarkable ability to generalize. Research on this phenomenon has focused on generalization within distribution, via smooth...
0 · 19 · 0
Check out my boy @dmkrash presenting our “outstanding paper award” winner at the Actionable Interpretability workshop today!
Check out my posters today if you're at ICML!
1) Detecting high-stakes interactions with activation probes — Outstanding paper @ Actionable Interp workshop, 10:40-11:40.
2) LLMs’ activations linearly encode training-order recency — Best paper runner-up @ MemFM workshop, 2:30-3:45.
0 · 4 · 20
While you still can, snatch this prodigy undergrad for your lab when he applies for PhDs this fall!
Thank you to everyone who swung by our poster presentation!!! So many engaging conversations today. #ICML2025
0 · 0 · 14
Submit to our workshop on contextualizing CogSci approaches for understanding neural networks: "Cognitive Interpretability"!
We’re excited to announce the first workshop on CogInterp: Interpreting Cognition in Deep Learning Models @ NeurIPS 2025! 📣 How can we interpret the algorithms and representations underlying complex behavior in deep learning models? 🌐 1/
0 · 7 · 22
RT @Cohere_Labs: Don't forget to tune in tomorrow, July 10th for a session with @EkdeepL on "Rational Analysis of In-Context Learning Elici…
0 · 4 · 0