ChrisGPotts Profile Banner
Christopher Potts Profile
Christopher Potts

@ChrisGPotts

Followers
14K
Following
2K
Media
116
Statuses
2K

Stanford Professor of Linguistics and, by courtesy, of Computer Science. Member of technical staff @stanfordnlp and @StanfordAILab. Co-founder @ Bigspin AI.

Joined November 2011
Don't wanna be here? Send us removal request.
@Theoryvc
Theory Ventures
1 day
Every week brings a new agent framework or model. It's exciting, but founders need to know: what’s actually being used in production? We asked 413 senior builders how they’re adopting AI today, where adoption breaks, and where the real gaps are. Interactive findings ↓
6
6
31
@ChrisGPotts
Christopher Potts
16 hours
Hat-tip to @vennemeyerd and colleagues for this insightful and timely work!
0
0
0
@ChrisGPotts
Christopher Potts
16 hours
My full talk:
0
3
2
@ChrisGPotts
Christopher Potts
16 hours
Just after the above clip, I consider the case of extreme sycophancy in 4o. Here's a lovely new paper applying mech-interp methods to the sycophancy problem:
Tweet card summary image
arxiv.org
Large language models (LLMs) often exhibit sycophantic behaviors -- such as excessive agreement with or flattery of the user -- but it is unclear whether these behaviors arise from a single...
2
0
2
@ChrisGPotts
Christopher Potts
16 hours
Interpretability research has made only minor contributions to AI safety so far. What can we do to change that? (Clip from a longer talk; YouTube link in the thread):
@ChrisGPotts
Christopher Potts
3 days
The Anthropic perspective on interpretability is prominent and significant, but not inevitable. My own take is quite different. (Clip from a talk I gave; YouTube link in the thread):
4
10
105
@stanfordnlp
Stanford NLP Group
2 days
🌲 The Stanford School of Neural Network Interpretability 🌲
@ChrisGPotts
Christopher Potts
3 days
The Anthropic perspective on interpretability is prominent and significant, but not inevitable. My own take is quite different. (Clip from a talk I gave; YouTube link in the thread):
4
7
66
@ChrisGPotts
Christopher Potts
3 days
The Anthropic perspective on interpretability is prominent and significant, but not inevitable. My own take is quite different. (Clip from a talk I gave; YouTube link in the thread):
@ChrisGPotts
Christopher Potts
5 days
Severance as a show about interpretability research in AI (a clip from a talk; YouTube link just below):
3
25
198
@lateinteraction
Omar Khattab
4 days
I just learned that the one-of-a-kind Herumb Shandilya @krypticmouse, who is soon to graduate from Stanford with Master’s in CS, is on the job market for training infra, inference infra, or other ML engineering roles. Here’s a PSA to y’all: you should hire him.
6
9
51
@pretendsmarts
Andy Wojcicki
15 days
We went from papers and no code, through PapersWithCode, to end up with ...
0
1
5
@tpimentelms
Tiago Pimentel
4 days
I'd also recommend working with @kmahowald if you can :) It doesn't just lead to award-winning papers, but it's always a great experience in itself!
@ChrisGPotts
Christopher Potts
8 days
If you are interested in winning an ACL paper award, it's a very smart move is to write something with @kmahowald. Historically, he has won at least one each year for as long as I can remember.
0
1
7
@ChrisGPotts
Christopher Potts
5 days
Link to the full talk:
1
1
14
@ChrisGPotts
Christopher Potts
5 days
Severance as a show about interpretability research in AI (a clip from a talk; YouTube link just below):
7
26
217
@ChrisGPotts
Christopher Potts
6 days
I am delighted to be collaborating with both Alexa (@ARTartaglini) and Siri (@srihita_raju).
1
1
33
@ElisaKreiss
Elisa Kreiss
7 days
I'm recruiting PhD students this year with strong interest in NLP @UCLA! If you are interested in understanding how we use language to effectively communicate, and how those insights translate to NLP models, apply to join the @CoalasLab. Continue reading for details! (1/5)
11
100
529
@ChrisGPotts
Christopher Potts
8 days
If you are interested in winning an ACL paper award, it's a very smart move is to write something with @kmahowald. Historically, he has won at least one each year for as long as I can remember.
@kmahowald
Kyle Mahowald
8 days
Delighted Sasha's work using mech interp to study complex syntax constructions won an Outstanding Paper Award at EMNLP! And delighted the ACL community continues to recognize unabashedly linguistic topics like filler-gaps, and the huge potential for LMs to inform such topics!
2
15
109
@kmahowald
Kyle Mahowald
8 days
Delighted Sasha's work using mech interp to study complex syntax constructions won an Outstanding Paper Award at EMNLP! And delighted the ACL community continues to recognize unabashedly linguistic topics like filler-gaps, and the huge potential for LMs to inform such topics!
@SashaBoguraev
Sasha Boguraev
6 months
A key hypothesis in the history of linguistics is that different constructions share underlying structure. We take advantage of recent advances in mechanistic interpretability to test this hypothesis in Language Models. New work with @kmahowald and @ChrisGPotts! 🧵👇
1
22
90
@LaudeInstitute
Laude Institute
8 days
Meet Slingshots // One. This inaugural batch includes leading-edge researchers advancing the science and practice of AI - with benchmarks, frameworks, and agents that ship real impact into the world. We're honored to support research from: @alexgshaw @Mike_A_Merrill
2
17
62
@sarahwiegreffe
Sarah Wiegreffe
10 days
I am recruiting 2 PhD students to work on LM interpretability at UMD @umdcs starting in fall 2026! We are #3 in AI and #4 in NLP research on @CSrankings. Come join us in our lovely building just a few miles from Washington, D.C. Details in 🧵
14
171
776
@JulieKallini
Julie Kallini ✨
12 days
In a last-minute change of events, I won’t be attending #EMNLP2025 in person. Still, I’m excited to share our poster for our paper, False Friends! https://t.co/gBFnoCbgRE
@JulieKallini
Julie Kallini ✨
2 months
New paper! 🌈 In English, pie = 🥧. In Spanish, pie = 🦶. Multilingual tokenizers often share such overlapping tokens between languages. Do these “False Friends” hurt or help multilingual LMs? We find that overlap consistently improves transfer—even when it seems misleading. 🧵
0
5
51