Stanford AI Lab
@StanfordAILab
Followers
218K
Following
701
Media
70
Statuses
3K
The Stanford Artificial Intelligence Laboratory (SAIL), a leading #AI lab since 1963. ⛵️🤖 Emmy-winning video: https://t.co/lV9smZTC1m
Stanford, CA
Joined November 2018
I'm recruiting multiple PhD students for Fall 2026 in Computer Science at @JHUCompSci 🍂 Apply to work on AI for social sciences/human behavior, social NLP, and LLMs for real-world applied domains you're passionate about! Learn more https://t.co/KbTJevMb8J & help spread the word!
14
154
655
New eval! Code duels for LMs ⚔️ Current evals test LMs on *tasks*: "fix this bug," "write a test" But we code to achieve *goals*: maximize revenue, cut costs, win users Meet CodeClash: LMs compete via their codebases across multi-round tournaments to achieve high-level goals
26
89
364
Abaxx Technologies (a name we own in two of our funds) joined the MSCI Canada Index on November 5, marking a key milestone for this fast-growing digital commodities platform. Positioned as a next-generation exchange and clearinghouse for LNG, carbon, and precious metals, Abaxx
1
0
1
We've raised $100M from Kleiner Perkins, Index Ventures, Lightspeed, and NVIDIA. Today we're introducing Sonic-3 - the state-of-the-art model for realtime conversation. What makes Sonic-3 great: - Breakthrough naturalness - laughter and full emotional range - Lightning fast -
1K
1K
8K
Millions of children need help with speech, but there are far too few clinicians. Want to know if AI can responsibly bridge this gap? Check out our EMNLP'25 paper Finetuning and Comprehensive Evaluation of Language Models for Speech Pathology https://t.co/56NCKisT24 🧵
4
11
27
Millions of children face speech disorders—but few get timely care. Our new benchmark, SLP-Helm, tests how AI models diagnose pediatric speech—revealing promises, pitfalls, and bias. Read more on our blog w/ @sangttruong, @nickhaber, @sanmikoyejo, @stai_research
ai.stanford.edu
We introduce SLPHelm, the first-ever benchmark for AI in speech-language pathology. Testing 15 models on 5 diagnostic tasks revealed that today's AI isn't ready for clinical use—but targeted fine-t...
3
8
19
apparently the bubble in people searching for ‘AI bubble’ has burst
12
3
100
Ctrl-World is a controllable world model that generalizes zero-shot to new environments, cameras, and objects. Paper: https://t.co/Bog5hk0h28 Model & code: https://t.co/3OFHRAXv2O The results are exciting — a short thread on why. 🧵
Rollouts in the real world are slow and expensive. What if we could rollout trajectories entirely inside a world model (WM)? Introducing 🚀Ctrl-World🚀, a generative manipulation WM that can interact with advanced VLA policy in imagination. 🧵1/6
9
37
362
How can we help LLMs move beyond the obvious toward generating more creative and diverse ideas? In our new TACL paper, we propose a novel approach to enhance LLM creative generation! https://t.co/AFCpQddN6j
@ChenShani2 @GabiStanovsky @jurafsky @HyadataLab @stanfordnlp @nlphuji
6
25
83
Most models watch every frame - T* learns to search. In our latest blog post, we show how T* rethinks long-form video understanding as temporal search, finding the “needles” in long video haystacks with just a few key frames.
ai.stanford.edu
We propose a more efficient way to locate the
3
2
10
New work! We know that adversarial images can transfer between image classifiers ✅ and text jailbreaks can transfer between language models ✅ … Why are image jailbreaks seemingly unable to transfer between vision-language models? ❌ We might know why… 🧵
7
10
65
📣 Stanford researchers are organizing the first-of-its-kind conference, where AI agents are the primary authors and reviewers. Join the #agents4science online conference on Oct. 22:
agents4science.stanford.edu
The 1st Open Conference where AI serves as both primary authors and reviewers of research papers.
We experimented with using LLM reviewers for #Agents4Science Each paper was assessed by 3 LLM reviewers, w/ reference + code checks. Top scorers are then checked by human experts. All reviews are public https://t.co/1lQs5LCn5d Join our conf to learn more
2
21
53
As of June 2025, 66% of Americans have never used ChatGPT. Our new position paper, Attention to Non-Adopters, explores why this matters: LLM research is being shaped around adopters, leaving non-adopters’ needs and key research opportunities behind. https://t.co/YprwsthysY
1
36
81
A @Stanford study reveals that leading AI companies are pulling user conversations for training. Should users of AI chatbots worry about their privacy?
hai.stanford.edu
A Stanford study reveals that leading AI companies are pulling user conversations for training, highlighting privacy risks and a need for clearer policies.
7
11
22
Look who’s talking now! $KOID-bot speaks to @cvpayne for the first time in public for a segment about why 2025 could be the year of humanoid robots. The KOID-bot joined Jonathan Krane, CEO of KraneShares, and Teddy Haggerty, CEO of @RobostoreUS, who discussed the current state of
17
8
11
📅 Wednesday, October 22 🎤 9:00 am: AI Agents for Biomedical Discovery: Specialized Builds and General Methods 💬 @hcwww_ 🏫 @StanfordAILab, @genentech 🎤 10:00 am: A General-Purpose Biomedical AI Agent 💬 @KexinHuang5 🏫 @Stanford Computer Science @broadinstitute
3
10
18
If Prometheus could choose again, he would steal AI from the gods. It is the most profound innovation in history. Yet, unlike fire, the nature of AI is still ours to choose. Our best path forward is to center relationships – optimizing AI for interdependence.
4
4
27