Stefano Ermon

@StefanoErmon

Followers: 24K
Following: 1K
Media: 22
Statuses: 482

AI Prof @Stanford | CEO & Cofounder @_inception_ai | Co-inventor of DDIM, FlashAttention, DPO, GAIL, and score-based/diffusion models

Stanford, CA
Joined February 2013
@yashasannadani
Yashas Annadani ✈️ NeurIPS2025
5 days
Excited to present our work on multi-objective scientific discovery at #NeurIPS2025! 🎉 We present Preference-Guided Diffusion, a novel method to sample diverse designs from a diffusion model that corresponds to the Pareto Front of black-box objectives. Most (if not all)
0
6
18
@StefanoErmon
Stefano Ermon
16 days
Congratulations!! This is amazing
@aliniikk
Ali Ansari
17 days
Today I’m excited to introduce micro1 Intelligence, the world’s most advanced platform for training frontier AI models. Achieving AGI is bottlenecked by one main thing: high-quality data. Data based on real-world environments that capture human expert workflows, complex
7
6
74
@_inception_ai
Inception
18 days
Mercury is now available on Azure AI Foundry! This means you can leverage Mercury's speeds with the security of a private Azure instance and all the features of the broader Azure ecosystem. Read more: https://t.co/6Hotow3ruD #dLLM #AzureAI
5
7
34
@askalphaxiv
alphaXiv
19 days
We just raised a $7M Seed round co-led by @MenloVentures and @haystackvc with participation from @Shakti_VC, @conviction and @upfrontvc 🚀 We're honored to have the support of incredible angels including @ericschmidt, @SebastianThrun, @sarahookr Join us: https://t.co/IKwK8KsG96
47
55
631
@appenz
Guido Appenzeller
28 days
Diffusion LLM + Agents are 🔥 This is @_inception_ai's Diffusion LLM with @huggingface SmolAgents: - Planning tool use - Executing 20 web searches and parsing results - Synthesizing the data All in 3.5 seconds. With 10 searches it took only 1.6 seconds. Source on GitHub below.
8
20
119
@StefanoErmon
Stefano Ermon
1 month
We just shipped a major Mercury refresh. ⚡ Best-in-class quality at up to 10× lower latency. Still the only commercial diffusion LLM in the world. Try the new model.
@_inception_ai
Inception
1 month
Mercury is refreshed – with across-the-board improvements in coding, instruction following, math, and knowledge recall. Start building responsive, in-the-flow AI solutions! Read more: https://t.co/QyTVaHAIue
6
17
131
@_inception_ai
Inception
1 month
Mercury runs five times faster than Claude 4.5 Haiku at less than one-fourth the price, while maintaining higher quality.
0
7
45
@StefanoErmon
Stefano Ermon
1 month
When we began applying diffusion to language in my lab at Stanford, many doubted it could work. That research became Mercury diffusion LLM: 10X faster, more efficient, and now the foundation of @_inception_ai. Proud to raise $50M with support from top investors.
@_inception_ai
Inception
1 month
Today’s LLMs are painfully slow and expensive. They are autoregressive and spit out words sequentially. One. At. A. Time. Our dLLMs generate text in parallel, delivering answers up to 10X faster. Now we’ve raised $50M to scale them. Full story from @russellbrandom in
40
82
1K
@_inception_ai
Inception
1 month
Today’s LLMs are painfully slow and expensive. They are autoregressive and spit out words sequentially. One. At. A. Time. Our dLLMs generate text in parallel, delivering answers up to 10X faster. Now we’ve raised $50M to scale them. Full story from @russellbrandom in
techcrunch.com
Diffusion models already power AI image generators, but Inception thinks they can be even more powerful applied in software development.
15
48
435
@StefanoErmon
Stefano Ermon
1 month
Tired of chasing references across dozens of papers? This monograph distills it all: the principles, intuition, and math behind diffusion models. Thrilled to share!
@JCJesseLai
Chieh-Hsin (Jesse) Lai
1 month
Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core
13
136
1K
@StefanoErmon
Stefano Ermon
1 month
🚀 Excited to see ProxyAI now powered by Mercury Coder! If you’re using JetBrains IDEs, give it a try: lightning-fast autocomplete, next-edit, and apply-edit powered by diffusion LLMs
@_inception_ai
Inception
1 month
🚀We've partnered with ProxyAI! Our Mercury Coder dLLM is now the default for ProxyAI's autocomplete, next edit, and auto apply tooling, providing developers with lightning-fast and accurate code edits. Read more: https://t.co/eWtOFVXpgk #AI #DiffusionModels #dLLM
0
1
8
@_inception_ai
Inception
1 month
🚀We've partnered with ProxyAI! Our Mercury Coder dLLM is now the default for ProxyAI's autocomplete, next edit, and auto apply tooling, providing developers with lightning-fast and accurate code edits. Read more: https://t.co/eWtOFVXpgk #AI #DiffusionModels #dLLM
tryproxy.io
1
2
13
@krandiash
Karan Goel
1 month
We've raised $100M from Kleiner Perkins, Index Ventures, Lightspeed, and NVIDIA. Today we're introducing Sonic-3 - the state-of-the-art model for realtime conversation. What makes Sonic-3 great: - Breakthrough naturalness - laughter and full emotional range - Lightning fast -
1K
1K
8K
@mittu1204
Yuki Mitsufuji
2 months
Ok, I wasn’t aware J2C didn’t exist before the announcement. Happy to receive one for our #TMLR paper, led by Naoki Murata (@smiurtitkii), on guidance for discrete diffusion! Title: G2D2: Gradient-Guided Discrete Diffusion for Inverse Problem Solving pdf:
openreview.net
Recent literature has effectively leveraged diffusion models trained on continuous variables as priors for solving inverse problems. Notably, discrete diffusion models with discrete latent codes...
@hugo_larochelle
Hugo Larochelle
2 months
We at TMLR are proud to announce that selected papers will now be eligible for an opportunity to present at the joint NeurIPS/ICML/ICLR Journal-to-Conference (J2C) Track: https://t.co/CyjZtqbnBS
0
4
11
@_inception_ai
Inception
2 months
We’re in! We are now part of the #AWSGenAIAccelerator2025. We’re looking forward to working with @AWSstartups to help us deliver ultra-fast and efficient diffusion large language models.
0
4
10
@StefanoErmon
Stefano Ermon
2 months
Editing workflows are a perfect fit for diffusion LLMs. Apply-Edit in Mercury Coder shows why. GPT-5 quality, 46x faster. 🚀
@_inception_ai
Inception
2 months
Mercury Coder now supports Apply-Edit capabilities, providing quality on par with GPT-5 at speeds 46x faster!
1
14
113
@aliniikk
Ali Ansari
3 months
I’m excited to announce micro1 has raised a $35M Series A, valuing us at $500M. This round was led by 01A with @adambain joining our board of directors. We’re grateful to be partnering with leading AI Labs & fortune 10s, such as Microsoft, to train frontier LLMs. We’re just
253
251
2K