Stefano Ermon
@StefanoErmon
Followers
24K
Following
1K
Media
22
Statuses
482
AI Prof @Stanford | CEO & Cofounder @_inception_ai | Co-inventor of DDIM, FlashAttention, DPO, GAIL, and score-based/diffusion models
Stanford, CA
Joined February 2013
Excited to present our work on multi-objective scientific discovery at #NeurIPS2025! 🎉 We present Preference-Guided Diffusion, a novel method to sample diverse designs from a diffusion model that corresponds to the Pareto Front of black-box objectives. Most (if not all)
0
6
18
Congratulations!! This is amazing
Today I’m excited to introduce micro1 Intelligence, the world’s most advanced platform for training frontier AI models. Achieving AGI is bottlenecked by one main thing: high-quality data. Data based on real-world environments that capture human expert workflows, complex
7
6
74
Today I’m excited to introduce micro1 Intelligence, the world’s most advanced platform for training frontier AI models. Achieving AGI is bottlenecked by one main thing: high-quality data. Data based on real-world environments that capture human expert workflows, complex
83
141
764
Mercury is now available on Azure AI Foundry! This means you can leverage Mercury's speeds with the security of a private Azure instance and all the features of the broader Azure ecosystem. Read more: https://t.co/6Hotow3ruD
#dLLM #AzureAI
5
7
34
We just raised a $7M Seed round co-led by @MenloVentures and @haystackvc with participation from @Shakti_VC, @conviction and @upfrontvc 🚀 We're honored to have the support of incredible angels including @ericschmidt, @SebastianThrun, @sarahookr Join us: https://t.co/IKwK8KsG96
47
55
631
Diffusion LLM + Agents are 🔥 This is @_inception_ai's Diffusion LLM with @huggingface SmolAgents: - Planning tool use - Executing 20 web searches and parsing results - Synthesizing the data All in 3.5 seconds. With 10 searches it took only 1.6 seconds. Source on GitHub below.
8
20
119
We just shipped a major Mercury refresh. ⚡ Best-in-class quality at up to 10× lower latency. Still the only commercial diffusion LLM in the world. Try the new model.
Mercury is refreshed – with across-the-board improvements in coding, instruction following, math, and knowledge recall. Start building responsive, in-the-flow AI solutions! Read more: https://t.co/QyTVaHAIue
6
17
131
Mercury runs five times faster than Claude 4.5 Haiku at less than one-fourth the price, while maintaining higher quality.
0
7
45
When we began applying diffusion to language in my lab at Stanford, many doubted it could work. That research became Mercury diffusion LLM: 10X faster, more efficient, and now the foundation of @_inception_ai. Proud to raise $50M with support from top investors.
Today’s LLMs are painfully slow and expensive. They are autoregressive and spit out words sequentially. One. At. A. Time. Our dLLMs generate text in parallel, delivering answers up to 10X faster. Now we’ve raised $50M to scale them. Full story from @russellbrandom in
40
82
1K
Today’s LLMs are painfully slow and expensive. They are autoregressive and spit out words sequentially. One. At. A. Time. Our dLLMs generate text in parallel, delivering answers up to 10X faster. Now we’ve raised $50M to scale them. Full story from @russellbrandom in
techcrunch.com
Diffusion models already power AI image generators, but Inception thinks they can be even more powerful applied in software development.
15
48
435
Tired of chasing references across dozens of papers? This monograph distills it all: the principles, intuition, and math behind diffusion models. Thrilled to share!
Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core
13
136
1K
Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core
47
453
2K
🚀 Excited to see ProxyAI now powered by Mercury Coder! If you’re using JetBrains IDEs, give it a try: lightning-fast autocomplete, next-edit, and apply-edit powered by diffusion LLMs
🚀We've partnered with ProxyAI! Our Mercury Coder dLLM is now the default for ProxyAI's autocomplete, next edit, and auto apply tooling, providing developers with lightning-fast and accurate code edits. Read more: https://t.co/eWtOFVXpgk
#AI #DiffusionModels #dLLM
0
1
8
🚀We've partnered with ProxyAI! Our Mercury Coder dLLM is now the default for ProxyAI's autocomplete, next edit, and auto apply tooling, providing developers with lightning-fast and accurate code edits. Read more: https://t.co/eWtOFVXpgk
#AI #DiffusionModels #dLLM
tryproxy.io
1
2
13
We've raised $100M from Kleiner Perkins, Index Ventures, Lightspeed, and NVIDIA. Today we're introducing Sonic-3 - the state-of-the-art model for realtime conversation. What makes Sonic-3 great: - Breakthrough naturalness - laughter and full emotional range - Lightning fast -
1K
1K
8K
Ok, I wasn’t aware J2C didn’t exist before the announcement. Happy to receive one for our #TMLR paper, led by Naoki Murata (@smiurtitkii), on guidance for discrete diffusion! Title: G2D2: Gradient-Guided Discrete Diffusion for Inverse Problem Solving pdf:
openreview.net
Recent literature has effectively leveraged diffusion models trained on continuous variables as priors for solving inverse problems. Notably, discrete diffusion models with discrete latent codes...
We at TMLR are proud to announce that selected papers will now be eligible for an opportunity to present at the joint NeurIPS/ICML/ICLR Journal-to-Conference (J2C) Track: https://t.co/CyjZtqbnBS
0
4
11
We’re in! We are now part of the #AWSGenAIAccelerator2025. We’re looking forward to working with @AWSstartups to help us deliver ultra-fast and efficient diffusion large language models.
0
4
10
Mercury Coder now supports Apply-Edit capabilities, providing quality on par with GPT-5 at speeds 46x faster!
3
12
53
I’m excited to announce micro1 has raised a $35M Series A, valuing us at $500M. This round was led by 01A with @adambain joining our board of directors. We’re grateful to be partnering with leading AI Labs & fortune 10s, such as Microsoft, to train frontier LLMs. We’re just
253
251
2K