
Subham Sahoo
@ssahoo_
Followers: 708
Following: 592
Media: 23
Statuses: 266
PhD candidate @cornell working on Diffusion Language Models. Previously @GoogleAI, @IITKgp.
New York, USA
Joined June 2010
RT @jiqizhixin: A Survey on Diffusion Language Models. ⚡ Diffusion Language Models (DLMs) are faster inference, bidirectional context, fine….
0
132
0
RT @jwthickstun: Is this a claim about regularization? We've known since the Penn Treebank era that we can squeeze more out of limited data….
0
1
0
Honored to see MDLM featured in the tutorial 😊.
Diffusion LLMs are promising ways to overcome the limitations of autoregressive LLMs. Less error propagation, easier to control, and faster to sample! But how do Diffusion LLMs actually work? 🤔 Let's explore some ideas on this fascinating topic!
0
2
18
📢 @BytedanceTalk just dropped their diffusion LLM!!! And boy it's fast 💨. From their technical report, it seems like they are using MDLM (my research) 😊.
0
0
29
RT @ArashVahdat: 📢 Excited to announce that GenMol is now open-sourced. GenMol: A Drug Discovery Generalist with Discrete Diffusion.Paper….
0
30
0
📢 Duo and Eso-LMs at 2B scale on Slim Pajama. These models will finish training in a few days. While HF release may take time due to corporate red tape, we'll try providing early access case-by-case. Email susahoo@nvidia.com with the subject “Early access”. Duo:.
0
1
20
Attending ICML ✈️ Tues-Fri to present "The Diffusion Duality". 🗓️ Wed, July 16 @ 4:30pm. 📍 East Exhibition Hall A-B (E-3003). DM if you want to chat about diffusion LMs, or my current work on Duality or Esoteric LMs!
🚨 “The Diffusion Duality” is out! @ICML2025 ⚡️ Few-step generation in discrete diffusion language models by exploiting the underlying Gaussian diffusion. 🦾 Beats AR on 3/7 zero-shot likelihood benchmarks. 📄 Paper: 💻 Code: 🧠
1
17
157
Ouch, my ego took a hit. Chemistry is a subject that can be gamed with rote learning, yet surprisingly, Gemini performs worse in it than in physics and math.
AI now beats every single human in the hardest college entrance exam in India, the IIT JEE. Bytedance silently published this result this week. The top scorer was Rajit Gupta with 332/360, but Google's Gemini 2.5 Pro was at rank 1 with 336/360.
0
0
2
RT @EthanEvansVP: I screwed over one of my top engineers when I was a Senior Manager at Amazon. He felt betrayed, found another job, and re….
0
748
0
RT @Machinelearrn: 🌟 Esoteric Language Models: hybrid AR+MDM language models. Eso-LM ( - is a new class of language….
0
1
0
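RT @daizhe9898: Another major breakthrough in academia! Researchers from Cornell University, CMU, and other institutions have jointly proposed Esoteric Language Models (Eso-LMs), an innovative language modeling framework that amounts to a bold reinvention in the field of language models. Eso -…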
0
1
0
RT @dreamingtulpa: The Diffusion Duality. few-step generation in discrete diffusion language models via the underlying gaussian diffusion h….
0
4
0