
Joel Jang (@jang_yoel)
2K Followers · 2K Following · 50 Media · 369 Statuses
Senior Research Scientist @nvidiaai GEAR Lab, world modeling lead. On leave from PhD at @uwcse
Seattle, US · Joined March 2021
Introducing DreamGen! We got humanoid robots to perform totally new verbs in new environments through video world models. We believe video world models will solve the data problem in robotics. Bringing the paradigm of scaling human hours to GPU hours. Quick 🧵
The rise of humanoid platforms presents new opportunities and unique challenges. Join @yukez at #CoRL2025 as he shares the latest research on robot foundation models and presents new updates with the #NVIDIAIsaac GR00T platform. Learn more: https://t.co/LrzONs1Gzc
Full episode dropping soon! Geeking out with @jang_yoel on DreamGen - Unlocking Generalization in Robot Learning through Video World Models https://t.co/4GkmxHMqSW Co-hosted by @chris_j_paxton @micoolcho
World modeling for robotics is incredibly hard because (1) control of humanoid robots & 5-finger hands is wayyy harder than arrow-key control (⬆️⬇️⬅️➡️) in games (Genie 3); and (2) object interaction is much more diverse than FSD, which needs to *avoid* coming into contact. Our GR00T Dreams work was
What if robots could dream inside a video generative model? Introducing DreamGen, a new engine that scales up robot learning not with fleets of human operators, but with digital dreams in pixels. DreamGen produces massive volumes of neural trajectories - photorealistic robot
A humanoid robot policy trained solely on synthetic data generated by a world model. Research Scientist Joel Jang presents NVIDIA's DreamGen pipeline:
- Post-train the world model Cosmos-Predict2 with a small set of real teleoperation demos.
- Prompt the world model to
I've been a bit quiet on X recently. The past year has been a transformational experience. Grok-4 and Kimi K2 are awesome, but the world of robotics is a wondrous wild west. It feels like NLP in 2018 when GPT-1 was published, along with BERT and a thousand other flowers that
Check out Cosmos-Predict2, a new SOTA video world model trained specifically for Physical AI (powering GR00T Dreams & DreamGen)!
We build Cosmos-Predict2 as a world foundation model for Physical AI builders: fully open and adaptable. Post-train it for specialized tasks or different output types. Available in multiple sizes, resolutions, and frame rates. Watch the repo walkthrough
GR00T Dreams code is live! NVIDIA GEAR Lab's open-source solution for robotics data via video world models. Fine-tune on any robot, generate 'dreams', extract actions with IDM, and train visuomotor policies with LeRobot datasets (GR00T N1.5, SmolVLA). https://t.co/7Fndn7zDJB
github.com
Nvidia GEAR Lab's initiative to solve the robotics data problem using world models - NVIDIA/GR00T-Dreams
How do we improve VLA generalization? Last week we upgraded #NVIDIA GR00T N1.5 with minor VLM tweaks, FLARE, and richer data mixtures (DreamGen, etc.) ✨. N1.5 yields better language following: post-trained on the unseen Unitree G1 with 1K trajectories, it follows commands on
Introducing Cosmos-Predict2! Our most powerful open video foundation model for Physical AI. Cosmos-Predict2 significantly improves upon Predict1 in visual quality, prompt alignment, and motion dynamics, outperforming popular open-source video foundation models. It's openly
Assuming that we need ~2 trillion tokens to get to a robot GPT, how can we get there? I went through a few scenarios looking at how we can combine simulation data, human video data, and the size of existing robot fleets. Some assumptions:
- We probably need some real
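The scenario arithmetic in the tweet above can be sketched as a small back-of-envelope calculation. Every number below (the per-source token yields and the 10/30/60 data mixture) is a hypothetical placeholder for illustration, not a figure from the thread; only the ~2-trillion-token target comes from the tweet.

```python
# Back-of-envelope: hours of data needed per source to reach a token target.
# All rates and mixture shares are illustrative assumptions.

TARGET_TOKENS = 2e12  # ~2 trillion tokens for a "robot GPT" (from the tweet)

# Assumed token yield per hour of data, by source (hypothetical rates).
TOKENS_PER_HOUR = {
    "real_teleop": 1e6,   # real robot teleoperation, expensive to collect
    "human_video": 5e5,   # egocentric human video, cheaper but less direct
    "simulation": 2e6,    # simulation / world-model rollouts, cheap to scale
}

def hours_needed(source: str, share: float) -> float:
    """Hours of `source` data needed to cover `share` of the token target."""
    return TARGET_TOKENS * share / TOKENS_PER_HOUR[source]

# One hypothetical mixture: 10% real, 30% human video, 60% synthetic.
mix = {"real_teleop": 0.10, "human_video": 0.30, "simulation": 0.60}
for source, share in mix.items():
    print(f"{source}: {hours_needed(source, share):,.0f} hours")
```

Even under these made-up rates, the real-teleop slice alone works out to hundreds of thousands of operator hours, which is the gap the thread argues synthetic "GPU hours" can fill.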
🔥 ReAgent-V released! A unified video framework with reflection and reward-driven optimization. ✨ Real-time self-correction. ✨ Triple-view reflection. ✨ Auto-selects high-reward samples for training.
Giving a talk about GR00T N1, GR00T N1.5, and GR00T Dreams at NVIDIA GTC Paris, 06.11, 2PM - 2:45PM CEST. If you are at VivaTech in Paris, please stop by the "An Introduction to Humanoid Robotics" session!
Are you curious about #humanoidrobotics? Join our experts at #GTCParis for a deep dive into the #NVIDIAIsaac GR00T platform and its four pillars:
- Robot foundation models for cognition and control
- Simulation frameworks built on @nvidiaomniverse and #NVIDIACosmos
- Data
Representation also matters for VLA models! Introducing FLARE: Robot Learning with Implicit World Modeling. With a future latent alignment objective, FLARE significantly improves policy performance on multitask imitation learning & unlocks learning from egocentric human videos.
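A minimal sketch of what a future-latent-alignment-style objective could look like, assuming FLARE aligns latents predicted by the policy with embeddings of future observations. The cosine-based loss, the shapes, and the variable names here are illustrative assumptions, not the paper's actual formulation.

```python
import numpy as np

def cosine_alignment_loss(pred_latents, future_latents, eps=1e-8):
    """1 - mean cosine similarity between predicted and future latents.

    Zero when the policy's predicted latents point exactly at the
    future-observation embeddings; larger when they disagree.
    """
    pred = pred_latents / (np.linalg.norm(pred_latents, axis=-1, keepdims=True) + eps)
    target = future_latents / (np.linalg.norm(future_latents, axis=-1, keepdims=True) + eps)
    return 1.0 - float(np.mean(np.sum(pred * target, axis=-1)))

batch, dim = 4, 32
rng = np.random.default_rng(0)
z_pred = rng.normal(size=(batch, dim))    # latents predicted by the policy
z_future = rng.normal(size=(batch, dim))  # embeddings of future frames

print(cosine_alignment_loss(z_pred, z_future))  # roughly 1 for uncorrelated latents
print(cosine_alignment_loss(z_pred, z_pred))    # near 0 when perfectly aligned
```

The key design point the tweet hints at: because the target is a *latent* of a future frame rather than an action label, the same objective can be computed on action-free egocentric human video.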
NVIDIA also announced DreamGen, a new engine that scales robot learning with digital dreams. It produces large volumes of photorealistic robot videos (using video models) paired with motor action labels and unlocks generalization to new environments. https://t.co/rWTboFmM7z
NVIDIA has published a paper on DREAMGEN: a powerful 4-step pipeline for generating synthetic data for humanoids that enables task and environment generalization.
- Step 1: Fine-tune a video generation model using a small number of human teleoperation videos
- Step 2: Prompt
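Pulling together the pipeline described across these tweets (fine-tune a video world model on teleop demos, prompt it to dream new rollouts, label actions with an inverse dynamics model, then train a policy), the data flow can be sketched as below. Every class and function here is a stand-in to show the four stages, not the real implementation in the GR00T-Dreams repo.

```python
# Hypothetical sketch of the four-step DREAMGEN-style pipeline.
# All components are toy stubs that only demonstrate the data flow.

class WorldModel:
    def __init__(self, demos):
        self.demos = demos                    # Step 1: "fine-tuned" on demos

    def generate(self, prompt):
        return f"dream:{prompt}"              # Step 2: dreamed video rollout

def idm_actions(dream):
    # Step 3: an inverse dynamics model recovers pseudo-action labels
    # from the dreamed video (stubbed as placeholder strings).
    return [f"action<{dream}:{t}>" for t in range(3)]

def train_policy(trajectories):
    # Step 4: train a visuomotor policy on the synthetic trajectories.
    return {"policy_trained_on": len(trajectories)}

def dreamgen_pipeline(teleop_demos, task_prompts):
    world_model = WorldModel(teleop_demos)                    # Step 1
    dreams = [world_model.generate(p) for p in task_prompts]  # Step 2
    trajectories = [(d, idm_actions(d)) for d in dreams]      # Step 3
    return train_policy(trajectories)                         # Step 4

print(dreamgen_pipeline(["demo1"], ["pour water", "fold shirt"]))
# → {'policy_trained_on': 2}
```

The point of the structure: only Step 1 consumes scarce human teleoperation data; Steps 2-4 scale with compute, which is the "human hours to GPU hours" trade the thread describes.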
It's not a matter of if, it's a matter of when: video models and world models are going to be a central tool for building robot foundation models.