Chuanruo Ning @TritiumAc X Profile

Chuanruo Ning

@TritiumAc

Followers

91

Following

269

Media

5

Statuses

10

PhD student at Cornell working on leveraging 3D vision for robot manipulation. Previously at Peking University

https://t.co/Px9aGoIfWx

Joined April 2022

Don't wanna be here? Send us removal request.

Kushal

@kushalk_

4 months

Teleoperation is slow, expensive, and difficult to scale. So how can we train our robots instead? Introducing X-Sim: a real-to-sim-to-real framework that trains image-based policies 1) learned entirely in simulation 2) using rewards from human videos. https://t.co/5yt2iTFYF4

4

42

114

Chuanruo Ning

@TritiumAc

5 months

Check out our paper and project page for more information. Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins 📄 https://t.co/rQhV0uIETo 🌐 https://t.co/U3m6v1oSCu 💻 https://t.co/tpXOnRMgLP Huge thanks to my advisors: @KuanFang and

0

2

Chuanruo Ning

@TritiumAc

5 months

By explicitly modeling dynamics with an interactive digital twin, our method significantly outperforms baselines that directly prompt or fine-tune the VLM to handle both semantics and physics.

1

0

1

Chuanruo Ning

@TritiumAc

5 months

Our method enables the robot to perform diverse, previously unseen manipulation tasks, involving 6-DoF manipulation, tool use, and precise manipulation.

1

0

1

Chuanruo Ning

@TritiumAc

5 months

To robustly model the dynamics and provide informative inputs to the VLM, we build the digital twin from a real scene scan using a hybrid representation: Meshes for physics simulation Gaussians for photorealistic rendering

1

2

Chuanruo Ning

@TritiumAc

5 months

At the core of our method, we integrate a pretrained VLM with an interactive digital twin in a Model Predictive Control paradigm. 🌏 The digital twin serves as a dynamic model to predict the outcomes of different actions. 🏆 The VLM evaluates and selects the best action sequence

1

0

1

Chuanruo Ning

@TritiumAc

5 months

How can robots solve tasks that demand both semantic and physical reasoning, like playing real-world Angry Birds, without tons of data? We introduce Prompting with the Future: an MPC framework that fuses a pretrained VLM with an interactive digital twin for grounded, open-world

7

34

151

Zhihan Yang

@zhihanyang_

5 months

📢Thrilled to share our new paper: Esoteric Language Models (Eso-LMs) > 🔀Fuses autoregressive (AR) and masked diffusion (MDM) paradigms > 🚀First to unlock KV caching for MDMs (65x speedup!) > 🥇Sets new SOTA on generation speed-vs-quality Pareto frontier How? Dive in👇

5

61

276

Oliver (Aochong) Li

@oliveraochongli

6 months

🤯 GPT-4o knows H&M left Russia in 2022 but still recommends shopping at H&M in Moscow. 🤔 LLMs store conflicting facts from different times, leading to inconsistent responses. We dig into how to better update LLMs with fresh facts that contradict their prior knowledge. 🧵 1/6

3

11

26