TritiumAc Profile Banner
Chuanruo Ning Profile
Chuanruo Ning

@TritiumAc

Followers
91
Following
269
Media
5
Statuses
10

PhD student at Cornell working on leveraging 3D vision for robot manipulation. Previously at Peking University

Joined April 2022
Don't wanna be here? Send us removal request.
@kushalk_
Kushal
4 months
Teleoperation is slow, expensive, and difficult to scale. So how can we train our robots instead? Introducing X-Sim: a real-to-sim-to-real framework that trains image-based policies 1) learned entirely in simulation 2) using rewards from human videos. https://t.co/5yt2iTFYF4
4
42
114
@TritiumAc
Chuanruo Ning
5 months
Check out our paper and project page for more information. Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins 📄 https://t.co/rQhV0uIETo 🌐 https://t.co/U3m6v1oSCu 💻 https://t.co/tpXOnRMgLP Huge thanks to my advisors: @KuanFang and
0
0
2
@TritiumAc
Chuanruo Ning
5 months
By explicitly modeling dynamics with an interactive digital twin, our method significantly outperforms baselines that directly prompt or fine-tune the VLM to handle both semantics and physics.
1
0
1
@TritiumAc
Chuanruo Ning
5 months
Our method enables the robot to perform diverse, previously unseen manipulation tasks, involving 6-DoF manipulation, tool use, and precise manipulation.
1
0
1
@TritiumAc
Chuanruo Ning
5 months
To robustly model the dynamics and provide informative inputs to the VLM, we build the digital twin from a real scene scan using a hybrid representation: Meshes for physics simulation Gaussians for photorealistic rendering
1
1
2
@TritiumAc
Chuanruo Ning
5 months
At the core of our method, we integrate a pretrained VLM with an interactive digital twin in a Model Predictive Control paradigm. 🌏 The digital twin serves as a dynamic model to predict the outcomes of different actions. 🏆 The VLM evaluates and selects the best action sequence
1
0
1
@TritiumAc
Chuanruo Ning
5 months
How can robots solve tasks that demand both semantic and physical reasoning, like playing real-world Angry Birds, without tons of data? We introduce Prompting with the Future: an MPC framework that fuses a pretrained VLM with an interactive digital twin for grounded, open-world
7
34
151
@zhihanyang_
Zhihan Yang
5 months
📢Thrilled to share our new paper: Esoteric Language Models (Eso-LMs) > 🔀Fuses autoregressive (AR) and masked diffusion (MDM) paradigms > 🚀First to unlock KV caching for MDMs (65x speedup!) > 🥇Sets new SOTA on generation speed-vs-quality Pareto frontier How? Dive in👇
5
61
276
@oliveraochongli
Oliver (Aochong) Li
6 months
🤯 GPT-4o knows H&M left Russia in 2022 but still recommends shopping at H&M in Moscow. 🤔 LLMs store conflicting facts from different times, leading to inconsistent responses. We dig into how to better update LLMs with fresh facts that contradict their prior knowledge. 🧵 1/6
3
11
26