Embodied AI Reading Notes Profile
Embodied AI Reading Notes

@EmbodiedAIRead

Followers
558
Following
3
Media
49
Statuses
57

Sharing daily personal notes on selected interesting Embodied AI papers, blogs and talks | Maintained by @yilun_chen_ | Opinions are my own.

California, USA
Joined July 2025
Don't wanna be here? Send us removal request.
@EmbodiedAIRead
Embodied AI Reading Notes
21 hours
Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots. Project: Paper: Code: Tutorial: New open-source Camera Depth Models (CDM) for depth cameras to enable
Tweet media one
2
15
74
@EmbodiedAIRead
Embodied AI Reading Notes
3 days
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control. Project: Paper: Code: New open-source unified 3B embodied foundation model that enables perception, planning,
Tweet media one
3
34
201
@EmbodiedAIRead
Embodied AI Reading Notes
5 days
FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control. Project: Paper: Code: A high performant variant of TD3 algorithm that’s optimized for humanoid tasks from Pieter Abbeel’s
Tweet media one
2
27
150
@EmbodiedAIRead
Embodied AI Reading Notes
7 days
HITTER: A HumanoId Table TEnnis Robot via Hierarchical Planning and Learning. Project: Paper: This project shows amazing videos of Unitree G1 playing table tennis against human with agile and fluent motion for 106 consecutive
Tweet media one
1
0
14
@EmbodiedAIRead
Embodied AI Reading Notes
9 days
RICL: Adding In-Context Adaptability to Pre-Trained Vision-Language-Action Models. Project: Paper: This project adapts the popular in-context learning (ICL) and retrieval-augmented generation (RAG) ideas from LLM, and successfully
Tweet media one
0
15
99
@EmbodiedAIRead
Embodied AI Reading Notes
12 days
Advancing the Frontier of Silicon Intelligence: the Past, Open Problems, and the Future. YouTube: Recent lecture given by Shuchao Bi, lead researcher at OpenAI, now Meta SuperIntelligence at Columbia University. Having a unique experience of math PhD.
0
0
2
@EmbodiedAIRead
Embodied AI Reading Notes
14 days
Neural Robot Dynamics. Project: Paper: An interesting new work on learning robot-specific dynamics models for predicting future states on robot as articulated rigid body, which can serve as replacement for low-level dynamics and
Tweet media one
1
16
113
@EmbodiedAIRead
Embodied AI Reading Notes
15 days
Masquerade: Learning from In-the-wild Human Videos using Data-Editing. Project: Paper: A simple yet effective way to improve pre-training robot policies from egocentric human videos: edit video input by replacing human with robot
Tweet media one
0
12
77
@EmbodiedAIRead
Embodied AI Reading Notes
16 days
Large Behavior Models and Atlas Find New Footing. Blog: Amazing new loco-manipulation demos on Atlas from Boston Dynamics and Toyota TRI collaboration. The blog shows some interesting videos covering a wide range of new capabilities, featured by one
Tweet media one
2
25
141
@EmbodiedAIRead
Embodied AI Reading Notes
18 days
Video Generators are Robot Policies. Project: Paper: Video is an abundant and scalable data source for robot learning, but it’s hard to use as it lacks action information in it. This project proposes a clever way to leverage
Tweet media one
Tweet media two
1
21
142
@EmbodiedAIRead
Embodied AI Reading Notes
20 days
BeyondMimic: From Motion Tracking to Versatile Humanoid Control via Guided Diffusion. Project: Paper: This project shows videos on amazing agile and versatile humanoid whole-body motion surpassing prior works, so it’s worthwhile to
Tweet media one
3
21
163
@EmbodiedAIRead
Embodied AI Reading Notes
22 days
DinoV3. Website: Code: Models: Paper: DinoV3 is released publicly today! Nice performance upgrade as shown below. Major updates from DinoV2:.- DinoV2 1.1B Parameters -> DinoV3 7B
Tweet media one
0
2
13
@EmbodiedAIRead
Embodied AI Reading Notes
23 days
Understanding Multimodal LLMs. Blog: A nice blog covering the fundamentals of multimodal LLMs. It’s important to understand how multimodal LLM works, as it is highly related and often serves as a major building block in modern embodied ai model
Tweet media one
0
0
7
@EmbodiedAIRead
Embodied AI Reading Notes
24 days
MolmoAct: Action Reasoning Models that can Reason in Space. Blog: Paper: New paper from Ai2 on a new kind of robot foundation model, Action Reasoning Model as they named it. - The new architecture extends a typical VLM model by
Tweet media one
0
1
13
@EmbodiedAIRead
Embodied AI Reading Notes
25 days
Evaluating Pi0 in the Wild: Strengths, Problems, and the Future of Generalist Robot Policies. Blog: An interesting “vibe-checking” on Pi0 policy in a kitchen-like environment. It’s interesting to see examples on what works well and what doesn’t work out
Tweet media one
0
5
22
@EmbodiedAIRead
Embodied AI Reading Notes
25 days
The unique difference here is they, at scale, used a video prediction model (or called world model) to replace the VLM often used in recent VLAs to serve as the pre-trained robot foundation model backbone. One could argue this shifts towards more "vision-centric" rather than.
0
0
4
@EmbodiedAIRead
Embodied AI Reading Notes
26 days
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation. Project: Paper: New paper from AgiBot on World Model for manipulation. - GE-Base: a video diffusion model trained on 3000hours, over 1million
Tweet media one
9
52
308
@EmbodiedAIRead
Embodied AI Reading Notes
28 days
Real-Time Execution of Action Chunking Flow Policies. Paper: A followup blog from Cobot to further investigate the hyper-parameters used in RTC: The latest research paper from Physical Intelligence trying to tackle the
Tweet media one
0
1
9
@EmbodiedAIRead
Embodied AI Reading Notes
1 month
The Emerging Humanoid Motor Cortex: An Inventory of RL-Trained Controllers. Blog: Whole Body Controllers comparison table: A great blog by Alan Fern reviewing the current field of WBC (whole-body controller) for humanoids. Highly
Tweet media one
0
0
8