Pieter Abbeel
@pabbeel
Followers
89K
Following
11K
Media
346
Statuses
3K
FastTD3: "Minimum innovation, maximum results". Not the paper we had planned to write, but one of the works I am most proud of. We wanted to make sure our baseline (TD3) was a very solid baseline, so we added a few things that are already known to help in RL (large, …
Excited to present FastTD3: a simple, fast, and capable off-policy RL algorithm for humanoid control -- with an open-source code to run your own humanoid RL experiments in no time! Thread below 🧵
14
31
252
RT @robertnishihara: Congratulations to my brilliant co-founder Philipp Moritz (@pcmoritz) and the legendary John Schulman, Sergey Levine,…
0
11
0
RT @RaquelUrtasun: Today we're unveiling something truly extraordinary—one of the coolest and most transformative technologies: Mixed Reali…
0
13
0
RT @shaneguML: @MishaLaskin @reflection_ai Congratulations Misha! Something many engineers in enterprises wanted for a long time. A great e…
0
1
0
RT @MishaLaskin: Engineers spend 70% of their time understanding code, not writing it. That’s why we built Asimov at @reflection_ai. The…
0
179
0
RT @HaoranGeng2: 🤖 What if a humanoid robot could make a hamburger from raw ingredients—all the way to your plate? 🔥 Excited to announce V…
0
120
0
RT @berkeley_ai: Congratulations to BAIR researchers @kevin_zakka @qiayuanliao @arthurallshire @carlo_sferrazza @KoushilSreenath @pabbeel a…
playground.mujoco.org
An open-source framework for GPU-accelerated robot learning and sim-to-real transfer
0
7
0
RT @carlo_sferrazza: Sim2real is getting so mature that with great hardware (thanks @clemens_chr @katzschmann), you can get things running…
0
25
0
RT @zhaohengyin: Just open-sourced Geometric Retargeting (GeoRT) — the kinematic retargeting module behind DexterityGen. Includes tools fo…
0
11
0
RT @AdemiAdeniji: Everyday human data is robotics’ answer to internet-scale tokens. But how can robots learn to feel—just from videos?📹 I…
0
38
0
RT @HaoranGeng2: 🚀Check out our new work, FastTD3, a reinforcement learning algorithm that is simple, efficient, and highly capable. It ac…
0
6
0
RT @younggyoseo: Excited to present FastTD3: a simple, fast, and capable off-policy RL algorithm for humanoid control -- with an open-sourc…
0
114
0
RT @carlo_sferrazza: Off-policy learning transfers from sim to real-world humanoids! Off-policy methods have pushed RL sample efficiency,…
0
5
0
RT @vincentjliu: The future of robotics isn't in the lab – it's in your hands. Can we teach robots to act in the real world without a singl…
0
40
0
RT @AdemiAdeniji: Closed-loop robot policies directly from human interactions. No teleop, no robot data co-training, no RL, and no sim. Jus…
0
12
0
RT @rocky_duan: Our robotics team will be at ICRA next week in Atlanta! Having started a new research team at Amazon building robot foundat…
0
21
0