
Zechu Li
@softraeh
Followers
243
Following
89
Media
8
Statuses
20
Unlike humans who favor a dominant hand for fine dexterous skills, robots should execute ambidextrous manipulation with equal proficiency. Introducing SYMDEX, our algorithm for Ambidextrous Bimanual Manipulation leveraging robot’s inherent bilateral symmetry as an inductive bias.
3
9
108
🧩Web: https://t.co/kUplDJIjOg, 📷Paper: https://t.co/ypyoHzmRWs. It is a pleasure to work with my amazing collaborators Yufeng Jin, @OrdonezApraez and supervisors @GeorgiaChal, @liu_puze, and Claudio Semini.
arxiv.org
Humans naturally exhibit bilateral symmetry in their gross manipulation skills, effortlessly mirroring simple actions between left and right hands. Bimanual robots-which also feature bilateral...
0
0
8
Notably, our algorithm naturally scales to multi-arm setup with more complicated symmetry groups!
1
1
7
We further distill them into a global policy that is independent of the hand-task assignment and allows smooth sim-to-real transfer.
1
0
6
SYMDEX frames complex bimanual manipulation as a multi-task multi-agent problem and learns dedicated equivariant policies for each subtask.
1
0
3
Our method significantly outperforms other diffusion-based methods on high-dimensional control benchmarks, and is also competitive with state-of-the-art non-diffusion based RL methods while requiring fewer algorithmic design choices and smaller update-to-data ratios!
I am happy to share that our work ‘DIME: Diffusion-Based Maximum Entropy Reinforcement Learning’ has been accepted to ICML 2025. Many thanks to my colleagues and collaborators @softraeh, @DenBless94, @BruceGeLi, @DPalenicek, @Jan_R_Peters, @GeorgiaChal,@geri_neumann
0
1
8
Meet our first general-purpose robot at @DexmateAI
https://t.co/jFabzsDSHJ Adjustable height from 0.66m to 2.2m: compact enough for an SUV, tall enough to reach those impossible high shelves. Powerful dual arms (15lbs payload each) and omni-directional mobility for ultimate
Introducing Vega: 🤖 @DexmateAI's newest robot that makes complex manipulation tasks simple. ✨ A step closer to intelligence and automation. 🚀 🎥 Watch now: https://t.co/Eg5AI8vVHY
#Robotics #AI #Automation
13
34
210
Grateful to have worked with my awesome collaborators Rickmer Krohn, @taochenshh, @aajay3110, and supervisors @GeorgiaChal, @pulkitology!
0
0
3
We design a series of new challenging robot navigation and manipulation tasks with a high degree of multimodality, serving as a testbed for multimodal policy learning. (5/5) 🧩Web: https://t.co/akgO1y7Egr. 📔Paper: https://t.co/YxchrzotWy.
arxiv.org
Deep reinforcement learning (RL) algorithms typically parameterize the policy as a deep network that outputs either a deterministic action or a stochastic one modeled as a Gaussian distribution,...
1
1
8
Moreover, we achieve explicit mode control by policy conditioning on a mode-specific latent embedding during training, shown to be beneficial for online replanning. (4/5)
1
0
6
Second, the greediness of RL objective will collapse the policy into a single mode. We address this issue by using mode-specific Q-functions and constructing multimodal batch for policy learning. (3/5)
1
0
4
First, we explicitly discover behavior modes via an off-the-shelf unsupervised hierarchical clustering approach. Different modes should act differently at certain states and then diverge, therefore being distinguishable from trajectories/sequences of states. (2/5)
1
0
5
How can an RL agent successfully solve a task while showcasing versatility of behaviors—a property intuitive to intelligent systems like humans? Introducing Deep Diffusion Policy Gradient (DDiffPG) - our new algorithm for learning multimodal behaviors from scratch! (1/5)
2
34
161
Robot learning in the real world can be expensive and unsafe in human-centric environments. Solution: Construct simulation on the fly and train in it! Excited to share RialTo, led by @marceltornev on learning resilient policies via real-to-sim-to-real policy learning! A 🧵 (1/12)
2
22
115
Working with massive parallel simulation like Isaac Gym? Wonder how to learn policies faster and better than PPO? Check out our poster #102 (1:30pm on Thur) #ICML2023 . We have also open sourced the code. 📔Paper: https://t.co/a5iNnjCKBl 🧑💻Code:
github.com
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation - Improbable-AI/pql
5
10
90
Introducing Decision Diffuser, a conditional diffusion model that outperforms offline RL across standard benchmarks – using only generative modeling training! Decision Diffusers can also combine multiple constraints and skills at test-time. Website: https://t.co/bQErTTKHEc 1/5
15
132
863
“ElegantRL: Much Much More Stable than Stable-Baseline3” by Xiao-Yang Liu https://t.co/CZMMEwO1Hx
0
2
7
Check out ElegantRL, a lightweight, stable, and efficient deep reinforcement learning library with < 1000 lines of code designed to make our life easier! It also powers FinRL ( https://t.co/kshYms4sVR).
https://t.co/ijpwQijcbH
towardsdatascience.com
Mastering deep reinforcement learning in one day.
2
36
188