Zechu Li Profile
Zechu Li

@softraeh

Followers
243
Following
89
Media
8
Statuses
20

Joined June 2020
Don't wanna be here? Send us removal request.
@softraeh
Zechu Li
5 months
Unlike humans who favor a dominant hand for fine dexterous skills, robots should execute ambidextrous manipulation with equal proficiency. Introducing SYMDEX, our algorithm for Ambidextrous Bimanual Manipulation leveraging robot’s inherent bilateral symmetry as an inductive bias.
3
9
108
@softraeh
Zechu Li
5 months
Notably, our algorithm naturally scales to multi-arm setup with more complicated symmetry groups!
1
1
7
@softraeh
Zechu Li
5 months
We further distill them into a global policy that is independent of the hand-task assignment and allows smooth sim-to-real transfer.
1
0
6
@softraeh
Zechu Li
5 months
SYMDEX frames complex bimanual manipulation as a multi-task multi-agent problem and learns dedicated equivariant policies for each subtask.
1
0
3
@softraeh
Zechu Li
6 months
Our method significantly outperforms other diffusion-based methods on high-dimensional control benchmarks, and is also competitive with state-of-the-art non-diffusion based RL methods while requiring fewer algorithmic design choices and smaller update-to-data ratios!
@onclk_
Onur Celik
6 months
I am happy to share that our work ‘DIME: Diffusion-Based Maximum Entropy Reinforcement Learning’ has been accepted to ICML 2025. Many thanks to my colleagues and collaborators @softraeh, @DenBless94, @BruceGeLi, @DPalenicek, @Jan_R_Peters, @GeorgiaChal,@geri_neumann
0
1
8
@QinYuzhe
Yuzhe Qin
8 months
Meet our first general-purpose robot at @DexmateAI https://t.co/jFabzsDSHJ Adjustable height from 0.66m to 2.2m: compact enough for an SUV, tall enough to reach those impossible high shelves. Powerful dual arms (15lbs payload each) and omni-directional mobility for ultimate
@DexmateAI
Dexmate
8 months
Introducing Vega: 🤖 @DexmateAI's newest robot that makes complex manipulation tasks simple. ✨ A step closer to intelligence and automation. 🚀 🎥 Watch now: https://t.co/Eg5AI8vVHY #Robotics #AI #Automation
13
34
210
@softraeh
Zechu Li
1 year
Grateful to have worked with my awesome collaborators Rickmer Krohn, @taochenshh, @aajay3110, and supervisors @GeorgiaChal, @pulkitology!
0
0
3
@softraeh
Zechu Li
1 year
We design a series of new challenging robot navigation and manipulation tasks with a high degree of multimodality, serving as a testbed for multimodal policy learning. (5/5) 🧩Web: https://t.co/akgO1y7Egr. 📔Paper: https://t.co/YxchrzotWy.
Tweet card summary image
arxiv.org
Deep reinforcement learning (RL) algorithms typically parameterize the policy as a deep network that outputs either a deterministic action or a stochastic one modeled as a Gaussian distribution,...
1
1
8
@softraeh
Zechu Li
1 year
Moreover, we achieve explicit mode control by policy conditioning on a mode-specific latent embedding during training, shown to be beneficial for online replanning. (4/5)
1
0
6
@softraeh
Zechu Li
1 year
Second, the greediness of RL objective will collapse the policy into a single mode. We address this issue by using mode-specific Q-functions and constructing multimodal batch for policy learning. (3/5)
1
0
4
@softraeh
Zechu Li
1 year
First, we explicitly discover behavior modes via an off-the-shelf unsupervised hierarchical clustering approach. Different modes should act differently at certain states and then diverge, therefore being distinguishable from trajectories/sequences of states. (2/5)
1
0
5
@softraeh
Zechu Li
1 year
How can an RL agent successfully solve a task while showcasing versatility of behaviors—a property intuitive to intelligent systems like humans? Introducing Deep Diffusion Policy Gradient (DDiffPG) - our new algorithm for learning multimodal behaviors from scratch! (1/5)
2
34
161
@abhishekunique7
Abhishek Gupta
2 years
Robot learning in the real world can be expensive and unsafe in human-centric environments. Solution: Construct simulation on the fly and train in it! Excited to share RialTo, led by @marceltornev on learning resilient policies via real-to-sim-to-real policy learning! A 🧵 (1/12)
2
22
115
@taochenshh
Tao Chen
2 years
Working with massive parallel simulation like Isaac Gym? Wonder how to learn policies faster and better than PPO? Check out our poster #102 (1:30pm on Thur) #ICML2023 . We have also open sourced the code. 📔Paper: https://t.co/a5iNnjCKBl 🧑‍💻Code:
Tweet card summary image
github.com
Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation - Improbable-AI/pql
5
10
90
@du_yilun
Yilun Du
3 years
Introducing Decision Diffuser, a conditional diffusion model that outperforms offline RL across standard benchmarks – using only generative modeling training! Decision Diffusers can also combine multiple constraints and skills at test-time. Website: https://t.co/bQErTTKHEc 1/5
15
132
863
@XiaoYangLiu10
Xiao-Yang Liu
4 years
“ElegantRL: Much Much More Stable than Stable-Baseline3” by Xiao-Yang Liu https://t.co/CZMMEwO1Hx
0
2
7
@zhaoran_wang
Zhaoran Wang
5 years
Check out ElegantRL, a lightweight, stable, and efficient deep reinforcement learning library with < 1000 lines of code designed to make our life easier! It also powers FinRL ( https://t.co/kshYms4sVR). https://t.co/ijpwQijcbH
Tweet card summary image
towardsdatascience.com
Mastering deep reinforcement learning in one day.
2
36
188