Zechu Li @softraeh X Profile

Zechu Li

@softraeh

Followers

243

Following

89

Media

8

Statuses

20

https://t.co/hJByTsbUnb

Joined June 2020

Don't wanna be here? Send us removal request.

Zechu Li

@softraeh

5 months

Unlike humans who favor a dominant hand for fine dexterous skills, robots should execute ambidextrous manipulation with equal proficiency. Introducing SYMDEX, our algorithm for Ambidextrous Bimanual Manipulation leveraging robot’s inherent bilateral symmetry as an inductive bias.

3

9

108

Zechu Li

@softraeh

5 months

🧩Web: https://t.co/kUplDJIjOg, 📷Paper: https://t.co/ypyoHzmRWs. It is a pleasure to work with my amazing collaborators Yufeng Jin, @OrdonezApraez and supervisors @GeorgiaChal, @liu_puze, and Claudio Semini.

arxiv.org

Humans naturally exhibit bilateral symmetry in their gross manipulation skills, effortlessly mirroring simple actions between left and right hands. Bimanual robots-which also feature bilateral...

0

8

Zechu Li

@softraeh

5 months

Notably, our algorithm naturally scales to multi-arm setup with more complicated symmetry groups!

1

7

Zechu Li

@softraeh

5 months

We further distill them into a global policy that is independent of the hand-task assignment and allows smooth sim-to-real transfer.

1

0

6

Zechu Li

@softraeh

5 months

SYMDEX frames complex bimanual manipulation as a multi-task multi-agent problem and learns dedicated equivariant policies for each subtask.

1

0

3

Zechu Li

@softraeh

6 months

Our method significantly outperforms other diffusion-based methods on high-dimensional control benchmarks, and is also competitive with state-of-the-art non-diffusion based RL methods while requiring fewer algorithmic design choices and smaller update-to-data ratios!

Onur Celik

@onclk_

6 months

I am happy to share that our work ‘DIME: Diffusion-Based Maximum Entropy Reinforcement Learning’ has been accepted to ICML 2025. Many thanks to my colleagues and collaborators @softraeh, @DenBless94, @BruceGeLi, @DPalenicek, @Jan_R_Peters, @GeorgiaChal,@geri_neumann

0

1

8

Yuzhe Qin

@QinYuzhe

8 months

Meet our first general-purpose robot at @DexmateAI https://t.co/jFabzsDSHJ Adjustable height from 0.66m to 2.2m: compact enough for an SUV, tall enough to reach those impossible high shelves. Powerful dual arms (15lbs payload each) and omni-directional mobility for ultimate

Dexmate

@DexmateAI

8 months

Introducing Vega: 🤖 @DexmateAI's newest robot that makes complex manipulation tasks simple. ✨ A step closer to intelligence and automation. 🚀 🎥 Watch now: https://t.co/Eg5AI8vVHY #Robotics #AI #Automation

13

34

210

Zechu Li

@softraeh

1 year

Grateful to have worked with my awesome collaborators Rickmer Krohn, @taochenshh, @aajay3110, and supervisors @GeorgiaChal, @pulkitology!

0

3

Zechu Li

@softraeh

1 year

We design a series of new challenging robot navigation and manipulation tasks with a high degree of multimodality, serving as a testbed for multimodal policy learning. (5/5) 🧩Web: https://t.co/akgO1y7Egr. 📔Paper: https://t.co/YxchrzotWy.

arxiv.org

Deep reinforcement learning (RL) algorithms typically parameterize the policy as a deep network that outputs either a deterministic action or a stochastic one modeled as a Gaussian distribution,...

1

8

Zechu Li

@softraeh

1 year

Moreover, we achieve explicit mode control by policy conditioning on a mode-specific latent embedding during training, shown to be beneficial for online replanning. (4/5)

1

0

6

Zechu Li

@softraeh

1 year

Second, the greediness of RL objective will collapse the policy into a single mode. We address this issue by using mode-specific Q-functions and constructing multimodal batch for policy learning. (3/5)

1

0

4

Zechu Li

@softraeh

1 year

First, we explicitly discover behavior modes via an off-the-shelf unsupervised hierarchical clustering approach. Different modes should act differently at certain states and then diverge, therefore being distinguishable from trajectories/sequences of states. (2/5)

1

0

5

Zechu Li

@softraeh

1 year

How can an RL agent successfully solve a task while showcasing versatility of behaviors—a property intuitive to intelligent systems like humans? Introducing Deep Diffusion Policy Gradient (DDiffPG) - our new algorithm for learning multimodal behaviors from scratch! (1/5)

2

34

161

Abhishek Gupta

@abhishekunique7

2 years

Robot learning in the real world can be expensive and unsafe in human-centric environments. Solution: Construct simulation on the fly and train in it! Excited to share RialTo, led by @marceltornev on learning resilient policies via real-to-sim-to-real policy learning! A 🧵 (1/12)

2

22

115

Tao Chen

@taochenshh

2 years

Working with massive parallel simulation like Isaac Gym? Wonder how to learn policies faster and better than PPO? Check out our poster #102 (1:30pm on Thur) #ICML2023 . We have also open sourced the code. 📔Paper: https://t.co/a5iNnjCKBl 🧑‍💻Code:

github.com

Parallel Q-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation - Improbable-AI/pql

5

10

90

Yilun Du

@du_yilun

3 years

Introducing Decision Diffuser, a conditional diffusion model that outperforms offline RL across standard benchmarks – using only generative modeling training! Decision Diffusers can also combine multiple constraints and skills at test-time. Website: https://t.co/bQErTTKHEc 1/5

15

132

863

Xiao-Yang Liu

@XiaoYangLiu10

4 years

“ElegantRL: Much Much More Stable than Stable-Baseline3” by Xiao-Yang Liu https://t.co/CZMMEwO1Hx

0

2

7

Zhaoran Wang

@zhaoran_wang

5 years

Check out ElegantRL, a lightweight, stable, and efficient deep reinforcement learning library with < 1000 lines of code designed to make our life easier! It also powers FinRL ( https://t.co/kshYms4sVR). https://t.co/ijpwQijcbH

towardsdatascience.com

Mastering deep reinforcement learning in one day.

2

36

188