Vedant Gupta Profile
Vedant Gupta

@vedant_gupta_16

Followers
10
Following
2
Media
3
Statuses
8

Founding Member of Technical Staff @AsariAILabs. BS honors in CS+math @BrownUniversity. Formerly at @rai_inst, @BrownBigAI

San Francisco
Joined August 2024
Don't wanna be here? Send us removal request.
@vedant_gupta_16
Vedant Gupta
6 days
Excited to introduce DEPS (Discovery of GenEralizable Parameterized Skills) at #NeurIPS2025! DEPS learns interpretable parameterized skills that drastically improve generalisation to unseen tasks, especially in data-constrained settings and on out-of-distribution tasks. (1/n)
1
12
20
@vedant_gupta_16
Vedant Gupta
6 days
@haotiannnnnnnnn @calvinyluo @yidingjiang For more, please check out our: Website: https://t.co/TUNfZyhxUH Arxiv: https://t.co/hRnlvIlw8R Code: https://t.co/ioQj4psLlk See y’all at NeurIPS! Feel free to message me here or at vedantgupta@gmail.com with questions:) (n/n)
Tweet card summary image
github.com
Contribute to guptbot/DEPS development by creating an account on GitHub.
0
0
3
@vedant_gupta_16
Vedant Gupta
6 days
Bonus: DEPS discovers interpretable skills! Visualisations are on our website ⬇️ I had a great time working on DEPS with my amazing collaborators @haotiannnnnnnnn, @calvinyluo, @yidingjiang and George Konidaris! (7/n)
1
0
4
@vedant_gupta_16
Vedant Gupta
6 days
The result is superior generalization across diverse evaluation regimes: E.g. 2.3x higher average success than baselines on out-of-distribution LIBERO tasks. 2.4x better with 3-shot learning 4x better with limited pretraining In short, DEPS learns skills that transfer. (6/n)
1
0
3
@vedant_gupta_16
Vedant Gupta
6 days
To make this work, we make several key architectural choices. E.g., we view parameterized skills as low-dimensional trajectory manifolds. Trajectories can be indexed into with a scalar → DEPS compresses the robot’s state to 1D before feeding it to the low-level policy (5/n)
1
0
3
@vedant_gupta_16
Vedant Gupta
6 days
To address this, prior work uses "staged" training, e.g. using VLMs to pre-segment trajectories. If the segmentation goes wrong, you’re left with bad skills. OTOH, DEPS learns parameterized skills in an end-to-end fashion - single training process, no pretrained models. (4/n)
1
0
2
@vedant_gupta_16
Vedant Gupta
6 days
The challenge? Naively training hierarchical policies (discrete skill → continuous params → actions) doesn’t really work… There are just too many ways to fit the training data without learning the nice, reusable skills we’re looking for. (3/n)
1
0
2
@vedant_gupta_16
Vedant Gupta
6 days
What's a parameterized skill👀 Key idea: parameterized skills = discrete behaviors + continuous parameters Think: pick(x,y,z) where the skill is "pick" but (x,y,z) specify where Here’s a pick skill discovered by DEPS. Different continuous parameters pick different objects (2/n)
1
0
2