KyleVedder Profile Banner
Kyle Vedder Profile
Kyle Vedder

@KyleVedder

Followers
2K
Following
4K
Media
176
Statuses
996

Robot Learning Research @DynaRobotics | CS PhD from Penn

Redwood City, CA
Joined August 2014
Don't wanna be here? Send us removal request.
@KyleVedder
Kyle Vedder
3 years
For the record: attention, scale, and a sufficiently hard problem is all you need.
0
0
15
@KyleVedder
Kyle Vedder
4 days
get a job that lets you do both
Tweet media one
Tweet media two
3
0
17
@KyleVedder
Kyle Vedder
17 days
we need Chinchilla-style fixed compute budget trade-off curves across. - model size. - pre-training compute. - RL compute . I suspect they'd show Grok4 spent too much time doing RL, not enough time pre-training. No idea what to expect on model size.
@jxmnop
jxmo
18 days
so xAI just 10x’d the amount of compute we use on RL and the models only got a tiny bit better. are we just doing RL wrong? or is pretraining just inherently much more useful
Tweet media one
Tweet media two
0
0
7
@KyleVedder
Kyle Vedder
19 days
really wish @weights_biases had actual job state callbacks instead of forcing me to write polling infrastructure.
2
0
5
@KyleVedder
Kyle Vedder
24 days
here's another (raw) video of us fooling around with the policy for the first time. it was only trained on a few flag waiving episodes and it generalized the hand-off to all kinds of objects including sparklers (and chip bag, which I didn't record 😭)
0
0
10
@KyleVedder
Kyle Vedder
24 days
Happy Independence Day πŸ¦…πŸŒŽπŸ‡ΊπŸ‡Έ. here's another shot of our entirely autonomous interactive policy. despite knowing DYNA-1 can do robust long horizon manipulation, the flag hand-off was a wow moment for me bc it felt like a pet dog (hence the audio 🐢), not alien intelligence
@DynaRobotics
Dyna Robotics
24 days
Happy July 4th from Dyna! πŸ‡ΊπŸ‡ΈπŸ‡ΊπŸ‡Έ
1
7
74
@KyleVedder
Kyle Vedder
26 days
the next few months are going to make abundantly clear the robustness, flexibility, and generalizability of our entire tech stack. everyone squabbling about eval metrics forgets there's one metric to rule them all: ARR 😎.
0
0
8
@KyleVedder
Kyle Vedder
26 days
- new demo task. - new environment with hardware we just setup. - robust to changes in lighting, people visible, etc. - production ready quality and robustness. - real businesses that are interested bc it provides real value. it's a cook or get cooked world and we're cooking.
@DynaRobotics
Dyna Robotics
26 days
We have started taking DYNA-1, our dexterous robust VLA model, to conferences and showcasing it for hours on end!. The model run for 3 days, 8 hours each day at #HITEC2025 3 weeks ago with 99.9% overall success rate (dropped 1 towel in day 2). No intervention, it just works :)
5
7
75
@KyleVedder
Kyle Vedder
27 days
ready for another beautiful day of semi-automated ML engineering. it's crazy how rapidly things have changed
Tweet media one
0
0
15
@KyleVedder
Kyle Vedder
1 month
AV is all about the long tail. there's an ocean of difference between "it almost always works" and "it always works", and that gap is the difference between a fancy L2 and an actual L4 system. and that's why you have e.g. geofenced HD maps. don't be fooled by marketing.
0
0
7
@KyleVedder
Kyle Vedder
1 month
genuinely shocking how bad outsider analysis is of the AV space . it's overwhelmingly. - ideologically possessed. - omitting important caveats. - ignoring the actual open problems.
1
0
9
@KyleVedder
Kyle Vedder
1 month
numerology but for training performance based on SLURM job IDs.
1
0
9
@KyleVedder
Kyle Vedder
2 months
the internet is so full of robot teleop videos or 2 second clips of an "autonomous policy" (that only works on that one setup) that it's hard to put out a video of an *actually* working robot to impress normies. to them a 5 minute video of a robot actually working is just boring.
0
0
18
@KyleVedder
Kyle Vedder
2 months
it's not over for senior+ engineers yet, but wait a year.
0
0
6
@KyleVedder
Kyle Vedder
2 months
I could have shaved *at least* a year off my PhD if I had access to current tools. well designed, clean codebases translate to highly effective agents. in turn, if you guide the design well, you can hit insane research velocity and do far more ambitious experiments with ease.
1
0
5
@KyleVedder
Kyle Vedder
2 months
the capabilities growth of coding agents is crazy. I have stopped coding -- I do design and architecture, the agents do the implementation, and then I read it. it's actually over for junior engineers -- you need the design skills of a senior eng on day 1, and IDK how you get that.
2
0
12
@KyleVedder
Kyle Vedder
2 months
"use a zero vector for noise" doesn't work because, for reasonably high dimensional noise, the model just freaks out because it has never seen a data point from anywhere near the center -- random samples all have extremely large expected radii!.
0
0
2
@KyleVedder
Kyle Vedder
2 months
it would be nice to see more work on controlling diversity in denoising models. in LLMs we can easily move between MAP estimate (temp=0) and highly diverse (temp>>0). currently there's no standard analogous approach in denoising.
2
0
5
@KyleVedder
Kyle Vedder
2 months
this is also why I MIT Licensed everything in SceneFlowZoo -- I want people to actually be able to use it.
Tweet card summary image
github.com
Contribute to kylevedder/SceneFlowZoo development by creating an account on GitHub.
0
0
4
@KyleVedder
Kyle Vedder
2 months
it's very annoying that so many research ML model weights and training codebases are non-commercially licensed -- it makes certain lines of work toxic for any company affiliated researchers. if you want your research to have maximum impact, MIT License everything.
1
0
7
@KyleVedder
Kyle Vedder
2 months
this is even more catastrophic in modern LLM online RL algorithms like GRPO (the RL algo behind DeepSeek R1). they forgo explicit state Advantage modeling in favor of average reward over a batch of rollouts -- every state visited will have a very negative Advantage estimate!
Tweet media one
0
0
6