Kyle Vedder @KyleVedder X Profile

Kyle Vedder

@KyleVedder

Followers

2K

Following

4K

Media

176

Statuses

996

Robot Learning Research @DynaRobotics | CS PhD from Penn

Redwood City, CA

Joined August 2014

Don't wanna be here? Send us removal request.

Kyle Vedder

@KyleVedder

3 years

For the record: attention, scale, and a sufficiently hard problem is all you need.

0

15

Kyle Vedder

@KyleVedder

4 days

get a job that lets you do both

3

0

17

Kyle Vedder

@KyleVedder

17 days

we need Chinchilla-style fixed compute budget trade-off curves across. - model size. - pre-training compute. - RL compute . I suspect they'd show Grok4 spent too much time doing RL, not enough time pre-training. No idea what to expect on model size.

jxmo

@jxmnop

18 days

so xAI just 10x’d the amount of compute we use on RL and the models only got a tiny bit better. are we just doing RL wrong? or is pretraining just inherently much more useful

0

7

Kyle Vedder

@KyleVedder

19 days

really wish @weights_biases had actual job state callbacks instead of forcing me to write polling infrastructure.

2

0

5

Kyle Vedder

@KyleVedder

24 days

here's another (raw) video of us fooling around with the policy for the first time. it was only trained on a few flag waiving episodes and it generalized the hand-off to all kinds of objects including sparklers (and chip bag, which I didn't record 😭)

0

10

Kyle Vedder

@KyleVedder

24 days

Happy Independence Day 🦅🌎🇺🇸. here's another shot of our entirely autonomous interactive policy. despite knowing DYNA-1 can do robust long horizon manipulation, the flag hand-off was a wow moment for me bc it felt like a pet dog (hence the audio 🐶), not alien intelligence

Dyna Robotics

@DynaRobotics

24 days

Happy July 4th from Dyna! 🇺🇸🇺🇸

1

7

74

Kyle Vedder

@KyleVedder

26 days

the next few months are going to make abundantly clear the robustness, flexibility, and generalizability of our entire tech stack. everyone squabbling about eval metrics forgets there's one metric to rule them all: ARR 😎.

0

8

Kyle Vedder

@KyleVedder

26 days

- new demo task. - new environment with hardware we just setup. - robust to changes in lighting, people visible, etc. - production ready quality and robustness. - real businesses that are interested bc it provides real value. it's a cook or get cooked world and we're cooking.

Dyna Robotics

@DynaRobotics

26 days

We have started taking DYNA-1, our dexterous robust VLA model, to conferences and showcasing it for hours on end!. The model run for 3 days, 8 hours each day at #HITEC2025 3 weeks ago with 99.9% overall success rate (dropped 1 towel in day 2). No intervention, it just works :)

5

7

75

Kyle Vedder

@KyleVedder

27 days

ready for another beautiful day of semi-automated ML engineering. it's crazy how rapidly things have changed

0

15

Kyle Vedder

@KyleVedder

1 month

AV is all about the long tail. there's an ocean of difference between "it almost always works" and "it always works", and that gap is the difference between a fancy L2 and an actual L4 system. and that's why you have e.g. geofenced HD maps. don't be fooled by marketing.

0

7

Kyle Vedder

@KyleVedder

1 month

genuinely shocking how bad outsider analysis is of the AV space . it's overwhelmingly. - ideologically possessed. - omitting important caveats. - ignoring the actual open problems.

1

0

9

Kyle Vedder

@KyleVedder

1 month

numerology but for training performance based on SLURM job IDs.

1

0

9

Kyle Vedder

@KyleVedder

2 months

the internet is so full of robot teleop videos or 2 second clips of an "autonomous policy" (that only works on that one setup) that it's hard to put out a video of an *actually* working robot to impress normies. to them a 5 minute video of a robot actually working is just boring.

0

18

Kyle Vedder

@KyleVedder

2 months

it's not over for senior+ engineers yet, but wait a year.

0

6

Kyle Vedder

@KyleVedder

2 months

I could have shaved *at least* a year off my PhD if I had access to current tools. well designed, clean codebases translate to highly effective agents. in turn, if you guide the design well, you can hit insane research velocity and do far more ambitious experiments with ease.

1

0

5

Kyle Vedder

@KyleVedder

2 months

the capabilities growth of coding agents is crazy. I have stopped coding -- I do design and architecture, the agents do the implementation, and then I read it. it's actually over for junior engineers -- you need the design skills of a senior eng on day 1, and IDK how you get that.

2

0

12

Kyle Vedder

@KyleVedder

2 months

"use a zero vector for noise" doesn't work because, for reasonably high dimensional noise, the model just freaks out because it has never seen a data point from anywhere near the center -- random samples all have extremely large expected radii!.

0

2

Kyle Vedder

@KyleVedder

2 months

it would be nice to see more work on controlling diversity in denoising models. in LLMs we can easily move between MAP estimate (temp=0) and highly diverse (temp>>0). currently there's no standard analogous approach in denoising.

2

0

5

Kyle Vedder

@KyleVedder

2 months

this is also why I MIT Licensed everything in SceneFlowZoo -- I want people to actually be able to use it.

github.com

Contribute to kylevedder/SceneFlowZoo development by creating an account on GitHub.

0

4

Kyle Vedder

@KyleVedder

2 months

it's very annoying that so many research ML model weights and training codebases are non-commercially licensed -- it makes certain lines of work toxic for any company affiliated researchers. if you want your research to have maximum impact, MIT License everything.

1

0

7

Kyle Vedder

@KyleVedder

2 months

this is even more catastrophic in modern LLM online RL algorithms like GRPO (the RL algo behind DeepSeek R1). they forgo explicit state Advantage modeling in favor of average reward over a batch of rollouts -- every state visited will have a very negative Advantage estimate!

0

6