Thomas Kipf
@tkipf
Followers
29K
Following
12K
Media
276
Statuses
2K
Sr. Staff RS at @GoogleDeepMind. Veo Team. Controllable World Simulators: GNNs, Structured World Models, Neural Assets, Veo References, Veo Robotics
San Francisco, CA
Joined June 2009
My PhD thesis "Deep Learning with Graph-Structured Representations" is now available for download: https://t.co/hyz0cnoewZ -- It covers a range of emerging topics in Deep Learning: from graph neural nets (and graph convolutions) to structure discovery (objects, relations, events)
41
611
3K
World models are helping us evaluate #GeminiRobotics generalist policies more effectively, including auto-red-teaming to safely find and address vulnerabilities in the models. Learn more at: https://t.co/Nisxb3UTVY
@GoogleDeepMind Robotics
veo-robotics.github.io
Project page: Evaluating Gemini Robotics Policies in a Veo World Simulator.
Generalist robots need a generalist evaluator. But how do you test safety without breaking things? 💥 🌎 Introducing our new work from @GoogleDeepMind: Evaluating Gemini Robotics Policies in a Veo World Simulator https://t.co/ZjvpYXFddZ 🧵👇
3
10
92
World models for policy evaluation; the fact that world model performance is highly correlated with real-world performance is incredibly valuable on its own.
So excited to finally talk about this work! Veo is a surprisingly strong world simulator. We fine-tuned Veo on action-conditioned, multi-view robotics data. Key result: running a policy in the world model is strongly correlated with real-world results. A few important
4
18
155
Check out the paper and website at
veo-robotics.github.io
Project page: Evaluating Gemini Robotics Policies in a Veo World Simulator.
0
0
20
So excited to finally talk about this work! Veo is a surprisingly strong world simulator. We fine-tuned Veo on action-conditioned, multi-view robotics data. Key result: running a policy in the world model is strongly correlated with real-world results. A few important
Generalist robots need a generalist evaluator. But how do you test safety without breaking things? 💥 🌎 Introducing our new work from @GoogleDeepMind: Evaluating Gemini Robotics Policies in a Veo World Simulator https://t.co/ZjvpYXFddZ 🧵👇
13
32
224
Here’s what unexpected scenarios look like in camera, radar, and LiDAR.
117
85
894
One hour Waymo ride from San Francisco to Menlo Park via highway 280. It’s over, cars are self-driving. Everything else is just about rolling this out to the rest of the world.
290
312
4K
Come work with Joe and you’ll sit by me, @poolio, @jon_barron, @RuiqiGao, @holynski_, @tkipf, and many other fine folks!
We’re hiring student researchers at Google DeepMind for 2026. Come work with our great team in SF on anything from diffusion / world models / 3D. Send me an email if you’re interested!
1
3
30
Finally got freeway access on Waymo. Going fast in a fully driverless car almost feels as magical as stepping into a Waymo for the first time a bit over a year ago.
2
1
38
10
3
255
Gemini 3 Deep Think is now available for Ultra users, making available our IMO & ICPC Gold Medal-winning technology. Deep Think shows improved generalization on difficult benchmarks like ARC-AGI-2, and outperforms Gemini 3 Pro on HLE & GPQA Diamond. We hope this serves as a
blog.google
Today, we’re rolling out Gemini 3 Deep Think mode to Google AI Ultra subscribers in the Gemini app. This new mode delivers a meaningful improvement in reasoning capabili…
56
89
1K
As we work to mitigate the @iclr_conf incident, personal integrity is the only true firewall. The integrity of our science is at stake. Please act responsibly.
0
1
15
I'll be at NeurIPS all week -- reach out if you want to chat! Would love to chat especially if you work on world models (in particular for the physical world / robotics), visual reasoning, or controls for video gen.
7
4
162
Just an absolute triumph of technology. Everyone who worked on this should be so proud.
2
1
50
Veo 3.1 is embarrassingly compositional
New trick for Nano Banana Pro + @FlowbyGoogle Found this one while I was exploring NB's ability to do sprite sheets yesterday. Step 1: Create a multi-frame sprite sheet / image sequence with Nano Banana pro Step 2: Use that single sheet as an Ingredient for Veo 3.1
2
1
31
Character consistency has come a long way! It's such a great compositionality test. And it turns out frontier models are truly "embarassingly compositional".
Gemini 3 Pro 🤝 Nano Banana Pro New SOTA image generation and editing. 🍌Production-ready visuals with improved precision and control 🍌Images grounded in Gemini’s real world understanding 🍌Superior text rendering 🍌Localization in multiple languages 🍌Physically accurate
0
1
21