Valentin Gabeur
@vgabeur
Followers: 365
Following: 295
Media: 25
Statuses: 71
Senior Research Scientist @GoogleDeepmind | Prev. Postdoc @MetaAI, PhD @Inria & @GoogleAI
San Francisco
Joined March 2012
Try it yourself! https://t.co/ty0BfAek0X
Here’s how you can start trying Gemini 2.5 Flash image generation in @GeminiApp and @Google AI Studio →
0
0
0
strange object spotted under the microscope over the weekend in the lab...
358
244
4K
Introducing DINOv3 🦕🦕🦕 A SotA-enabling vision foundation model, trained with pure self-supervised learning (SSL) at scale. High quality dense features, combining unprecedented semantic and geometric scene understanding. Three reasons why this matters…
12
141
1K
What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵
834
3K
14K
We just shipped Gemini 2.5 Deep Think. It doesn't just recall research papers - it fuses ideas across papers in ways I haven't seen before. This level of capability demands careful evaluation. Model card below 👇
38
146
2K
For researchers, scientists, and academics tackling hard problems: Gemini 2.5 Deep Think is here. 🤯 It doesn't just answer, it brainstorms using parallel thinking and reinforcement learning techniques. We put it into the hands of mathematicians who explored what it can do ↓
140
490
3K
Next-level referring expression segmentation with Gemini 2.5 🤯
Gemini 2.5 introduces conversational image segmentation for AI, enabling advanced visual understanding through object relationships, conditional logic, and in-image text. https://t.co/urfBEAwV7U
1
2
67
🚀 Excited to launch "Conversational Image Segmentation" for Gemini 2.5. Now you can segment any image with natural language. Think complex queries ("people throwing frisbees"), conditional logic ("workers not wearing hard hats"), and even abstract concepts ("areas with weather
9
11
206
Proud to present our work on getting Gemini 2.5 to predict segmentation masks in images! This allows for the next level of complexity in language prompting 🚀
0
0
19
we’re experimenting with SAM2 for the automatic detection of 3-second violations in NBA games. it's pretty tricky, but @AlexBodner_ is working hard on the algo! supervision: https://t.co/xXMRaS3Guk
I'm experimenting with using SAM2 for automatic tracking of NBA players; it's mind-blowing how well this model performs out of the box. we might even add a SAM2-based tracker to the trackers package at some point. trackers: https://t.co/9Fam5U1zuC
12
53
557
#Veo3 further blurs the lines between reality and imagination with audio, stronger text adherence, and richer visual details.
60
186
1K
Animate your story in your style with Veo 3. 🖌️ Here are some of our favorite videos. Sound on. 🔈 https://t.co/5wUMEaqNdD 🧵
58
143
754