Hernan Moraldo
@hhm
Followers
1K
Following
11K
Media
88
Statuses
3K
Google DeepMind. Veo 3, Veo 2, Veo 1, Phenaki, and more.
California, USA
Joined December 2007
Proud to be part of the incredibly talented team that delivered Veo 3! State of the art video and sound generation, including music, dialogue, and more. You can already use it in Gemini, Flow, and Cloud Vertex API
Video, meet audio. With Veo 3, our new state-of-the-art generative video model, you can add soundtracks to clips you make. Create talking characters, include sound effects, and more while developing videos in a range of cinematic styles.
2
0
23
Google $GOOGL CEO Sundar Pichai just posted this: "Gemini 3 Flash is our latest model with frontier intelligence built for lightning speed, and pushing the Pareto Frontier of performance and efficiency. It outperforms 2.5 Pro while being 3x faster at a fraction of the cost."
23
48
541
Congratulations to 12-year-old Faustino Oro on scoring his 2nd GM norm! He still has 3 months in which to become the youngest grandmaster in chess history: https://t.co/Ikmze9Jnrt
185
1K
16K
So excited to finally talk about this work! Veo is a surprisingly strong world simulator. We fine-tuned Veo on action-conditioned, multi-view robotics data. Key result: running a policy in the world model is strongly correlated with real-world results. A few important
Generalist robots need a generalist evaluator. But how do you test safety without breaking things? Introducing our new work from @GoogleDeepMind: Evaluating Gemini Robotics Policies in a Veo World Simulator https://t.co/ZjvpYXFddZ
15
32
226
Fantastic result, congratulations!!!
Poetiq has officially shattered the ARC-AGI-2 SOTA. @arcprize has officially verified our results: - 54% Accuracy, first to break the 50% barrier! - $30.57 / problem, less than half the cost of the previous best! We are now #1 on the leaderboard for ARC-AGI-2!
0
0
3
Is more intelligence always more expensive? Not necessarily. Introducing Poetiq. We've established a new SOTA and Pareto frontier on @arcprize using Gemini 3 and GPT-5.1.
59
116
950
For example I've been doing a bunch of late night vibe coding with Gemini 3 in @GoogleAIStudio, and it's so much fun! I recreated a testbed of my game Theme Park that I programmed in the 90s in a matter of hours, down to letting players adjust the amount of salt on the chips!
66
70
1K
It's nearly 3 here, my favourite part of the night shift… locked in...
318
332
7K
Super excited to announce SIMA 2! It's a general agent that can understand & reason about complex instructions and complete tasks in simulated game worlds, even ones it has never seen before. Incredible to see how it can learn just from self-play… a crucial step towards AGI
84
242
2K
Great to see the major jump in quality between our Veo 3.0 and Veo 3.1 models. High quality video generation will unlock all kinds of creative uses!
Big news from Video Arena! @GoogleDeepMind's latest Veo 3.1 now ranks #1 in both Text-to-Video and Image-to-Video leaderboards. This is a +30-point leap from Veo 3.0 → 3.1, making it the first model to break 1400 in Video Arena history! Huge congrats to the
14
27
339
Awesome to see Veo 3.1 top the LMArena video leaderboards by a large distance with big improvements over Veo 3.0 for text-to-video (+30) and image-to-video (+70)! Huge congrats to the team! Try it for yourself in https://t.co/QgTpxTL8DQ and the @GeminiApp
Big news from Video Arena! @GoogleDeepMind's latest Veo 3.1 now ranks #1 in both Text-to-Video and Image-to-Video leaderboards. This is a +30-point leap from Veo 3.0 → 3.1, making it the first model to break 1400 in Video Arena history! Huge congrats to the
50
113
1K
Big news from Video Arena! @GoogleDeepMind's latest Veo 3.1 now ranks #1 in both Text-to-Video and Image-to-Video leaderboards. This is a +30-point leap from Veo 3.0 → 3.1, making it the first model to break 1400 in Video Arena history! Huge congrats to the
Veo is getting a major upgrade. We're rolling out Veo 3.1, our updated video generation model, alongside improved creative controls for filmmakers, storytellers, and developers - many of them with audio.
29
75
566
If you've considered switching from ChatGPT to pen & paper to save water, think again. And don't even think about wearing pants.
1
2
6
Veo 3 is the state of the art in video models. Veo 3.1 is our new big upgrade with enhanced realism, richer audio, scene extension, better narrative control, more precise editing capabilities & much more. Enjoy creating with it at https://t.co/QgTpxTKAOi and in the @GeminiApp!
Veo is getting a major upgrade. We're rolling out Veo 3.1, our updated video generation model, alongside improved creative controls for filmmakers, storytellers, and developers - many of them with audio.
67
223
1K
Veo is getting a major upgrade. We're rolling out Veo 3.1, our updated video generation model, alongside improved creative controls for filmmakers, storytellers, and developers - many of them with audio.
122
426
2K
Congrats to the Sora 2 team for a great model. Also nice to see Veo 3 holds up against the competition 5 months after our release.
Video Arena Disrupted! @Openai's Sora 2 and Sora 2 Pro have landed on the Text-to-Video leaderboard. Sora 2 Pro is the first to tie rank with Veo 3 variants for #1. Sora 2 comes in at #3, pushing the non-audio variants of Veo 3 into 5th! Video models with audio
4
10
153
We're proud to announce that Genie 3 has been named one of @TIME's Best Inventions of 2025. Genie 3 is our groundbreaking world model capable of generating interactive, playable environments from text or image prompts. Find out more: https://t.co/bv1gZaWYtd
101
272
2K
Quite an a-maze-ing discovery: Veo 3 demonstrates emergent visual reasoning, like finding the path to the cheese.
4
9
57
Check out @PaulVicol's thread for a lot more details on the "Video models are zero-shot learners and reasoners" paper!
Veo 3 has emergent zero-shot learning and reasoning capabilities! This multitalented model can do a huge range of interesting tasks. It understands physical properties, can manipulate objects, and can even reason. Check out more examples in this thread!
0
1
18
Video models can reason! This is a fantastic paper, really impressive results. Congratulations to @thwiedemer, Yuxuan Li, @PaulVicol, @shaneguML, @nmatares, @kswersk, @_beenkim, @priyankjaini & Robert Geirhos
Veo is a more general reasoner than you might think. Check out this super cool paper on "Video models are zero-shot learners and reasoners" from my colleagues at @GoogleDeepMind.
0
0
9