OriolVinyalsML Profile Banner
Oriol Vinyals Profile
Oriol Vinyals

@OriolVinyalsML

Followers
180K
Following
1K
Media
240
Statuses
1K

VP of Research & Deep Learning Lead, Google DeepMind. Gemini co-lead. Past: AlphaStar, AlphaFold, AlphaCode, WaveNet, seq2seq, distillation, TF.

London, England
Joined October 2015
Don't wanna be here? Send us removal request.
@OriolVinyalsML
Oriol Vinyals
2 months
Ahead of I/O, weโ€™re releasing an updated Gemini 2.5 Pro! Itโ€™s now #1 on WebDevArena leaderboard, breaking the 1400 ELO barrier! ๐Ÿฅ‡. Our most advanced coding model yet, with stronger performance on code transformation & editing. Excited to build drastic agents on top of this!
Tweet media one
35
63
760
@OriolVinyalsML
Oriol Vinyals
10 days
good trajectory++ โญ๏ธ๐Ÿฟ
Tweet media one
@demishassabis
Demis Hassabis
10 days
good trajectory. .
5
22
260
@OriolVinyalsML
Oriol Vinyals
17 days
"A Neural Conversational Model" is 10 years old, w/ @quocleix . TL;DR you can train a chatbot with a large neural network (~500M params!). Samples ๐Ÿ‘‡. This paper was received with mixed reviews, but I'm glad all the critics are now riding the LLM wave ๐ŸŒŠ
Tweet media one
8
11
152
@OriolVinyalsML
Oriol Vinyals
20 days
Hello Gemini 2.5 Flash-Lite! So fast, it codes *each screen* on the fly (Neural OS concept ๐Ÿ‘‡). The frontier isn't always about large models and beating benchmarks. In this case, a super fast & good model can unlock drastic use cases. Read more:
57
271
2K
@OriolVinyalsML
Oriol Vinyals
1 month
Introducing the new Gemini 2.5 Pro preview, with a +24 LMArena Elo score over its predecessor! ๐Ÿ“ˆ It leads on tough coding (AIME, AIDER), science (GPQA), and reasoning (HLE). Style/structure have improved thanks to your feedback. Learn more:.
Tweet media one
5
25
292
@OriolVinyalsML
Oriol Vinyals
1 month
Cool benchmark! Good to see Gemini 2.5 topping another leaderboard, though we are quite far from the summit ๐Ÿ˜…๐Ÿชœ๐Ÿ”๏ธ.
@a1zhang
Alex Zhang
1 month
Can GPT, Claude, and Gemini play video games like Zelda, Civ, and Doom II?. ๐—ฉ๐—ถ๐—ฑ๐—ฒ๐—ผ๐—š๐—ฎ๐—บ๐—ฒ๐—•๐—ฒ๐—ป๐—ฐ๐—ต evaluates VLMs on Game Boy & MS-DOS games given only raw screen input, just like how a human would play. The best model (Gemini) completes just 0.48% of the benchmark!. ๐Ÿงต๐Ÿ‘‡
3
3
96
@OriolVinyalsML
Oriol Vinyals
2 months
RT @HashemGhaili: Prompt Theory (Made with Veo 3). What if AI-generated characters refused to believe they were AI-generated? https://t.co/โ€ฆ.
0
4K
0
@OriolVinyalsML
Oriol Vinyals
2 months
RT @lmthang: As highlighted yesterday at #GoogleIO, Gemini 2.5 Pro with #DeepThink mode achieved a new state-of-the-art performance of 49.4โ€ฆ.
0
19
0
@OriolVinyalsML
Oriol Vinyals
2 months
Because we're defining the frontier with Deep Think, we're taking extra care. This includes first getting input from safety experts and trusted testers๐Ÿ›ก๏ธ via the Gemini API to gather crucial feedback before wider release.
3
1
24
@OriolVinyalsML
Oriol Vinyals
2 months
Deep Think uses new techniques, allowing the model to consider multiple hypotheses before responding. It's not just about math; it also gets an impressive score on LiveCodeBench ๐Ÿ’ป, MMMU & more!. Learn more:
2
1
42
@OriolVinyalsML
Oriol Vinyals
2 months
Yesterday at #GoogleIO, we introduced Gemini 2.5 Pro Deep Think ๐Ÿง , pushing the frontiers of AI reasoning. This enhanced reasoning mode is built to tackle drastically complex problems โ€“ like USAMO problems that stumped previous models. Super proud of the GDM team for this one!
31
72
639
@OriolVinyalsML
Oriol Vinyals
2 months
Ultimate AGI benchmark: humor!.
@fofrAI
fofr
2 months
NO WAY. It did it. And, was that, actually funny?. Prompt:.> a man doing stand up comedy in a small venue tells a joke (include the joke in the dialogue)
1
1
17
@OriolVinyalsML
Oriol Vinyals
2 months
Spaghetti benchmark.
@GeminiApp
Google Gemini App
2 months
Bon appรฉtit ๐Ÿ
3
0
21
@OriolVinyalsML
Oriol Vinyals
2 months
Veo2 was already SOTA, but Veo3 is something else! Been having a blast playing with it, and loving all the creativity now that it's out (more in ๐Ÿงต). Me walking to I/O thinking about Gemini3, Veo4, .
18
17
316
@OriolVinyalsML
Oriol Vinyals
2 months
Today we introduced Gemini Diffusionโšก๏ธ (& DeepThink, Veo3, Imagen4, 2.5 updates. ). It's been a dream of mine to remove the need for "left to right" text generation. It's so fast, that we had to *slow down* the video during the presentation.
33
103
786
@OriolVinyalsML
Oriol Vinyals
2 months
We may be delaying AGI if we keep playing this game, but OK! I used Copy Canvas and here is my addition to the Cartographer. Coding this in two sentences. drastic & wild times! @NoamShazeer I nominate you : ).
@JeffDean
Jeff Dean
2 months
This is cool! . You can click "Copy Canvas" in the lower right (after logging in) and enable voice activation. (volume up for video below). @OriolVinyalsML tagging you!. Thank you @davemesserx!
6
11
114
@OriolVinyalsML
Oriol Vinyals
2 months
With this update, you can create even more complex web apps from a single prompt. See a side by side (new vs old 2.5 Pro) on building interactive learning apps from YouTube videos. Try the new Gemini 2.5 Pro in AI Studio, Vertex AI, Gemini app & Canvas.
1
1
64
@OriolVinyalsML
Oriol Vinyals
2 months
For some benchmarks & analysis:
Tweet media one
@DillonUzar
Dillon Uzar
3 months
Another update - Ran Gemini 2.5 Flash (Auto Thinking and Non-Thinking). See the comparison below to other thinking models. Interesting curve for Gemini 2.5 Flash Non-Thinking! Meanwhile Gemini 2.5 Flash Thinking (Auto) matches Gemini 2.5 Pro!. I'm still working on o3 access and
Tweet media one
Tweet media two
Tweet media three
Tweet media four
2
3
20
@OriolVinyalsML
Oriol Vinyals
2 months
It's not only about how long your context is, but how well you use it. Great to see Gemini 2.5 models dominating MRCR and other benchmarks on long context!. See 2.5 Pro tackle a complex coding task by reasoning over an entire repo (>500k tokens). Performance and effective use of
15
25
293
@OriolVinyalsML
Oriol Vinyals
3 months
We are blown away by the reception of Gemini 2.5 Pro, and the many positive surprises from benchmarks and community evaluations. That is the ultimate leaderboard!.
@Alber_RomGar
Alberto Romero
3 months
Breaking news: Google is winning on every AI front. This is not just about Gemini 2.5 but about a reality that OpenAI and Anthropic fans have ignored for too long. Here's a non-exhaustive list:. - Gemini 2.5 Pro is the best model in the world according to benchmarks, vibe.
31
33
549