Antoine Miech @antoine77340 X Profile

Antoine Miech

@antoine77340

Followers

1K

Following

1K

Media

27

Statuses

237

Ornithologist @GoogleDeepMind 🦩, Gemini Multimodal

https://t.co/7ueyCZlBjN

Joined June 2010

Don't wanna be here? Send us removal request.

Google DeepMind

@GoogleDeepMind

4 months

What if you could not only watch a generated video, but explore it too? 🌐 Genie 3 is our groundbreaking world model that creates interactive, playable environments from a single text prompt. From photorealistic landscapes to fantasy realms, the possibilities are endless. 🧵

828

3K

14K

Olivia Moore

@omooretweets

7 months

How does Google's new agentic browser (Project Mariner) compare with ChatGPT Operator? I tested them head-to-head, using both platform's suggested prompts (to make it fair!) 👇

24

160

2K

MBZ

@babaeizadeh

7 months

#Veo3 further blurs the lines between reality and imagination with audio, stronger text adherence, and richer visual details.

58

184

1K

Marques Brownlee

@MKBHD

7 months

We're barely 2 years from Will Smith eating spaghetti...

Google

@Google

7 months

Say goodbye to the silent era of video generation: Introducing Veo 3 — with native audio generation. 🗣️ Quality is up from Veo 2, and now you can add dialogue between characters, sound effects and background noise. Veo 3 is available now in the @GeminiApp for Google AI Ultra

411

4K

92K

Antoine Yang

@AntoineYang2

7 months

Thrilled to share our latest advances in video understanding 📽️: Gemini 2.5 Pro is a truly magical model to play with, excelling in traditional video analysis and unlocking new use cases I could not imagine a few months ago🪄 More in 🧵 and @Google blog:

developers.googleblog.com

11

51

381

Antoine Miech

@antoine77340

9 months

😵😵

nicholas

@nicdunz

9 months

this is so fucking crazy

0

2

Logan Kilpatrick

@OfficialLoganK

9 months

Introducing YouTube video 🎥 link support in Google AI Studio and the Gemini API. You can now directly pass in a YouTube video and the model can usage its native video understanding capabilities to use that, with just a link! 🚢

291

374

3K

Antoine Yang

@AntoineYang2

9 months

You can now paste YouTube links *directly* to use Gemini audio-video understanding on https://t.co/8ZC9dwbeXC 😀

4

13

155

Robert Riachi

@robertriachi

9 months

some cool examples with Gemini 2.0 native image output 🧵

66

189

4K

Aishwarya Kamath

@ashkamath20

9 months

Super excited to announce what I’ve been working on for the past few months 💃 GEMMA 3 is out today! It supports 140+ languages, has a context length of 128k tokens and the best part? It’s natively multimodal! 📸

10

26

347

lmarena.ai

@arena

10 months

Introducing Arena-Price Plot! 💰📊 An interactive plot of price vs. performance trade-offs for LLMs. Frontier efficiency models: 🔹 Gemini-2.0-Flash/Lite by @GoogleDeepMind 🔹 DeepSeek-R1 by @deepseek_ai 🔹 GPT-4o by @OpenAI 🔹 Yi-Lightning by @01AI_Yi 🔹 Ministral 8B by

62

137

795

Google DeepMind

@GoogleDeepMind

1 year

Today, we’re announcing Veo 2: our state-of-the-art video generation model which produces realistic, high-quality clips from text or image prompts. 🎥 We’re also releasing an improved version of our text-to-image model, Imagen 3 - available to use in ImageFX through

268

1K

7K

Mostafa Dehghani

@m__dehghani

1 year

Gemini2 Flash on the challenge of what the internet has been asking for: breaking down "draw the rest of the owl" into actual steps with interleaved generation. not perfect yet, but it’s on the edge of something super cool...

18

66

508

Antoine Miech

@antoine77340

1 year

Gemini can now browse the web as you would do on a web browser!

Sundar Pichai

@sundarpichai

1 year

We are investing in the frontiers of agentic capabilities with a few early prototypes. Project Mariner is built with Gemini 2.0 and is able to understand and reason across information - pixels, text, code, images + forms - on your browser screen, and then uses that info to

0

5

26

Google DeepMind

@GoogleDeepMind

1 year

Welcome to the world, Gemini 2.0 ✨ our most capable AI model yet. We're first releasing an experimental version of 2.0 Flash ⚡ It has better performance, new multimodal output, @Google tool use - and paves the way for new agentic experiences. 🧵 https://t.co/ywY2oZv76p

73

424

2K

Logan Kilpatrick

@OfficialLoganK

1 year

7 examples of Gemini's multimodal capabilities in action (with code and prompts) 🤯🧵

28

42

569

lmarena.ai

@arena

1 year

Woah, huge news again from Chatbot Arena🔥 @GoogleDeepMind’s just released Gemini (Exp 1121) is back stronger (+20 points), tied #1🏅Overall with the latest GPT-4o-1120 in Arena! Ranking gains since Gemini-Exp-1114: - Overall #3 → #1 - Overall (StyleCtrl): #5 -> #2 - Hard

Logan Kilpatrick

@OfficialLoganK

1 year

Say hello to gemini-exp-1121! Our latest experimental gemini model, with: - significant gains on coding performance - stronger reasoning capabilities - improved visual understanding Available on Google AI Studio and the Gemini API right now:

49

149

949

Antoine Miech

@antoine77340

1 year

LMSYS now has an image understanding track, and it is pretty addictive!

lmarena.ai

@arena

1 year

Exciting news - Chatbot Arena now supports image uploads📸 Challenge GPT-4o, Gemini, Claude, and LLaVA with your toughest questions. Plot to code, VQA, story telling, you name it. Let's get creative and have fun! Leaderboard coming soon. Credits to builders @chrischou03

0

2

17

Alexander Chen

@alexanderchen

1 year

Ooh! 🎉 @tldraw got spatialized math reasoning working using Gemini 1.5 Flash. ✏️ ➗

tldraw

@tldraw

1 year

finally we can do maths

0

3

10

Lei Li

@_TobiasLee

2 years

Updated results: Gemini 1.5 Flash is rocking, outperforming GPT-4o as well! 🤘

Antoine Miech

@antoine77340

2 years

@_TobiasLee Thank you for releasing such a high quality benchmark, they tend to be quite rare for long form video understanding! Would it be possible to add Gemini 1.5 Flash to the leaderboard as well? 🙏 it is faster and cheaper to run than 1.5 Pro 😊

2

14

111