Haotian Tang Profile
Haotian Tang

@haotiant1998

Followers
2K
Following
252
Media
4
Statuses
100

Research Scientist @Meta. Previously Gemini team @GoogleDeepMind, Ph.D. @MITEECS, B.Eng. @sjtu1896.

Joined September 2021
Don't wanna be here? Send us removal request.
@haotiant1998
Haotian Tang
10 months
Personal update: I am excited to share that I will join @GoogleDeepMind next week after defending my PhD thesis @MITEECS earlier last month. I will be working on generative models that simulate the physical world. Looking forward to the new journey ahead in 2025!
73
53
2K
@hanrui_w
Ryan Hanrui Wang
2 days
Explore Eigen Banana, out post trained image edit model with lightning fast speed! ⚡️
@Eigen_AI_Labs
Eigen AI
2 days
🚀 Releasing open-source Eigen-Banana-Qwen-Image-Edit: 4 seconds ⚡ instruction-based image edits trained on Pico-Banana-400K. Super fast with high image editing quality. Open-source LoRA for Diffusers/DiffSynth-Studio + enterprise stack (EigenTrain/Inference/Deploy). Feel free
0
2
14
@haotiant1998
Haotian Tang
5 months
👀
@Waymo
Waymo
5 months
New York, we're coming back to the Big Apple next month! 🍎🗽We want to serve New Yorkers in the future, and we’re working towards that goal. Here’s how:👇
0
0
3
@haotiant1998
Haotian Tang
5 months
Very cool research!
@tianyuanzhang99
Tianyuan Zhang
5 months
Bored of linear recurrent memories (e.g., linear attention) and want a scalable, nonlinear alternative? Our new paper “Test-Time Training Done Right” propose LaCT (Large Chunk Test-Time Training) — a highly efficient, massively scalable nonlinear memory with: 💡 Pure PyTorch
0
0
3
@haotiant1998
Haotian Tang
5 months
So excited to see Tong’s amazing work! Let’s gooooo 🚀
@GoogleDeepMind
Google DeepMind
5 months
Watch Gemini 2.5 Pro Deep Think tackle the challenging "catch a mole" problem from @Codeforces. 🪤 This new mode is based on our research in parallel thinking and considers multiple hypotheses before responding. See it in action ↓
0
0
3
@GoogleDeepMind
Google DeepMind
6 months
Deep Think in 2.5 Pro has landed. 🤯 It’s a new enhanced reasoning mode using our research in parallel thinking techniques - meaning it explores multiple hypotheses before responding. This enables it to handle incredibly complex math and coding problems more effectively.
72
422
4K
@_tim_brooks
Tim Brooks
6 months
Check out Veo 3 🔥🔥🔥 sound on 🔊
@Google
Google
6 months
Say goodbye to the silent era of video generation: Introducing Veo 3 — with native audio generation. 🗣️ Quality is up from Veo 2, and now you can add dialogue between characters, sound effects and background noise. Veo 3 is available now in the @GeminiApp for Google AI Ultra
6
7
157
@jalayrac
JB Alayrac
6 months
A lot of work went to make Gemini 2.5 SOTA at video understanding, check out this 🧵 for more details! Looking back at where we were a year ago, the progress really feels phenomenal! So many things to unlock and enable from video 🎥 and we are only getting started!
@AntoineYang2
Antoine Yang
6 months
Thrilled to share our latest advances in video understanding 📽️: Gemini 2.5 Pro is a truly magical model to play with, excelling in traditional video analysis and unlocking new use cases I could not imagine a few months ago🪄 More in 🧵 and @Google blog:
5
11
148
@OfficialLoganK
Logan Kilpatrick
6 months
Gemini 2.5 Pro (05-06) is SOTA at most video understanding tasks (by a large margin) 📽️. Lots of work by the Gemini multimodal team to make this happen, excited to see developers push this capability in new ways. More details below!
116
161
2K
@haotiant1998
Haotian Tang
6 months
♊️
@sundarpichai
Sundar Pichai
6 months
What a finish! Gemini 2.5 Pro just completed Pokémon Blue!  Special thanks to @TheCodeOfJoel for creating and running the livestream, and to everyone who cheered Gem on along the way.
0
0
4
@sundarpichai
Sundar Pichai
7 months
1/ Today, Veo 2, our state-of-the-art video model, is rolling out to Gemini Advanced + Whisk! You can create 8s, high-res videos from text prompts in @GeminiApp with fluid character movement + lifelike scenes across a range of styles. Tip: the more detailed your description, the
143
340
3K
@jack_w_rae
Jack Rae
7 months
2.5 Pro is the highest performing model for Aider Polyglot (real-world coding) and has a lower cost than the five next-best models. An amazing model for code 💎
@paulgauthier
Paul Gauthier
7 months
Gemini 2.5 Pro's leaderboard entry has been updated with costs, now that it available through a paid API. It cost $6 to run the aider polyglot coding benchmark on Gemini, lower than the top 10 other entries except for DeepSeek's models. https://t.co/mBVaUPGHPl
4
15
190
@JeffDean
Jeff Dean
7 months
In case it's not clear from @paulgauthier's chart below, the cost differences are quite large among the top 10 models on this benchmark, w/ some (lower quality) models being ~2X, ~3X or ~30X more expensive than the Gemini 2.5 Pro model (the website has a nice table, seen below).
@paulgauthier
Paul Gauthier
7 months
Gemini 2.5 Pro's leaderboard entry has been updated with costs, now that it available through a paid API. It cost $6 to run the aider polyglot coding benchmark on Gemini, lower than the top 10 other entries except for DeepSeek's models. https://t.co/mBVaUPGHPl
43
113
971
@haotiant1998
Haotian Tang
7 months
What an amazing chip! Cannot wait to try it out
@OfficialLoganK
Logan Kilpatrick
7 months
Introducing Ironwood, the first TPU built for the age of inference, and the timing could not be better : ) - Ironwood perf/watt is 2x relative to Trillium, 6th gen TPU - Ironwood offers 192 GB per chip, 6x that of Trillium - 4.5x faster data access https://t.co/doEUgLLgRf
1
0
1
@OfficialLoganK
Logan Kilpatrick
7 months
Deep Research in the Gemini App is now powered by Gemini 2.5 Pro, and our early tests show users prefer this 2:1 vs “other products” ;) https://t.co/O3Nv1uXPnK
Tweet card summary image
gemini.google.com
Meet Gemini, Google’s AI assistant. Get help with writing, planning, brainstorming, and more. Experience the power of generative AI.
201
199
3K
@GoogleDeepMind
Google DeepMind
7 months
Think you know Gemini? 🤔 Think again. Meet Gemini 2.5: our most intelligent model 💡 The first release is Pro Experimental, which is state-of-the-art across many benchmarks - meaning it can handle complex problems and give more accurate responses. Try it now →
91
520
3K
@_tim_brooks
Tim Brooks
9 months
Try out Veo 2 in YouTube! Congrats Veo team 🎉
@GoogleDeepMind
Google DeepMind
9 months
🎥 Our state-of-the-art video generation model Veo 2 is now available in @YouTube Shorts. With the Dream Screen feature, creators can: ✨ Produce new clips that fit seamlessly into their storytelling with a quick text prompt ✨ Use it to make backgrounds for their videos.
6
7
109
@arena
lmarena.ai
9 months
Breaking news from Text-to-Image Arena! 🖼️✨ @GoogleDeepMind’s Imagen 3 debuts at #1, surpassing Recraft-v3 with a remarkable +70-point lead! Congrats to the Google Imagen team for setting a new bar! Try the best text2image at LMArena and cast your vote! More analysis👇
47
141
857
@haotiant1998
Haotian Tang
9 months
What an achievement! Congrats to the team!
@demishassabis
Demis Hassabis
10 months
Our latest update to our Gemini 2.0 Flash Thinking model (available here: https://t.co/Rr9DvqbUdO) scores 73.3% on AIME (math) & 74.2% on GPQA Diamond (science) benchmarks. Thanks for all your feedback, this represents super fast progress from our first release just this past
0
0
4
@_tim_brooks
Tim Brooks
10 months
DeepMind has ambitious plans to make massive generative models that simulate the world. I'm hiring for a new team with this mission. Come build with us! https://t.co/pqvALtAvLs
90
217
2K
@xieenze_jr
Enze Xie
10 months
Thrilled that my citations hit 20,000 on the last day of 2024! Was just 19,980+ yesterday - what a lovely surprise! This year brought changes: first job switch, moved from HK to US, and did amazing projects - e.g. SANA - with interns. Here's to 2025!
7
3
147