Ankur Bapna Profile
Ankur Bapna

@ankurbpn

Followers
1K
Following
3K
Media
27
Statuses
527

Conversational Superintelligence @Meta Ex - Led Native Audio Generation Gemini 2.5, Native Audio Gemini 1.5

Joined February 2014
Don't wanna be here? Send us removal request.
@ankurbpn
Ankur Bapna
16 hours
RT @OfficialLoganK: Gemini's native text to speech (TTS) capabilities are available for scaled production use 🗣️! Both 2.5 Flash and 2.5 Pr….
0
99
0
@ankurbpn
Ankur Bapna
1 month
RT @simonw: Notes on trying out the new Gemini 2.5 models. gemini-2.5-flash-lite-preview-06-17 is cheap but got stuck on an audio transcrip….
Tweet card summary image
simonwillison.net
After many months of previews, Gemini 2.5 Pro and Flash have reached general availability with new, memorable model IDs: gemini-2.5-pro and gemini-2.5-flash. They are joined by a new preview model …
0
27
0
@ankurbpn
Ankur Bapna
1 month
RT @lovable_dev: Google Gemini: Talki – An AI language learning app that helps you learn a language where you can talk with an AI in differ….
0
20
0
@ankurbpn
Ankur Bapna
1 month
RT @arvind_io: amazing multimodality performance (& more) !!.
Tweet media one
0
3
0
@ankurbpn
Ankur Bapna
1 month
Our native audio dialog api is now in public preview - do try it out!!.
@GoogleCloudTech
Google Cloud Tech
1 month
New AI updates → ✨ Gemini 2.5 Flash and 2.5 Pro are now generally available.✨ Supervised Fine-Tuning for Gemini 2.5 Flash is generally available.✨ Gemini 2.5 Flash-Lite in public preview.✨ Updated Live API with native audio in public preview
Tweet media one
0
2
6
@ankurbpn
Ankur Bapna
1 month
RT @Shai_Alon: 🤯My latest AI video just racked up thousands of views on Reddit in under 24 hours! Listen with Volume 🔊. It's a 2-min docume….
0
6
0
@ankurbpn
Ankur Bapna
1 month
RT @svlevine: I always found it puzzling how language models learn so much from next-token prediction, while video models learn so little f….
0
176
0
@ankurbpn
Ankur Bapna
1 month
RT @not_hanjo_mei: Powered by Gemini 2.5 Pro (both LLM and TTS).
0
2
0
@ankurbpn
Ankur Bapna
1 month
RT @DynamicWebPaige: Gemini can be used natively for:. 🖼️ image understanding.📸 image editing.📽️ video understanding.🗣️ speech-to-text.💬 te….
0
23
0
@ankurbpn
Ankur Bapna
2 months
RT @DynamicWebPaige: 👋 Wanted to make sure to share, in case y'all missed it: we recently launched Gemini 2.5 Flash text-to-speech, and it'….
0
13
0
@ankurbpn
Ankur Bapna
2 months
RT @sundarpichai: Our latest Gemini 2.5 Pro update is now in preview. It’s better at coding, reasoning, science + math, shows improved per….
0
753
0
@ankurbpn
Ankur Bapna
2 months
RT @Google: Here’s a closer look at what developers can do with Gemini 2.5 native audio capabilities.
Tweet card summary image
blog.google
Gemini 2.5 has new capabilities in AI-powered audio dialog and generation.
0
27
0
@ankurbpn
Ankur Bapna
2 months
RT @GoogleDeepMind: Our native audio capabilities are making AI conversations more natural – from understanding tone to generating expressi….
0
168
0
@ankurbpn
Ankur Bapna
2 months
RT @Google: New native audio capabilities in Gemini 2.5 enable text-to-speech in over 24 languages. 🔊Voices are more natural and expressive….
0
204
0
@ankurbpn
Ankur Bapna
2 months
RT @ai_for_success: Native audio in Google AI Studio is really underrated 🔥🔥.
0
37
0
@ankurbpn
Ankur Bapna
2 months
RT @googleaidevs: 🔊Native audio outputs in Gemini 2.5 give developers new ways to build richer applications with conversation and speech. ↓….
Tweet card summary image
blog.google
Gemini 2.5 has new capabilities in AI-powered audio dialog and generation.
0
117
0
@ankurbpn
Ankur Bapna
2 months
On native audio dialogue, do try out the affective dialogue mode for our most expressive models, and my personal favorite dialog-with-thinking with google search enabled for the most intelligent native audio dialog agent out there :).
0
0
1
@ankurbpn
Ankur Bapna
2 months
I also enjoyed this video exploring emergent capabilities of our TTS models in the wild - a lot more steerable than any other options out there, vocal bursts, sound effects:
1
0
1
@ankurbpn
Ankur Bapna
2 months
Few reasons why you should try out our native audio dialog and steerable TTS models today ⬇️.
@googleaidevs
Google AI Developers
2 months
🔊Native audio outputs in Gemini 2.5 give developers new ways to build richer applications with conversation and speech. ↓.
3
3
10
@ankurbpn
Ankur Bapna
2 months
RT @DynamicWebPaige: 🐤 This commercial is using Veo 3 to generate the visuals, Gemini text-to-speech for the voiceover, and MusicFX for th….
0
2
0