@liambolling
Liam Bolling
2 months
@samratmansingh @Google it’s native (not text to speech, it’s audio being tokenized and sent to Gemini) but it currently focuses on speech
0
0
3

Replies

@liambolling
Liam Bolling
2 months
🎉 It’s a big day for @Google Gemini. Gemini 1.5 Pro now understands audio, uses unlimited files, acts on your commands, and lets devs build incredible things with JSON mode! It’s all 🆓. Here’s why it’s a big deal 👇 🔈 Gemini can hear Gemini understands audio (up to 9.5…
84
274
2K
@samratmansingh
Samrat Man Singh
2 months
@liambolling @Google This seems to be transcribing the audio and feeding it to the LLM, rather than the model itself being capable of understanding audio?
Tweet media one
1
0
1