
Moses Oh
@MosesOh4
Followers
82
Following
60
Media
3
Statuses
17
RT @axios: .@hume_ai's @gagnecr13 & @inafried test out Hume AI's EVI 3 AI model that's optimized to understand and express human emotions a….
0
11
0
We released our new conversational speech-llm. Fully multimodal (inputting/outputting text&audio tokens). Ability to adopt any voice, modulate speech, and generate language.
Meet EVI 3, another step toward general voice intelligence. EVI 3 is a speech-language model that can understand and generate any human voice, not just a handful of speakers. With this broader voice intelligence comes greater expressiveness and a deeper understanding of tune,
3
2
11
Literally blown away hearing our model. Amazing work from everyone at Hume. And extra kudos to the research team 🙂.
Today, we’re releasing Octave: the first LLM built for text-to-speech. 🎨Design any voice with a prompt.🎬 Give acting instructions to control emotion and delivery (sarcasm, whispering, etc.).🛠️Produce long-form content on our Creator Studio. Unlike traditional TTS that just
0
2
12
RT @josephoh0517: Grateful to train with awesome chiefs like @AF_Haddad. Come check us out at @NeurosurgUCSF ! #subi #neurosurgery
https://….
0
2
0
Excited to share what I've been working on! Over the past few months, we’ve been aligning language and speech both semantically and non-semantically in a fully trained multimodal model, while maintaining language capabilities.
Introducing OCTAVE, a next-generation speech-language model. OCTAVE has new emergent capabilities, like on-the-fly voice and personality creation and much more 👇
2
1
17
It was great running into old Magenta folks at Neurips! @jesseengel @jpgard @wilzh40 @douglas_eck @GoogleMagenta
0
2
15