
coval
@covaldev
Followers
693
Following
47
Media
26
Statuses
75
Simulation and evaluation for AI agents (chat & voice agents)
San Francisco
Joined August 2024
🎉 Exciting News! Please join us in welcoming Rob Young to the Coval team! Before joining Coval, Rob designed products at Google and Apple, and founded and led the product design studio OM. He brings a wealth of experience crafting elegant, user-centered experiences at some of
0
0
8
🧵 This week in conversational AI: 🎤 @OpenAI Realtime takes a big step forward — sub-500ms latency, sharper function calling, non-verbal cue detection, seamless language switching, and SIP (telephony) support. Already benchmarked at https://t.co/VmtL9OxB1Q, and the gains on
1
0
3
🎉 Exciting News! Please join us in welcoming Loren Phillips to the Coval team! Before joining Coval, Loren worked on autonomous vehicle safety validation at Zoox, Amazon’s self-driving subsidiary. He also brings a diverse research background spanning nanophotonics, soft
0
0
2
🎙️ Testing non-deterministic AI requires a new approach. Forget exact outputs; it's about probabilistic success. @bnicholehopkins of @covaldev breaks down the shift on the latest Generationship episode. Tune in! 🎧 https://t.co/s9xQOpKsXh
1
2
3
don't miss that OAI also published a prompting guide WITH RECEIPTS for GPT 4.1 specifically for those building agents... with a new recommendation for: - telling the model to be persistent (+20%) - dont self-inject/parse toolcalls (+2%) - prompted planning (+4%) - JSON BAD - use
o1 is mind-blowing when you know how to use it. it's really not a chat model -- you have to think of it more like a "report generator" (link to article below)
36
166
2K
🎉 Exciting News! Please join us in welcoming Kobi Hudson to the Coval team! Prior to Coval, Kobi spent 8 years at Waymo working alongside our own Brooke, where he played a pivotal role in building Waymo's early simulation infrastructure - some of the foundational systems
1
0
5
very detailed and fair evals and impressions on the new STT+TTS models. thx @covaldev
🎙 OpenAI’s dropped new voice AI Models- get Coval's scoop and see the benchmark results 🫢 @OpenAI just dropped a new text-to-speech model, We’ve been testing gpt-4o-mini-tts and the prosody, pronunciation, and controllability are next-level. 💭 Prosody that feels real: The
1
1
14
The folks at @covaldev did a deep dive with our audio models, some great examples of pronunciation with difficult words
🎙 OpenAI’s dropped new voice AI Models- get Coval's scoop and see the benchmark results 🫢 @OpenAI just dropped a new text-to-speech model, We’ve been testing gpt-4o-mini-tts and the prosody, pronunciation, and controllability are next-level. 💭 Prosody that feels real: The
1
1
7
🎙 OpenAI’s dropped new voice AI Models- get Coval's scoop and see the benchmark results 🫢 @OpenAI just dropped a new text-to-speech model, We’ve been testing gpt-4o-mini-tts and the prosody, pronunciation, and controllability are next-level. 💭 Prosody that feels real: The
1
4
41
Coval + @rimelabs: Bringing Lifelike Voices to AI Simulation Excited to announce our new integration with Rime! 🎉 Coval's mission is to provide the most comprehensive voice simulation platform for testing AI agents, and today we're adding Rime's incredibly lifelike voices to
3
3
12
@cartesia_ai's latest model Sonic 2.0 just dropped and the @covaldev team couldn't wait to try it out, so we already ran some benchmarks. Some of our takeaways: ⚡ Sonic 2.0 is lightning fast 🔢 Cartesia has gotten way better at alphanumeric sequences ($52,000, joe@gmail.com, )
0
5
21
🔒 Proud to announce that @coval is now HIPAA compliant! Healthcare stands at the forefront of the voice AI revolution. From improving patient care to streamlining clinical workflows, voice AI is fundamentally reshaping the healthcare industry. This makes security and compliance
0
2
7
When you want to launch your new voice agent but you haven’t run any evals yet...
1
1
4
NYT article: https://t.co/KD43Ki4U9o The Brutalist voice post-production Controversy: https://t.co/Wn03RRAhym
theconversation.com
The technology can deceive so easily that public confidence in generative AI is only possible with full disclosure.
0
0
3
🧵 This week in conversational AI 🧵 • Conversational AI in healthcare is set for massive growth, according to this week’s report by Coherent Market Insights! Exciting to see use cases booming across the market, from to Assort Health, Counsel Health and more.careCycle (YC W25)
1
0
2
🚀 Optimizing Voice AI with Coval + Retell AI 🚀 Are you building your voice agents with @retellai? Then you'll be excited about this partnership! Coval & Retell AI join forces to enable infrastructure for end-to-end voice agent reliability. Why should you use Coval for testing
1
2
8
Think Voice AI Is Just For Sales? Think Again. Voice AI is so much more. And can have so much more impact on our population. We keep coming across how research institutes publish studies proving the effectiveness of voice AI for caregiving and medical support, particularly in
1
0
3
Butterfly effects cripple AI agents. But there's an art to failure. Multi-step evaluations for conversational agents are fundamentally different from classic single-call evaluations. When a voice agent needs to perform multiple tool calls without human intervention, one small
0
0
2