coval @covaldev X Profile

coval

@covaldev

Followers

693

Following

47

Media

26

Statuses

75

Simulation and evaluation for AI agents (chat & voice agents)

San Francisco

Joined August 2024

Don't wanna be here? Send us removal request.

coval

@covaldev

10 days

🎉 Exciting News! Please join us in welcoming Rob Young to the Coval team! Before joining Coval, Rob designed products at Google and Apple, and founded and led the product design studio OM. He brings a wealth of experience crafting elegant, user-centered experiences at some of

0

8

coval

@covaldev

2 months

🧵 This week in conversational AI: 🎤 @OpenAI Realtime takes a big step forward — sub-500ms latency, sharper function calling, non-verbal cue detection, seamless language switching, and SIP (telephony) support. Already benchmarked at https://t.co/VmtL9OxB1Q, and the gains on

1

0

3

coval

@covaldev

3 months

🎉 Exciting News! Please join us in welcoming Loren Phillips to the Coval team! Before joining Coval, Loren worked on autonomous vehicle safety validation at Zoox, Amazon’s self-driving subsidiary. He also brings a diverse research background spanning nanophotonics, soft

0

2

Heavybit

@heavybit

5 months

🎙️ Testing non-deterministic AI requires a new approach. Forget exact outputs; it's about probabilistic success. @bnicholehopkins of @covaldev breaks down the shift on the latest Generationship episode. Tune in! 🎧 https://t.co/s9xQOpKsXh

1

2

3

swyx

@swyx

6 months

don't miss that OAI also published a prompting guide WITH RECEIPTS for GPT 4.1 specifically for those building agents... with a new recommendation for: - telling the model to be persistent (+20%) - dont self-inject/parse toolcalls (+2%) - prompted planning (+4%) - JSON BAD - use

ben

@benhylak

9 months

o1 is mind-blowing when you know how to use it. it's really not a chat model -- you have to think of it more like a "report generator" (link to article below)

36

166

2K

coval

@covaldev

6 months

🎉 Exciting News! Please join us in welcoming Kobi Hudson to the Coval team! Prior to Coval, Kobi spent 8 years at Waymo working alongside our own Brooke, where he played a pivotal role in building Waymo's early simulation infrastructure - some of the foundational systems

1

0

5

Jeff Harris

@jeffintime

7 months

very detailed and fair evals and impressions on the new STT+TTS models. thx @covaldev

Brooke Hopkins

@bnicholehopkins

7 months

🎙 OpenAI’s dropped new voice AI Models- get Coval's scoop and see the benchmark results 🫢 @OpenAI just dropped a new text-to-speech model, We’ve been testing gpt-4o-mini-tts and the prosody, pronunciation, and controllability are next-level. 💭 Prosody that feels real: The

1

14

Peter Bakkum

@pbbakkum

7 months

The folks at @covaldev did a deep dive with our audio models, some great examples of pronunciation with difficult words

Brooke Hopkins

@bnicholehopkins

7 months

🎙 OpenAI’s dropped new voice AI Models- get Coval's scoop and see the benchmark results 🫢 @OpenAI just dropped a new text-to-speech model, We’ve been testing gpt-4o-mini-tts and the prosody, pronunciation, and controllability are next-level. 💭 Prosody that feels real: The

1

7

Brooke Hopkins

@bnicholehopkins

7 months

🎙 OpenAI’s dropped new voice AI Models- get Coval's scoop and see the benchmark results 🫢 @OpenAI just dropped a new text-to-speech model, We’ve been testing gpt-4o-mini-tts and the prosody, pronunciation, and controllability are next-level. 💭 Prosody that feels real: The

1

4

41

coval

@covaldev

7 months

Coval + @rimelabs: Bringing Lifelike Voices to AI Simulation Excited to announce our new integration with Rime! 🎉 Coval's mission is to provide the most comprehensive voice simulation platform for testing AI agents, and today we're adding Rime's incredibly lifelike voices to

3

12

Brooke Hopkins

@bnicholehopkins

7 months

@cartesia_ai's latest model Sonic 2.0 just dropped and the @covaldev team couldn't wait to try it out, so we already ran some benchmarks. Some of our takeaways: ⚡ Sonic 2.0 is lightning fast 🔢 Cartesia has gotten way better at alphanumeric sequences ($52,000, joe@gmail.com, )

0

5

21

coval

@covaldev

8 months

🎉 Excited to announce that @covaldev and @langfuse are officially integrated - so you can test & debug Voice AI with confidence. Voice agents require both conversation-level testing + message-level observability to be production-ready. Our integration bridges this gap, giving

0

1

8

coval

@covaldev

8 months

🔒 Proud to announce that @coval is now HIPAA compliant! Healthcare stands at the forefront of the voice AI revolution. From improving patient care to streamlining clinical workflows, voice AI is fundamentally reshaping the healthcare industry. This makes security and compliance

0

2

7

coval

@covaldev

8 months

When you want to launch your new voice agent but you haven’t run any evals yet...

1

4

coval

@covaldev

8 months

NYT article: https://t.co/KD43Ki4U9o The Brutalist voice post-production Controversy: https://t.co/Wn03RRAhym

theconversation.com

The technology can deceive so easily that public confidence in generative AI is only possible with full disclosure.

0

3

coval

@covaldev

8 months

🧵 This week in conversational AI 🧵 • Conversational AI in healthcare is set for massive growth, according to this week’s report by Coherent Market Insights! Exciting to see use cases booming across the market, from to Assort Health, Counsel Health and more.careCycle (YC W25)

1

0

2

coval

@covaldev

8 months

🚀 Optimizing Voice AI with Coval + Retell AI 🚀 Are you building your voice agents with @retellai? Then you'll be excited about this partnership! Coval & Retell AI join forces to enable infrastructure for end-to-end voice agent reliability. Why should you use Coval for testing

1

2

8

coval

@covaldev

8 months

Check out the study here:

medicalxpress.com

Imagine caring for a loved one with Alzheimer's disease, facing constant questions, challenges and uncertainty. Now, imagine having a trusted companion, a friendly and knowledgeable assistant, to...

0

3

coval

@covaldev

8 months

Think Voice AI Is Just For Sales? Think Again. Voice AI is so much more. And can have so much more impact on our population. We keep coming across how research institutes publish studies proving the effectiveness of voice AI for caregiving and medical support, particularly in

1

0

3

coval

@covaldev

8 months

Butterfly effects cripple AI agents. But there's an art to failure. Multi-step evaluations for conversational agents are fundamentally different from classic single-call evaluations. When a voice agent needs to perform multiple tool calls without human intervention, one small

0

2