
Building clinical-grade voice agents for Pharma
Increasing physician reach by 30% and cutting admin time by 10 hrs/week
Today we’re introducing Scribe v2 Realtime, the most accurate low-latency Speech to Text model, delivering live transcription in under 150 ms.

Scribe v2 Realtime sets a new standard for low-latency Speech to Text.
Designed for live use cases such as voice agents, meeting assistants, and real-time captioning, it transcribes speech in under 150 ms in 90 languages, including English, French, German, Italian, Spanish, and Portuguese.

Scribe v2 Realtime is specifically built for agentic use cases. On 500 hard samples containing background noise and complex information, it significantly outperforms all other models.
Scribe v2 Realtime delivers human-level understanding in real time, enabling natural conversation and immediate response in live environments. It achieves 93.5% accuracy across 30 commonly used European and Asian languages.
Scribe v2 Realtime is available today through the ElevenLabs API.
Explore the documentation: https://elevenlabs.io/docs/cookbooks/speech-to-text/streaming

Deploy natural, human-sounding agents powered by Scribe v2 Realtime. Build voice assistants for support, sales, or in-product experiences that can understand and respond in real time.
Learn more: https://elevenlabs.io/agents

Use Scribe v2 Realtime through our API or directly within ElevenLabs Agents.
Sign up here: https://elevenlabs.io/app/sign-up

