April 7, 2025 | ElevenLabs Documentation

Speech to text

scribe_v1_experimental: Launched an experimental preview of the Scribe v1 model with better performance on audio files containing multiple languages, fewer hallucinations when audio is interleaved with silence, and improved audio tags. The model is available via the API under the model name scribe_v1_experimental.
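As a rough sketch of how the experimental model might be selected, the snippet below only assembles the pieces of a transcription request and does not send anything. The base URL, the /v1/speech-to-text path, the xi-api-key header, and the model_id field name are assumptions not stated in this changelog; only the model name scribe_v1_experimental comes from the entry above.

```python
# Hypothetical request builder: the endpoint path, header name, and form
# field name are assumptions; only the model name comes from the changelog.
API_BASE = "https://api.elevenlabs.io"  # assumed base URL


def build_transcription_request(api_key: str,
                                model_id: str = "scribe_v1_experimental") -> dict:
    """Return the parts of a speech-to-text request without sending it."""
    return {
        "method": "POST",
        "url": f"{API_BASE}/v1/speech-to-text",  # assumed path
        "headers": {"xi-api-key": api_key},      # assumed auth header
        "data": {"model_id": model_id},          # assumed form field
    }


request = build_transcription_request("YOUR_API_KEY")
print(request["data"]["model_id"])
```

The returned dictionary could then be handed to an HTTP client of your choice, with the audio file attached as a multipart upload.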

A-law format support: Added A-law format with an 8kHz sample rate to enable integration with European telephony systems.
Fixed quota issues: Fixed a database bug that caused some requests to be mistakenly rejected as exceeding their quota.
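For readers unfamiliar with the A-law format mentioned above, the following sketch shows how 16-bit linear PCM samples are companded into 8-bit A-law bytes. It follows the widely used G.711 reference encoder and is not ElevenLabs' implementation; the function names are illustrative.

```python
import struct

# Upper bound of each of the eight A-law segments (after dropping 3 LSBs).
SEGMENT_ENDS = (0x1F, 0x3F, 0x7F, 0xFF, 0x1FF, 0x3FF, 0x7FF, 0xFFF)


def linear_to_alaw(sample: int) -> int:
    """Compress one signed 16-bit PCM sample into one A-law byte."""
    if not -32768 <= sample <= 32767:
        raise ValueError("sample must fit in a signed 16-bit integer")
    value = sample >> 3  # A-law works on 13-bit magnitudes
    if value >= 0:
        mask = 0xD5      # sign bit set, plus even-bit inversion
    else:
        mask = 0x55      # even-bit inversion only
        value = -value - 1
    # Find the first segment whose upper bound contains the value.
    segment = next(i for i, end in enumerate(SEGMENT_ENDS) if value <= end)
    shift = 1 if segment < 2 else segment
    encoded = (segment << 4) | ((value >> shift) & 0x0F)
    return encoded ^ mask


def encode_pcm16(pcm: bytes) -> bytes:
    """Encode little-endian 16-bit PCM bytes as A-law bytes."""
    return bytes(linear_to_alaw(s) for (s,) in struct.iter_unpack("<h", pcm))
```

Each 16-bit sample becomes a single byte, so one second of 8kHz audio occupies 8,000 bytes.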

Document type filtering: Added support for filtering knowledge base documents by their type (file, URL, or text).
Non-audio agents: Added support for conversational agents that don’t output audio but still send response transcripts and can use tools. Non-audio agents can be enabled by removing the audio client event.
Improved agent templates: Updated all agent templates with enhanced configurations and prompts. See more about how to improve system prompts here.
Fixed stuck exports: Fixed an issue that caused exports to be stuck for extended periods.

Fixed volume normalization: Fixed an issue with streaming project snapshots when volume normalization is enabled.

Forced alignment: Added new forced alignment endpoint for aligning audio with text, perfect for subtitle generation.
Batch calling: Added a batch calling endpoint for scheduling calls to multiple recipients.
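To illustrate the subtitle use case mentioned for forced alignment, here is a minimal sketch that turns word-level timings into SRT cues. The input shape (a list of words, each with text, start, and end in seconds) is an assumption about what an alignment result might look like, not the documented response schema, and the sample words are made up.

```python
def format_srt_time(seconds: float) -> str:
    """Format a time offset in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words: list[dict], max_words: int = 7) -> str:
    """Group aligned words into numbered SRT cues of at most max_words each."""
    cues = []
    for i in range(0, len(words), max_words):
        chunk = words[i:i + max_words]
        start = format_srt_time(chunk[0]["start"])
        end = format_srt_time(chunk[-1]["end"])
        text = " ".join(w["text"] for w in chunk)
        cues.append(f"{len(cues) + 1}\n{start} --> {end}\n{text}\n")
    return "\n".join(cues)


# Illustrative alignment data (hypothetical shape and values).
sample_words = [
    {"text": "Hello", "start": 0.0, "end": 0.4},
    {"text": "world", "start": 0.5, "end": 0.9},
]
print(words_to_srt(sample_words))
```

Grouping by a fixed word count keeps the sketch short; a production subtitle generator would more likely split on pauses or punctuation.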

Get knowledge base - Added types parameter for filtering documents by type
The general endpoint for creating knowledge base documents is now marked as deprecated in favor of specialized endpoints
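The types filter described above might be used along these lines. This sketch only builds the request URL; the base URL, the /v1/convai/knowledge-base path, and the choice to send types as a repeated query parameter are assumptions. The three type values (file, url, text) come from the changelog.

```python
from urllib.parse import urlencode

API_BASE = "https://api.elevenlabs.io"  # assumed base URL
VALID_TYPES = ("file", "url", "text")   # document types named in the changelog


def knowledge_base_url(types: list[str]) -> str:
    """Build a knowledge base listing URL filtered by document type."""
    unknown = [t for t in types if t not in VALID_TYPES]
    if unknown:
        raise ValueError(f"unsupported document types: {unknown}")
    base = f"{API_BASE}/v1/convai/knowledge-base"  # assumed path
    if not types:
        return base
    # Assumption: list values are sent as repeated `types` parameters.
    return f"{base}?{urlencode({'types': types}, doseq=True)}"


print(knowledge_base_url(["file", "url"]))
```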

Get user subscription - Added professional_voice_slots_used property to track number of professional voices used in a workspace
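A client could read the new property from a subscription response roughly as follows. The sample payload is illustrative, not real API output; only the professional_voice_slots_used field name comes from the changelog, and defaulting to zero when the field is absent is a design choice of this sketch.

```python
import json


def professional_slots_used(subscription_json: str) -> int:
    """Return the number of professional voice slots in use, or 0 if absent."""
    data = json.loads(subscription_json)
    return int(data.get("professional_voice_slots_used", 0))


# Illustrative payload; other fields in a real response are omitted.
sample_response = '{"professional_voice_slots_used": 2}'
print(professional_slots_used(sample_response))
```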

Added silence_end_call_timeout parameter to set maximum wait time before terminating a call
Removed /v1/convai/agents/{agent_id}/add-secret endpoint (now handled by workspace secrets endpoints)