Text to Speech
- Language code support: All Text to Speech models now support language codes for improved output. Normalization has been enabled for Eleven v3, Flash, and Turbo models to enhance audio quality and consistency.
Agents Platform
- Multi-voice agent history: Messages from multi-voice agents are now displayed in conversation history with clear separation by voice, making it easier to follow which voice spoke which part of a conversation.
SDK Releases
JavaScript SDK
- v2.15.0 - Adds support for new voice remix functionality
Python SDK
- v2.15.0 - Adds support for new voice remix functionality. Also fixed an issue with Pydantic.
React Components
- @elevenlabs/react@0.6.1 - Fix output bytes and device input/output switching
- @elevenlabs/client@0.6.1 - Fix output bytes and device input/output switching
MCP Server
- v0.7.0 - Latest release of the ElevenLabs MCP Server with new features and improvements for Claude Desktop integration. Includes a new loop parameter for SFX generation.
API
View API changes
New Endpoints
- Remix a voice - Create voice variations from existing voices
- Get Transcript By Id - Retrieve specific transcription results
Updated Endpoints
Backward Compatible Changes
- Get Project - Added optional share_id query parameter for project sharing functionality
- Convert Speech to Text - Modified enable_logging parameter for improved logging control
All API changes in this release are backward compatible and will not break existing integrations.
Agents Platform
- Gemini 2.5 Flash Lite HIPAA compliance: Added Gemini 2.5 Flash Lite to the list of HIPAA approved models for compliant conversations when a BAA is signed and zero-retention mode is enabled.
- Conversation ID in signed URLs: Added support for including conversation IDs in signed URL requests, providing better tracking and identification capabilities for conversation audio access.
SDK Releases
JavaScript SDK
- v2.13.0 - Released August 29, 2025. Adds support for the new loop parameter in SFX.
Python SDK
- v2.13.0 - Released August 29, 2025. Adds support for the new loop parameter in SFX.
ConvAI packages
- @elevenlabs/react v0.6.0 and @elevenlabs/client v0.6.0 - Released August 29, 2025. Fixed setVolume functionality, added client tool debugging, and added audio device controls.
MCP Server
- ElevenLabs MCP Server v0.6.0 - Released August 26, 2025. Fixed diarization functionality in speech-to-text and added music generation endpoints.
API
View API changes
Updated Endpoints
Dubbing
- Render project - Added optional should_normalize_volume query parameter to control audio normalization during rendering
Agents Platform
- Get signed URL - Added optional include_conversation_id query parameter to include conversation ID in the response
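As a rough illustration of the new query parameter, the sketch below only builds the request URL and sends nothing. The base URL, the endpoint path, the agent_id parameter, and the "true" string encoding are assumptions that do not come from this changelog and should be checked against the API reference; only include_conversation_id is taken from the entry above.

```python
from urllib.parse import urlencode

# Assumptions (not stated in this changelog): base URL, path, and agent_id parameter.
API_BASE = "https://api.elevenlabs.io"
SIGNED_URL_PATH = "/v1/convai/conversation/get-signed-url"


def signed_url_request(agent_id: str, include_conversation_id: bool = False) -> str:
    """Return the GET URL for a signed-URL request (nothing is sent)."""
    params = {"agent_id": agent_id}
    if include_conversation_id:
        # Parameter name taken from the changelog entry; value encoding is assumed.
        params["include_conversation_id"] = "true"
    return f"{API_BASE}{SIGNED_URL_PATH}?{urlencode(params)}"


print(signed_url_request("agent_123", include_conversation_id=True))
```

Leaving the flag off keeps the URL identical to what existing integrations already send, which matches the parameter being optional.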
Sound Effects
- Create sound effect - Added optional loop parameter to create sound effects that loop smoothly
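A minimal sketch of a request body that opts in to looping. Only the loop field comes from the entry above; the text field name is an assumption about the sound effect request schema, and no request is sent.

```python
import json


def sound_effect_payload(text: str, loop: bool = False) -> str:
    """Serialize a sound effect request body, adding loop only when requested."""
    body = {"text": text}  # "text" is an assumed field name for the prompt
    if loop:
        body["loop"] = True  # optional parameter named in the changelog
    return json.dumps(body)


print(sound_effect_payload("steady rain on a tin roof", loop=True))
```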
Removed Endpoints
- Delete workspace member - Removed the DELETE /v1/workspace/members endpoint for deleting workspace members. This endpoint was never meant to be publicly available.
Agents Platform
- Agent testing framework: Introduced a comprehensive testing framework for ElevenLabs agents, allowing developers to create, manage, and execute automated tests for their agents. This includes test creation, execution tracking, and result analysis capabilities.
- Test invocation management: Added support for resubmitting failed test invocations and viewing detailed test results to help developers debug and improve their agents.
- Enhanced agent configuration: Improved agent creation and management with additional workspace override capabilities and refined platform settings.
Text to Speech
- Pronunciation dictionary updates: Added support for updating pronunciation dictionaries with PATCH operations, enabling more flexible dictionary management.
- Enhanced timestamp support: Improved timestamp generation for text-to-speech conversions with better alignment data and streaming capabilities.
SDK Releases
- TypeScript SDK v2.12.2: Updated with the latest API schema changes, including full support for the new agent testing endpoints and enhanced Agents Platform capabilities.
- Python SDK v2.12.1: Released with complete support for all new API features, including agent testing framework and improved workspace resource management.
API
View API changes
New Endpoints
Added 10 new endpoints this week:
ElevenLabs Agent Testing
- POST /v1/convai/agent-testing/create - Create Agent Response Test - Create automated tests for your ElevenLabs agents
- GET /v1/convai/agent-testing/{test_id} - Get Agent Response Test By Id - Retrieve specific test configurations and results
- PUT /v1/convai/agent-testing/{test_id} - Update Agent Response Test - Modify existing test setups and parameters
- DELETE /v1/convai/agent-testing/{test_id} - Delete Agent Response Test - Remove test configurations from your workspace
- POST /v1/convai/agent-testing/summaries - Get Agent Response Test Summaries By Ids - Retrieve aggregated test results for multiple tests
- GET /v1/convai/agent-testing - List Agent Response Tests - Browse all available tests in your workspace
- POST /v1/convai/agents/{agent_id}/run-tests - Run Tests On The Agent - Execute test suites against specific agents
- GET /v1/convai/test-invocations/{test_invocation_id} - Get Test Invocation - Retrieve detailed test execution results
- POST /v1/convai/test-invocations/{test_invocation_id}/resubmit - Resubmit Tests - Re-run failed test invocations
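The agent testing routes listed above can be collected into a small lookup table. This is only a sketch: the methods and paths are copied from the list, while the base URL, the route names, and the route helper are hypothetical and not part of any SDK. Request bodies are left out because the changelog does not describe them.

```python
from urllib.parse import quote

API_BASE = "https://api.elevenlabs.io"  # assumed base URL, not stated in this changelog

# Method and path templates copied from the endpoint list; the keys are made-up labels.
AGENT_TEST_ROUTES = {
    "create_test": ("POST", "/v1/convai/agent-testing/create"),
    "get_test": ("GET", "/v1/convai/agent-testing/{test_id}"),
    "update_test": ("PUT", "/v1/convai/agent-testing/{test_id}"),
    "delete_test": ("DELETE", "/v1/convai/agent-testing/{test_id}"),
    "get_summaries": ("POST", "/v1/convai/agent-testing/summaries"),
    "list_tests": ("GET", "/v1/convai/agent-testing"),
    "run_tests": ("POST", "/v1/convai/agents/{agent_id}/run-tests"),
    "get_invocation": ("GET", "/v1/convai/test-invocations/{test_invocation_id}"),
    "resubmit": ("POST", "/v1/convai/test-invocations/{test_invocation_id}/resubmit"),
}


def route(name: str, **path_params: str) -> tuple:
    """Return (method, full URL) for a named route, URL-quoting each path parameter."""
    method, template = AGENT_TEST_ROUTES[name]
    quoted = {key: quote(value, safe="") for key, value in path_params.items()}
    return method, API_BASE + template.format(**quoted)


print(route("run_tests", agent_id="agent_123"))
print(route("resubmit", test_invocation_id="inv_456"))
```

Keeping the table separate from the HTTP client makes it easy to see that run-tests hangs off the agent resource while resubmission hangs off the test invocation.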
Pronunciation Dictionaries
- PATCH /v1/pronunciation-dictionaries/{pronunciation_dictionary_id} - Update Pronunciation Dictionary - Update existing pronunciation dictionaries with new rules or modifications
Eleven v3 API
Eleven v3 is now available via the API.
To start using it, simply specify the model ID eleven_v3 when making Text to Speech requests.
Additionally, the Text to Dialogue API endpoint is now available to all.
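To make the model ID switch concrete, here is a minimal sketch that prepares a Text to Speech request using only the Python standard library. The model ID eleven_v3 comes from the note above; the base URL, the /v1/text-to-speech/{voice_id} path, the xi-api-key header, and the text and model_id body fields are assumptions drawn from the public API reference rather than from this changelog. The request is built but not sent.

```python
import json
import urllib.request

API_BASE = "https://api.elevenlabs.io"  # assumed base URL


def build_tts_request(api_key: str, voice_id: str, text: str) -> urllib.request.Request:
    """Prepare (but do not send) a Text to Speech request that selects Eleven v3."""
    body = json.dumps({"text": text, "model_id": "eleven_v3"}).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_BASE}/v1/text-to-speech/{voice_id}",  # assumed path
        data=body,
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


req = build_tts_request("YOUR_API_KEY", "YOUR_VOICE_ID", "Hello from Eleven v3.")
print(req.get_method(), req.full_url)
# Sending is left out on purpose; urllib.request.urlopen(req) would perform the call.
```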
Music Generation API
The Eleven Music API is now freely available to all paid users.
Visit the quickstart to learn how to integrate. The API section below highlights the new endpoints that have been released.
Global TTS API preview
ElevenLabs is launching inference servers in additional geographical regions to reduce latency for clients outside of the US. Initial request processing will be available in the Netherlands and in Singapore in addition to the US.
To learn how to get started, head to the docs.
API
View API changes
New Endpoints
- Added 4 new endpoints:
- Compose music - Create music from text prompts
- Create composition plan - Optimize music generation parameters before processing
- Compose music with details - Advanced music generation with detailed parameters
- Stream music - Real-time streaming music generation
Updated Endpoints
Text to Speech
- Updated Text to Speech endpoints with improved parameter handling:
- Convert text to speech - Enhanced voice settings and text input parameter handling
- Stream text to speech - Improved streaming parameter management
- Convert with timestamps - Better alignment parameter handling
Voice Management
- Updated Voice endpoints with enhanced parameter support:
- Create voice previews - Improved preview generation parameters
- Create voice from preview - Enhanced voice creation options
- Get voice - Updated voice parameter responses
- List voices - Improved voice listing parameters
Speech to Text
- Updated Speech to Text endpoint:
- Convert speech to text - Enhanced transcription parameter handling
Usage and Analytics
- Updated Usage endpoints:
- Get character stats - Added aggregation bucket size parameter and improved breakdown type options
Workspace Management
- Updated Workspace endpoints:
- Get workspace resource - Enhanced resource type parameter handling
- Share workspace resource - Updated sharing parameter structure
- Unshare workspace resource - Updated unsharing parameter structure
Music
Eleven Music: Officially released new music generation model that creates studio-grade music with natural language prompts in any style. See the capabilities page and prompting guide for more information.
SDKs
v2.9.0 of the TypeScript SDK released
- Includes better typing support for Speech to Text requests in webhook mode
- Includes new enums for ChatGPT 5
v2.9.2 of the Python SDK released
- Includes new enums for ChatGPT 5
Agents Platform
Agent response correction: Updated WebSocket event schema and handling for improved agent response correction functionality.
API
View API changes
User Account Changes
- Updated user account endpoint:
- Get user subscription info - Deprecated convai_chars_per_minute and convai_asr_chars_per_minute fields in the response schema. These fields will now always return None.
Parameter Removals
- Updated conversation token endpoint:
- Get conversation token - Removed source and version query parameters. These were internal parameters not meant for public use and their removal does not affect functionality.
Agents Platform
- Conversation token generation: Added new route to generate Conversation Tokens for WebRTC connections. Learn more
- Expandable widget options: Our embeddable widget can now be customized to start in the expanded state and disable collapsing altogether.
- Simplified operation IDs: We simplified the OpenAPI operation IDs for Agents Platform endpoints to improve developer experience.
Workspaces
- Simplified operation IDs: We simplified the operation IDs for our workspace endpoints to improve API usability.
SDK Releases
- Python SDK v2.8.2: Released latest version with improvements and bug fixes. View release
NPM Packages
- @elevenlabs/react-native@0.1.2: Enhanced React Native support
- @elevenlabs/client@0.4.4: Client library improvements
- @elevenlabs/react@0.4.5: React component updates
API
View API changes
New Endpoints
Agents Platform
- Get conversation token - Generate authentication token for WebRTC connections
Updated Endpoints
Voice Management
- List voices - Added voice_ids query parameter for filtering specific voices
Agents Platform Core
- List conversations - Added summary_mode parameter for conversation summaries
Operation ID Improvements
- Agents Platform endpoints: Simplified operation IDs for better developer experience while maintaining full backward compatibility
- Workspace endpoints: Streamlined operation IDs across all workspace-related endpoints to improve API usability
Workspaces
- Service account API key management: Added comprehensive API endpoints for managing service account API keys, including creation, retrieval, updating, and deletion capabilities. See Service Accounts documentation.
Agents Platform
- Post-call webhook migration: The post call webhook format is being migrated so that webhook handlers can be auto generated in the SDKs. This is not a breaking change, and no further action is required if your current handler accepts additional fields. Please see more information here.
- Agent transfer improvements: Fixed system variable system_agent_id to properly update after agent-to-agent transfers, ensuring accurate conversation context tracking. Added new system_current_agent_id variable for tracking the current active agent. Learn more about dynamic variables.
- Enhanced public agent page: Added text input functionality and dynamic variable support to the public talk-to-agent page. You can now pass dynamic variables via URL parameters (e.g., ?var_username=value) and use text input during voice conversations. See dynamic variables guide.
- Voicemail detection: Added voicemail detection as a built-in tool for ElevenLabs agents to improve call handling. Learn about voicemail detection.
- Conversation filtering: Added user_id query parameter to conversation list endpoint for filtering conversations by initiating user.
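The URL-parameter form of dynamic variables mentioned above (?var_username=value) can be produced with a small helper. This is a sketch only: the var_ prefix comes from the changelog example, while the page URL is a placeholder and the helper name is hypothetical.

```python
from urllib.parse import urlencode


def with_dynamic_variables(page_url: str, variables: dict) -> str:
    """Append dynamic variables to a page URL as var_<name>=<value> query parameters."""
    query = urlencode({f"var_{name}": value for name, value in variables.items()})
    if not query:
        return page_url
    separator = "&" if "?" in page_url else "?"
    return f"{page_url}{separator}{query}"


# The URL below is a placeholder, not the real talk-to-agent page address.
print(with_dynamic_variables("https://example.com/agent-page", {"username": "Ada Lovelace"}))
```

Using urlencode keeps spaces and other reserved characters in variable values from breaking the link.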
Speech to Text
- Multi-channel transcription: Added use_multi_channel parameter to transcription endpoint for processing audio files with multiple speakers on separate channels. Supports up to 5 channels with per-channel transcription results. See multichannel guide.
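Because the entry above states a limit of 5 channels, a client could check a file locally before uploading it. The sketch below does this for WAV input with the standard wave module; the helper names are hypothetical, other audio formats are not covered, and the upload itself is left out.

```python
import io
import wave

MAX_CHANNELS = 5  # limit stated in the changelog entry


def channel_count(wav_bytes: bytes) -> int:
    """Read the channel count from the header of an in-memory WAV file."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        return wav.getnchannels()


def fits_multi_channel_limit(wav_bytes: bytes) -> bool:
    """True when the file has between 1 and MAX_CHANNELS channels."""
    return 1 <= channel_count(wav_bytes) <= MAX_CHANNELS


def make_silent_wav(channels: int, frames: int = 8) -> bytes:
    """Build a tiny silent 16-bit WAV file, used here only to exercise the check."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(2)  # 2 bytes per sample
        wav.setframerate(8000)
        wav.writeframes(b"\x00" * (2 * channels * frames))
    return buffer.getvalue()


print(fits_multi_channel_limit(make_silent_wav(2)))
print(fits_multi_channel_limit(make_silent_wav(6)))
```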
Studio
- Caption support: Added caption functionality to Studio projects with new captions_enabled and caption_style properties for both podcasts and general projects. Learn more about Studio.
SDKs
- JavaScript SDK v2.7.0: Released with latest API support and improvements
- Python SDK v2.8.1: Released with latest API support and improvements
- @elevenlabs/client v0.4.1: Updated client library with latest features, including WebRTC support
- @elevenlabs/react v0.4.1: Enhanced React components with latest features, including WebRTC support
- @elevenlabs/react-native v0.1.1: New React Native package for mobile integration with ElevenLabs Agents, based on WebRTC
- @elevenlabs/convai-widget-embed v0.1.0: New package for embedding Agents Platform widgets into web applications
- Swift SDK v2.0.3: Released with WebRTC support for real-time Agents Platform integration on Apple platforms
API Schema Updates
View API changes
New Endpoints
- Service Account Management: Added 5 new endpoints for service account API key management:
- GET /v1/service-accounts/{service_account_user_id}/api-keys - Retrieve service account API keys
- POST /v1/service-accounts/{service_account_user_id}/api-keys - Create service account API key
- DELETE /v1/service-accounts/{service_account_user_id}/api-keys/{api_key_id} - Delete service account API key
- PATCH /v1/service-accounts/{service_account_user_id}/api-keys/{api_key_id} - Update service account API key
- GET /v1/service-accounts - Get workspace service accounts
Removed Endpoints
- Legacy Project Endpoints: Removed 22 deprecated project management endpoints as part of Studio API consolidation:
- All /v1/projects/* endpoints (replaced by /v1/studio/projects/*)
- Legacy Text to Voice endpoints (/v1/text-to-voice/create-voice-from-preview, /v1/text-to-voice/remixing-sessions/*)
- Legacy ConvAI knowledge base endpoints
Updated Endpoints
Speech to Text
- Multi-channel support: Updated /v1/speech-to-text endpoint:
- Added use_multi_channel parameter for processing multi-speaker audio files
- Modified response structure to include optional language_code, language_probability, text, and words properties
Agents Platform
- Enhanced agent configuration: Updated agent creation and management endpoints:
- Added voicemail detection to built-in tools
- Improved RAG configuration with max_retrieved_rag_chunks_count parameter
- Enhanced conversation token endpoint with source and version parameters
- Added user_id filtering to conversations list endpoint
Studio Projects
- Caption support: Updated Studio project endpoints to include:
- captions_enabled property for enabling/disabling captions
- caption_style property for global caption styling configuration
Text to Voice
- Improved voice generation: Enhanced voice creation endpoints with:
- loudness control (-1 to 1 range, 0 corresponds to -24 LUFS)
- quality parameter for balancing output quality vs variety
- guidance_scale parameter for controlling AI creativity vs prompt adherence
Agents Platform
- Agent workspace overrides: Enhanced agent configuration with workspace-level overrides for better enterprise management and customization.
- Agent API improvements: Updated agent creation and modification endpoints with enhanced configuration options, though these changes may break backward compatibility.
Dubbing
- Dubbing endpoint access: Added new endpoint to list all available dubs.
API
View API changes
New Endpoints
- Added 1 new endpoint:
- List dubs you have access to - GET /v1/dubbing
Updated Endpoints
Text to Speech
- Updated Text to Speech endpoints with backward compatible changes:
- Convert text to speech - Enhanced response schema
- Convert with timestamps - Improved timestamp handling
- Stream text to speech - Enhanced streaming response
Voice Management
- Updated Voice endpoints with backward compatible improvements:
- Get voices - Enhanced voice information schema
- Get voice - Improved voice details response
- Get voice settings - Enhanced settings schema
Voice Creation
- Updated Voice Creation endpoints:
- Create voice previews - Enhanced preview creation
- Create voice from preview - Improved voice generation
- Create voice - Enhanced voice creation response
Dubbing
- Updated Dubbing endpoints with backward compatible changes:
- Dub a video or audio file - Enhanced dubbing request schema
- Get dubbing project - Improved project response
Workspace Management
- Breaking Change: Updated Workspace endpoints:
- Get workspace resource - Modified resource_type query parameter handling and response schema
- Share workspace resource - Enhanced sharing configuration
- Unshare workspace resource - Improved unsharing workflow
Speech to Text
- Updated Speech to Text endpoint:
- Convert speech to text - Enhanced transcription request and response schemas
Agents Platform
Updated Agents Platform endpoints:
- Create agent - Modified agent creation schema with workspace overrides
- Get agent - Enhanced agent response with new configuration options
- Update agent - Improved agent update capabilities
- Simulate conversation - Enhanced conversation simulation
- Stream conversation simulation - Improved streaming simulation
Other Updates
- Get conversation - Enhanced conversation details
- Get Agents Platform settings - Improved settings response
- Update Agents Platform settings - Enhanced settings modification
Agents Platform
- Azure OpenAI custom LLM support: Added support for Azure-hosted OpenAI models in custom LLM configurations. When using an Azure endpoint, a new required field for API version is now available in the UI.
- Genesys output variables: Added support for output variables when using Genesys integrations, enabling better call analytics and data collection.
- Gemini 2.5 Preview Models Deprecation: Models gemini-2.5-flash-preview-05-20 and gemini-2.5-flash-preview-04-17 have been deprecated in Agents Platform as they are being deprecated on 15th July by Google. All agents using these models will automatically be transferred to gemini-2.5-flash the next time they are used. No action is required.
- WebRTC rollout: Began progressive rollout of WebRTC capabilities for improved connection stability and performance. WebRTC mode can be selected in the React SDK and is used in 11.ai.
- Keypad touch tone: Fixed an issue affecting playing keypad touch tones on Twilio. See keypad touch tone documentation.
Voices
- Language collection navigation: Added quick navigation from language preview collections to view all available voices in that language, making it easier to explore voice options by language.
Text to Voice
- Preview streaming: Added new streaming endpoint for Text to Voice previews, allowing real-time streaming of generated voice previews via /v1/text-to-voice/{generated_voice_id}/stream.
- Enhanced voice design: Added stream_previews option to voice design endpoint, enabling streaming-only preview generation for improved performance.
- Improved parameter controls: Enhanced loudness, quality, and guidance scale parameters with better control options for more precise voice generation.
Studio
- Podcast customization: Added support for intro and outro text in podcast creation, along with custom instructions prompts for better style and tone control.
SDKs
- JavaScript SDK v2.6.0: Released with latest API support and improvements
- Python SDK v2.7.1: Released with bug fixes and enhancements
- @elevenlabs/client v0.3.0: Updated client library with support for User IDs in Agents Platform.
- @elevenlabs/react v0.3.0: Add WebRTC debug support.
API
View API changes
New Endpoints
- Added 1 new endpoint:
- Stream Text to Voice Preview - Stream generated voice previews in real-time
Updated Endpoints
Text to Voice
- Create voice previews - Enhanced loudness, quality, and guidance_scale parameter descriptions
- Design voice - Added stream_previews property for streaming-only preview generation
Studio
- Create podcast - Added intro, outro, and instructions_prompt properties
Agents Platform
- Simulate conversation - Enhanced simulation configuration with improved parameter descriptions
- Stream simulate conversation - Enhanced simulation configuration with improved parameter descriptions
- Get Agents Platform settings - Updated RAG retention period configuration
- Update Agents Platform settings - Updated RAG retention period configuration
- Retry batch calling - Added batch retry functionality
Agents Platform
- HIPAA Compliance: Gemini 2.5 Flash is now available for HIPAA customers, providing enhanced AI capabilities while maintaining strict healthcare compliance standards.
- Post-call Audio: Added support for returning call audio in post-call webhooks, enabling comprehensive conversation analysis and quality assurance workflows.
- Enhanced Widget: Added additional text customization options including start chat button text, chatting status text, and input placeholders for text-only and new conversations.
- Agent Transfers: Improved agent transfer capabilities with transfer delay configuration, custom transfer messages, and control over transferred agent first message behavior.
- SIP Trunk Enhancements: Added support for separate inbound and outbound SIP trunk configurations with enhanced access control and transfer options.
Dubbing
- API Schema Update: Updated our API documentation to explicitly require the target_language parameter for dubbing projects. This parameter has always been required; we're just making it clearer in our docs. No code changes needed.
- Duration Validation: Added validation to ensure calculated duration makes sense, preventing zero-credit charges for invalid audio uploads.
Speech to Text
- Deterministic Sampling: Added seed parameter support for deterministic sampling, enabling reproducible speech-to-text results.
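As a general illustration of why a seed makes sampling reproducible, the sketch below uses Python's random module as a stand-in. It is an analogy only and says nothing about how the speech-to-text model samples internally; the vocabulary and function name are made up.

```python
import random
from typing import List, Optional

VOCAB = ["hello", "hallo", "hollow"]  # made-up candidate words


def sample_words(seed: Optional[int], n: int = 5) -> List[str]:
    """Draw n words; a fixed seed fixes the sequence, None uses fresh entropy."""
    rng = random.Random(seed)
    return [rng.choice(VOCAB) for _ in range(n)]


# Two runs with the same seed produce the same sequence.
print(sample_words(42) == sample_words(42))
```

The same idea applies to a seeded API call: repeating a request with an unchanged seed and unchanged input is what makes results comparable across runs.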
Forced Alignment
- Confidence Scoring: Added confidence scoring with loss field for words and overall transcript accuracy assessment using forced alignment.
Usage Analytics
- Workspace Breakdown: Added reporting workspace ID breakdown for character usage statistics, providing detailed usage insights across workspaces.
SDKs
- React Agents Platform SDK: Released v0.2.0 with support for Indian data residency and WebRTC mode for Agents Platform.
- Python SDK: Released v2.6.1 with enhanced Agents Platform capabilities and bug fixes.
- JavaScript SDK: Released v2.5.0 with improved Agents Platform SDK support and new features.
API
View API changes
Deprecations
- POST /v1/convai/phone-numbers/create has been deprecated in favor of POST /v1/convai/phone-numbers. Please note that migrating to the new endpoint requires a few adjustments:
- Replace provider_config field with inbound_trunk and outbound_trunk for SIP trunk configurations
- Update response parsing to handle the new trunk configuration structure
Schema Removals
- Removed SIPTrunkConfigResponseModel, SIPTrunkCredentials, TransferToNumberToolConfig
- Removed incomplete_expired and canceled subscription statuses
New Features
Enhanced SIP Trunk Support
- SIP trunk configuration now uses separate inbound and outbound trunk configs instead of single configuration
- Deprecated provider_config field in SIP trunk response from the new endpoint (replaced with inbound_trunk and outbound_trunk)
- Inbound trunk access control with allowed addresses and phone numbers
- SIP URI transfer destinations alongside phone number transfers
- Transfer to number improvements (conference or SIP refer)
Agent Transfers
- Transfer delay configuration with delay_ms
- Custom transfer messages
- Control over transferred agent first message behavior
Conversation Enhancements
- ElevenLabs Assistant integration tracking
- User ID tracking for conversation participants and initiators
- Audio data in post-call webhooks (configurable)
- MCP (Model Context Protocol) tool call details in conversation history
Widget Improvements
- Additional text customization options:
- Start chat button text
- Chatting status text
- Input placeholders for text-only and new conversations
API Improvements
Speech to Text
- Added deterministic sampling with seed parameter in Convert speech to text
Forced Alignment
- Added confidence scoring with loss field for words and overall transcript in Forced alignment
Usage Analytics
- Added reporting workspace ID breakdown for character stats in Get characters usage metrics
Tool Configuration
- Client tool response timeout increased from 30 to 120 seconds
Workspace Resources
- Added agent response tests resource type
Deprecations
- Phone number provider_config field (use inbound_trunk/outbound_trunk instead)
- phone_number field in transfer configurations (use transfer_destination instead)