Text to Speech

  • Language code support: All Text to Speech models now support language codes for improved output. Normalization has been enabled for Eleven v3, Flash, and Turbo models to enhance audio quality and consistency.
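
As a rough illustration of passing a language code, the sketch below builds a Text to Speech request body. It assumes the body field is named language_code and uses eleven_flash_v2_5 and "fr" purely as example values; the helper name is hypothetical.

```python
import json

def build_tts_payload(text, model_id, language_code=None):
    """Build a JSON-serializable Text to Speech body (field names are assumptions)."""
    payload = {"text": text, "model_id": model_id}
    if language_code:
        # Only include the language code when the caller supplies one.
        payload["language_code"] = language_code
    return payload

body = build_tts_payload("Bonjour tout le monde", "eleven_flash_v2_5", "fr")
print(json.dumps(body, ensure_ascii=False))
```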

Agents Platform

  • Multi-voice agent history: Messages from multi-voice agents are now displayed in conversation history with clear separation by voice, making it easier to follow which voice spoke which part of a conversation.

SDK Releases

JavaScript SDK

  • v2.15.0 - Adds support for new voice remix functionality

Python SDK

  • v2.15.0 - Adds support for new voice remix functionality. Also fixed an issue with Pydantic.

React Components

  • @elevenlabs/react@0.6.1 - Fix output bytes and device input/output switching
  • @elevenlabs/client@0.6.1 - Fix output bytes and device input/output switching

MCP Server

  • v0.7.0 - Latest release of the ElevenLabs MCP Server with new features and improvements for Claude Desktop integration. Includes new loop parameter for SFX generation.

API

New Endpoints

Updated Endpoints

Backward Compatible Changes

  • Get Project - Added optional share_id query parameter for project sharing functionality
  • Convert Speech to Text - Modified enable_logging parameter for improved logging control

All API changes in this release are backward compatible and will not break existing integrations.

Agents Platform

  • Gemini 2.5 Flash Lite HIPAA compliance: Added Gemini 2.5 Flash Lite to the list of HIPAA approved models for compliant conversations when a BAA is signed and zero-retention mode is enabled.
  • Conversation ID in signed URLs: Added support for including conversation IDs in signed URL requests, providing better tracking and identification capabilities for conversation audio access.

SDK Releases

JavaScript SDK

  • v2.13.0 - Released August 29, 2025. Adds support for new loop parameter in SFX.

Python SDK

  • v2.13.0 - Released August 29, 2025. Adds support for new loop parameter in SFX.

ConvAI packages

MCP Server

  • ElevenLabs MCP Server v0.6.0 - Released August 26, 2025. Fixed diarization functionality in speech-to-text and added music generation endpoints.

API

Updated Endpoints

Dubbing

  • Render project - Added optional should_normalize_volume query parameter to control audio normalization during rendering

Agents Platform

  • Get signed URL - Added optional include_conversation_id query parameter to include conversation ID in the response
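
A minimal sketch of requesting a signed URL with the new flag might look like the following. The host, the endpoint path, and the agent_id parameter are assumptions not stated in this entry; only include_conversation_id comes from the note above.

```python
from urllib.parse import urlencode

API_BASE = "https://api.elevenlabs.io"  # assumed host
SIGNED_URL_PATH = "/v1/convai/conversation/get-signed-url"  # assumed path

def signed_url_endpoint(agent_id, include_conversation_id=False):
    """Return the GET URL for a signed-URL request."""
    params = {"agent_id": agent_id}  # assumed parameter name
    if include_conversation_id:
        # Optional flag described in this release.
        params["include_conversation_id"] = "true"
    return f"{API_BASE}{SIGNED_URL_PATH}?{urlencode(params)}"

endpoint = signed_url_endpoint("agent_123", include_conversation_id=True)
print(endpoint)
```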

Sound Effects

  • Create sound effect - Added optional loop parameter to create sound effects that loop smoothly
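
For illustration, a looping sound effect request body could be assembled as below. This is a sketch that assumes the prompt field is called text and that loop is a boolean in the JSON body; neither detail is confirmed by this entry.

```python
import json

def build_sfx_body(prompt, loop=False):
    """Serialize a sound effect request body (field names are assumptions)."""
    body = {"text": prompt}
    if loop:
        body["loop"] = True  # ask for a seamlessly looping effect
    return json.dumps(body)

sfx_body = build_sfx_body("Gentle rain on a tin roof", loop=True)
print(sfx_body)
```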

Removed Endpoints

  • Delete workspace member - Removed the DELETE /v1/workspace/members endpoint for deleting workspace members. This endpoint was never meant to be publicly available.

Agents Platform

  • Agent testing framework: Introduced a comprehensive testing framework for ElevenLabs agents, allowing developers to create, manage, and execute automated tests for their agents. This includes test creation, execution tracking, and result analysis capabilities.
  • Test invocation management: Added support for resubmitting failed test invocations and viewing detailed test results to help developers debug and improve their agents.
  • Enhanced agent configuration: Improved agent creation and management with additional workspace override capabilities and refined platform settings.

Text to Speech

  • Pronunciation dictionary updates: Added support for updating pronunciation dictionaries with PATCH operations, enabling more flexible dictionary management.
  • Enhanced timestamp support: Improved timestamp generation for text-to-speech conversions with better alignment data and streaming capabilities.

SDK Releases

  • TypeScript SDK v2.12.2: Updated with the latest API schema changes, including full support for the new agent testing endpoints and enhanced Agents Platform capabilities.
  • Python SDK v2.12.1: Released with complete support for all new API features, including agent testing framework and improved workspace resource management.

API

New Endpoints

Added 10 new endpoints this week:

ElevenLabs Agent Testing

Pronunciation Dictionaries

  • PATCH /v1/pronunciation-dictionaries/{pronunciation_dictionary_id} - Update Pronunciation Dictionary - Update existing pronunciation dictionaries with new rules or modifications
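
A PATCH call to this route could be prepared roughly as follows. The path comes from the entry above; the host, the xi-api-key header, and the "name" field in the example body are assumptions used only to show the shape of the request.

```python
import json
import urllib.request

API_BASE = "https://api.elevenlabs.io"  # assumed host

def build_dictionary_patch(dictionary_id, changes, api_key):
    """Prepare (but do not send) a PATCH request for a pronunciation dictionary."""
    url = f"{API_BASE}/v1/pronunciation-dictionaries/{dictionary_id}"
    return urllib.request.Request(
        url,
        data=json.dumps(changes).encode("utf-8"),
        method="PATCH",
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
    )

# "name" is a hypothetical updatable field.
patch_req = build_dictionary_patch("dict_123", {"name": "Product names"}, "YOUR_API_KEY")
print(patch_req.get_method(), patch_req.full_url)
```

Passing the prepared request to urllib.request.urlopen would send it.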

Eleven v3 API

Eleven v3 is now available via the API.

To start using it, simply specify the model ID eleven_v3 when making Text to Speech requests.
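
As a sketch of what specifying the model ID looks like, the snippet below prepares a Text to Speech request with eleven_v3. The host, route, and xi-api-key header are assumptions, and the voice ID and API key are placeholders.

```python
import json
import urllib.request

API_BASE = "https://api.elevenlabs.io"  # assumed host

def build_v3_request(voice_id, text, api_key):
    """Prepare (but do not send) a Text to Speech request that selects Eleven v3."""
    url = f"{API_BASE}/v1/text-to-speech/{voice_id}"  # assumed route
    body = json.dumps({"text": text, "model_id": "eleven_v3"}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
    )

req = build_v3_request("YOUR_VOICE_ID", "Hello from Eleven v3", "YOUR_API_KEY")
print(req.get_method(), req.full_url)
```

Sending the request with urllib.request.urlopen(req) would return the audio bytes if the assumptions above hold.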

Additionally, the Text to Dialogue API endpoint is now available to all users.

Music Generation API

The Eleven Music API is now freely available to all paid users.

Visit the quickstart to learn how to integrate. The API section below highlights the new endpoints that have been released.

Global TTS API preview

ElevenLabs is launching inference servers in additional geographical regions to reduce latency for clients outside of the US. Initial request processing will be available in the Netherlands and in Singapore in addition to the US.

To learn how to get started, head to the docs.

API

New Endpoints

Updated Endpoints

Text to Speech

Voice Management

Speech to Text

Usage and Analytics

  • Updated Usage endpoints:
    • Get character stats - Added aggregation bucket size parameter and improved breakdown type options

Workspace Management

Music

Eleven Music: Officially released new music generation model that creates studio-grade music with natural language prompts in any style. See the capabilities page and prompting guide for more information.

SDKs

v2.9.0 of the TypeScript SDK released

  • Includes better typing support for Speech to Text requests in webhook mode
  • Includes new enums for ChatGPT 5

v2.9.2 of the Python SDK released

  • Includes new enums for ChatGPT 5

Agents Platform

Agent response correction: Updated WebSocket event schema and handling for improved agent response correction functionality.

API

User Account Changes

  • Updated user account endpoint:
    • Get user subscription info - Deprecated convai_chars_per_minute and convai_asr_chars_per_minute fields in the response schema. These fields will now always return None.

Parameter Removals

  • Updated conversation token endpoint:
    • Get conversation token - Removed source and version query parameters. These were internal parameters not meant for public use and their removal does not affect functionality.

Agents Platform

  • Conversation token generation: Added new route to generate Conversation Tokens for WebRTC connections.
  • Expandable widget options: Our embeddable widget can now be customized to start in the expanded state and disable collapsing altogether.
  • Simplified operation IDs: We simplified the OpenAPI operation IDs for Agents Platform endpoints to improve developer experience.

Workspaces

  • Simplified operation IDs: We simplified the operation IDs for our workspace endpoints to improve API usability.

SDK Releases

  • Python SDK v2.8.2: Released latest version with improvements and bug fixes.

NPM Packages

  • @elevenlabs/react-native@0.1.2: Enhanced React Native support
  • @elevenlabs/client@0.4.4: Client library improvements
  • @elevenlabs/react@0.4.5: React component updates

API

New Endpoints

Agents Platform

Updated Endpoints

Voice Management

  • List voices - Added voice_ids query parameter for filtering specific voices

Agents Platform Core

Operation ID Improvements

  • Agents Platform endpoints: Simplified operation IDs for better developer experience while maintaining full backward compatibility
  • Workspace endpoints: Streamlined operation IDs across all workspace-related endpoints to improve API usability

Workspaces

  • Service account API key management: Added comprehensive API endpoints for managing service account API keys, including creation, retrieval, updating, and deletion capabilities. See Service Accounts documentation.

Agents Platform

  • Post-call webhook migration: The post-call webhook format is being migrated so that webhook handlers can be auto-generated in the SDKs. This is not a breaking change, and no further action is required if your current handler accepts additional fields. Please see more information here.
  • Agent transfer improvements: Fixed system variable system_agent_id to properly update after agent-to-agent transfers, ensuring accurate conversation context tracking. Added new system_current_agent_id variable for tracking current active agent. Learn more about dynamic variables.
  • Enhanced public agent page: Added text input functionality and dynamic variable support to the public talk-to-agent page. You can now pass dynamic variables via URL parameters (e.g., ?var_username=value) and use text input during voice conversations. See dynamic variables guide.
  • Voicemail detection: Added voicemail detection as a built-in tool for ElevenLabs agents to improve call handling. Learn about voicemail detection.
  • Conversation filtering: Added user_id query parameter to conversation list endpoint for filtering conversations by initiating user.
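
The URL-parameter form of dynamic variables mentioned in the public agent page entry (for example ?var_username=value) can be sketched as a small helper. The page URL used here is a placeholder and the helper name is hypothetical; only the var_ prefix convention comes from the entry.

```python
from urllib.parse import urlencode, urlsplit

def with_dynamic_variables(page_url, variables):
    """Append dynamic variables to a URL as var_-prefixed query parameters."""
    query = urlencode({f"var_{name}": value for name, value in variables.items()})
    # Reuse the existing query string if the URL already has one.
    separator = "&" if urlsplit(page_url).query else "?"
    return f"{page_url}{separator}{query}"

# Placeholder page URL; substitute the real talk-to-agent link.
page = with_dynamic_variables("https://example.com/talk-to?agent_id=agent_123", {"username": "Ada"})
print(page)
```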

Speech to Text

  • Multi-channel transcription: Added use_multi_channel parameter to transcription endpoint for processing audio files with multiple speakers on separate channels. Supports up to 5 channels with per-channel transcription results. See multichannel guide.
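
Because the entry above states a limit of 5 channels, a client may want to check a file before uploading it. The sketch below does this for WAV input with Python's standard wave module and returns the form field to send. The string value "true" and the helper names are assumptions, and silent_wav exists only to make the example self-contained.

```python
import io
import wave

MAX_CHANNELS = 5  # limit stated in the release note

def multichannel_form_fields(wav_bytes):
    """Inspect a WAV file and return the use_multi_channel form field for it."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        channels = wav.getnchannels()
    if channels > MAX_CHANNELS:
        raise ValueError(f"{channels} channels exceeds the limit of {MAX_CHANNELS}")
    return {"use_multi_channel": "true" if channels > 1 else "false"}

def silent_wav(channels, frames=8):
    """Create a tiny silent 16-bit WAV file in memory for demonstration."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(2)
        wav.setframerate(16000)
        wav.writeframes(b"\x00\x00" * channels * frames)
    return buffer.getvalue()

fields = multichannel_form_fields(silent_wav(2))
print(fields)
```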

Studio

  • Caption support: Added caption functionality to Studio projects with new captions_enabled and caption_style properties for both podcasts and general projects. Learn more about Studio.

SDKs

API Schema Updates

New Endpoints

  • Service Account Management: Added 5 new endpoints for service account API key management:
    • GET /v1/service-accounts/{service_account_user_id}/api-keys - Retrieve service account API keys
    • POST /v1/service-accounts/{service_account_user_id}/api-keys - Create service account API key
    • DELETE /v1/service-accounts/{service_account_user_id}/api-keys/{api_key_id} - Delete service account API key
    • PATCH /v1/service-accounts/{service_account_user_id}/api-keys/{api_key_id} - Update service account API key
    • GET /v1/service-accounts - Get workspace service accounts
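
The five routes listed above can be captured in a small lookup table, which may be handy when wiring them into an HTTP client. The methods and paths are taken from the list; the operation names and the resolve helper are hypothetical.

```python
# Methods and path templates as listed in this release; the keys are made-up labels.
SERVICE_ACCOUNT_ROUTES = {
    "list_accounts": ("GET", "/v1/service-accounts"),
    "list_keys": ("GET", "/v1/service-accounts/{service_account_user_id}/api-keys"),
    "create_key": ("POST", "/v1/service-accounts/{service_account_user_id}/api-keys"),
    "update_key": ("PATCH", "/v1/service-accounts/{service_account_user_id}/api-keys/{api_key_id}"),
    "delete_key": ("DELETE", "/v1/service-accounts/{service_account_user_id}/api-keys/{api_key_id}"),
}

def resolve(operation, **path_params):
    """Return (method, path) for an operation with path parameters filled in."""
    method, template = SERVICE_ACCOUNT_ROUTES[operation]
    return method, template.format(**path_params)

route = resolve("delete_key", service_account_user_id="sa_1", api_key_id="key_9")
print(route)
```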

Removed Endpoints

  • Legacy Project Endpoints: Removed 22 deprecated project management endpoints as part of Studio API consolidation:
    • All /v1/projects/* endpoints (replaced by /v1/studio/projects/*)
    • Legacy Text to Voice endpoints (/v1/text-to-voice/create-voice-from-preview, /v1/text-to-voice/remixing-sessions/*)
    • Legacy ConvAI knowledge base endpoints
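
For clients still calling the removed /v1/projects/* routes, the prefix change can be sketched as a simple path rewrite. This assumes the sub-paths under the new /v1/studio/projects prefix are otherwise unchanged, which this entry does not confirm; check each route against the Studio API reference.

```python
LEGACY_PREFIX = "/v1/projects"
STUDIO_PREFIX = "/v1/studio/projects"

def migrate_project_path(path):
    """Rewrite a legacy project path to the Studio prefix; leave other paths alone."""
    if path == LEGACY_PREFIX or path.startswith(LEGACY_PREFIX + "/"):
        return STUDIO_PREFIX + path[len(LEGACY_PREFIX):]
    return path

migrated = migrate_project_path("/v1/projects/proj_42/chapters")
print(migrated)
```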

Updated Endpoints

Speech to Text

  • Multi-channel support: Updated /v1/speech-to-text endpoint:
    • Added use_multi_channel parameter for processing multi-speaker audio files
    • Modified response structure to include optional language_code, language_probability, text, and words properties

Agents Platform

  • Enhanced agent configuration: Updated agent creation and management endpoints:
    • Added voicemail detection to built-in tools
    • Improved RAG configuration with max_retrieved_rag_chunks_count parameter
    • Enhanced conversation token endpoint with source and version parameters
    • Added user_id filtering to conversations list endpoint

Studio Projects

  • Caption support: Updated Studio project endpoints to include:
    • captions_enabled property for enabling/disabling captions
    • caption_style property for global caption styling configuration

Text to Voice

  • Improved voice generation: Enhanced voice creation endpoints with:
    • loudness control (-1 to 1 range, 0 corresponds to -24 LUFS)
    • quality parameter for balancing output quality vs variety
    • guidance_scale parameter for controlling AI creativity vs prompt adherence

Agents Platform

  • Agent workspace overrides: Enhanced agent configuration with workspace-level overrides for better enterprise management and customization.
  • Agent API improvements: Updated agent creation and modification endpoints with enhanced configuration options, though these changes may break backward compatibility.

Dubbing

  • Dubbing endpoint access: Added new endpoint to list all available dubs.

API

New Endpoints

Updated Endpoints

Text to Speech

Voice Management

  • Updated Voice endpoints with backward compatible improvements:

Voice Creation

Dubbing

Workspace Management

Speech to Text

  • Updated Speech to Text endpoint:

Agents Platform

Updated Agents Platform endpoints:

Other Updates

Agents Platform

  • Azure OpenAI custom LLM support: Added support for Azure-hosted OpenAI models in custom LLM configurations. When using an Azure endpoint, a new required field for API version is now available in the UI.
  • Genesys output variables: Added support for output variables when using Genesys integrations, enabling better call analytics and data collection.
  • Gemini 2.5 Preview Models Deprecation: Models gemini-2.5-flash-preview-05-20 and gemini-2.5-flash-preview-04-17 have been deprecated in Agents Platform because Google is retiring them on 15th July. All agents using these models will automatically be transferred to gemini-2.5-flash the next time they are used. No action is required.
  • WebRTC rollout: Began progressive rollout of WebRTC capabilities for improved connection stability and performance. WebRTC mode can be selected in the React SDK and is used in 11.ai.
  • Keypad touch tone: Fixed an issue affecting playing keypad touch tones on Twilio. See keypad touch tone documentation.

Voices

  • Language collection navigation: Added quick navigation from language preview collections to view all available voices in that language, making it easier to explore voice options by language.

Text to Voice

  • Preview streaming: Added new streaming endpoint for Text to Voice previews, allowing real-time streaming of generated voice previews via /v1/text-to-voice/{generated_voice_id}/stream.
  • Enhanced voice design: Added stream_previews option to voice design endpoint, enabling streaming-only preview generation for improved performance.
  • Improved parameter controls: Enhanced loudness, quality, and guidance scale parameters with better control options for more precise voice generation.

Studio

  • Podcast customization: Added support for intro and outro text in podcast creation, along with custom instructions prompts for better style and tone control.

SDKs

API

New Endpoints

Updated Endpoints

Text to Voice

  • Create voice previews - Enhanced loudness, quality, and guidance_scale parameter descriptions
  • Design voice - Added stream_previews property for streaming-only preview generation

Studio

  • Create podcast - Added intro, outro, and instructions_prompt properties

Agents Platform

Agents Platform

  • HIPAA Compliance: Gemini 2.5 Flash is now available for HIPAA customers, providing enhanced AI capabilities while maintaining strict healthcare compliance standards.

  • Post-call Audio: Added support for returning call audio in post-call webhooks, enabling comprehensive conversation analysis and quality assurance workflows.

  • Enhanced Widget: Added additional text customization options including start chat button text, chatting status text, and input placeholders for text-only and new conversations.

  • Agent Transfers: Improved agent transfer capabilities with transfer delay configuration, custom transfer messages, and control over transferred agent first message behavior.

  • SIP Trunk Enhancements: Added support for separate inbound and outbound SIP trunk configurations with enhanced access control and transfer options.

Dubbing

  • API Schema Update: Updated our API documentation to explicitly require the target_language parameter for dubbing projects. This parameter has always been required - we’re just making it clearer in our docs. No code changes needed.

  • Duration Validation: Added validation to ensure calculated duration makes sense, preventing zero-credit charges for invalid audio uploads.

Speech to Text

  • Deterministic Sampling: Added seed parameter support for deterministic sampling, enabling reproducible speech-to-text results.

Forced Alignment

  • Confidence Scoring: Added confidence scoring with loss field for words and overall transcript accuracy assessment using forced alignment.

Usage Analytics

  • Workspace Breakdown: Added reporting workspace ID breakdown for character usage statistics, providing detailed usage insights across workspaces.

SDKs

  • React Agents Platform SDK: Released v0.2.0 with support for Indian data residency and WebRTC mode for Agents Platform.
  • Python SDK: Released v2.6.1 with enhanced Agents Platform capabilities and bug fixes.
  • JavaScript SDK: Released v2.5.0 with improved Agents Platform SDK support and new features.

API

Deprecations

  • POST /v1/convai/phone-numbers/create has been deprecated in favor of POST /v1/convai/phone-numbers. Please note that migrating to the new endpoint requires a few adjustments:
    • Replace provider_config field with inbound_trunk and outbound_trunk for SIP trunk configurations
    • Update response parsing to handle the new trunk configuration structure

Schema Removals

  • Removed SIPTrunkConfigResponseModel, SIPTrunkCredentials, TransferToNumberToolConfig
  • Removed incomplete_expired and canceled subscription statuses

New Features

Enhanced SIP Trunk Support

  • SIP trunk configuration now uses separate inbound and outbound trunk configs instead of single configuration
  • Deprecated provider_config field in SIP trunk response from the new endpoint (replaced with inbound_trunk and outbound_trunk)
  • Inbound trunk access control with allowed addresses and phone numbers
  • SIP URI transfer destinations alongside phone number transfers
  • Transfer to number improvements (conference or SIP refer)

Agent Transfers

Conversation Enhancements

Widget Improvements

  • Additional text customization options:
    • Start chat button text
    • Chatting status text
    • Input placeholders for text-only and new conversations

API Improvements

Speech to Text

Forced Alignment

  • Added confidence scoring with loss field for words and overall transcript in Forced alignment

Usage Analytics

Tool Configuration

  • Client tool response timeout increased from 30 to 120 seconds

Workspace Resources

  • Added agent response tests resource type

Deprecations

  • Phone number provider_config field (use inbound_trunk/outbound_trunk instead)
  • phone_number field in transfer configurations (use transfer_destination instead)
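
To help locate the deprecated fields listed above in existing payloads, a recursive scan along the following lines may be useful. It is a sketch: the example payload structure is invented, and because the scan matches on key name alone it will also flag phone_number keys outside transfer configurations.

```python
# Deprecated field name -> replacement named in this release.
DEPRECATED_FIELDS = {
    "provider_config": "inbound_trunk / outbound_trunk",
    "phone_number": "transfer_destination",
}

def find_deprecated_fields(payload, path=""):
    """Return (location, replacement) pairs for deprecated keys in nested data."""
    findings = []
    if isinstance(payload, dict):
        for key, value in payload.items():
            here = f"{path}.{key}" if path else key
            if key in DEPRECATED_FIELDS:
                findings.append((here, DEPRECATED_FIELDS[key]))
            findings.extend(find_deprecated_fields(value, here))
    elif isinstance(payload, list):
        for index, item in enumerate(payload):
            findings.extend(find_deprecated_fields(item, f"{path}[{index}]"))
    return findings

# Hypothetical payload shape, used only to exercise the scan.
example = {"provider_config": {}, "transfers": [{"phone_number": "+15550100"}]}
findings = find_deprecated_fields(example)
for location, replacement in findings:
    print(f"{location}: use {replacement} instead")
```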