Conversational AI

  • Agent testing framework: Introduced a comprehensive testing framework for conversational AI agents, allowing developers to create, manage, and execute automated tests for their agents. This includes test creation, execution tracking, and result analysis capabilities.
  • Test invocation management: Added support for resubmitting failed test invocations and viewing detailed test results to help developers debug and improve their agents.
  • Enhanced agent configuration: Improved agent creation and management with additional workspace override capabilities and refined platform settings.

Text to Speech

  • Pronunciation dictionary updates: Added support for updating pronunciation dictionaries with PATCH operations, enabling more flexible dictionary management.
  • Enhanced timestamp support: Improved timestamp generation for text-to-speech conversions with better alignment data and streaming capabilities.

SDK Releases

  • TypeScript SDK v2.12.2: Updated with the latest API schema changes, including full support for the new agent testing endpoints and enhanced conversational AI capabilities.
  • Python SDK v2.12.1: Released with complete support for all new API features, including agent testing framework and improved workspace resource management.

API

New Endpoints

Added 10 new endpoints this week:

Conversational AI Agent Testing

Pronunciation Dictionaries

  • PATCH /v1/pronunciation-dictionaries/{pronunciation_dictionary_id} - Update Pronunciation Dictionary - Update existing pronunciation dictionaries with new rules or modifications
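
As a rough illustration of the PATCH route above, the sketch below only assembles the request description and does not send it. The base URL, the `build_dictionary_patch` helper, and the `name` field in the example body are assumptions for illustration; this changelog does not describe the request body schema.

```python
from urllib.parse import quote

# Assumed base URL; the changelog only lists the path.
BASE_URL = "https://api.elevenlabs.io"


def build_dictionary_patch(dictionary_id: str, changes: dict) -> dict:
    """Describe a PATCH request for one pronunciation dictionary (hypothetical helper)."""
    if not dictionary_id:
        raise ValueError("dictionary_id must be a non-empty string")
    # Escape the ID so it cannot alter the path structure.
    path = f"/v1/pronunciation-dictionaries/{quote(dictionary_id, safe='')}"
    return {"method": "PATCH", "url": BASE_URL + path, "json": dict(changes)}


# "name" is a placeholder field, not a documented one.
request = build_dictionary_patch("dict_123", {"name": "Product names"})
print(request["method"], request["url"])
```

The returned dictionary can be handed to whichever HTTP client you already use.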

Eleven v3 API

Eleven v3 is now available via the API.

To start using it, simply specify the model ID eleven_v3 when making Text to Speech requests.
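
A minimal sketch of what selecting the model might look like when assembling a request. Only the `eleven_v3` model ID comes from this changelog; the endpoint path, the `xi-api-key` header, the `model_id` field name, and the `build_tts_request` helper are assumptions to check against the API reference.

```python
import json


def build_tts_request(voice_id: str, text: str, api_key: str) -> dict:
    """Assemble (but do not send) a Text to Speech request that selects Eleven v3."""
    return {
        "method": "POST",
        # Assumed endpoint shape; confirm against the API reference.
        "url": f"https://api.elevenlabs.io/v1/text-to-speech/{voice_id}",
        # Assumed authentication header name.
        "headers": {"xi-api-key": api_key, "Content-Type": "application/json"},
        # The model ID is the only value taken from the changelog itself.
        "body": json.dumps({"text": text, "model_id": "eleven_v3"}),
    }


tts_request = build_tts_request("voice_abc", "Hello there.", "YOUR_API_KEY")
print(tts_request["url"])
```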

Additionally, the Text to Dialogue API endpoint is now available to all users.

Music Generation API

The Eleven Music API is now freely available to all paid users.

Visit the quickstart to learn how to integrate. The API section below highlights the new endpoints that have been released.

Global TTS API preview

ElevenLabs is launching inference servers in additional geographical regions to reduce latency for clients outside of the US. Initial request processing will be available in the Netherlands and in Singapore in addition to the US.

To learn how to get started, head to the docs.

API

New Endpoints

Updated Endpoints

Text to Speech

Voice Management

Speech to Text

Usage and Analytics

  • Updated Usage endpoints:
    • Get character stats - Added aggregation bucket size parameter and improved breakdown type options

Workspace Management

Music

Eleven Music: Officially released new music generation model that creates studio-grade music with natural language prompts in any style. See the capabilities page and prompting guide for more information.

SDKs

v2.9.0 of the TypeScript SDK released

  • Includes better typing support for Speech to Text requests in webhook mode
  • Includes new enums for ChatGPT 5

v2.9.2 of the Python SDK released

  • Includes new enums for ChatGPT 5

Conversational AI

Agent response correction: Updated WebSocket event schema and handling for improved agent response correction functionality.

API

User Account Changes

  • Updated user account endpoint:
    • Get user subscription info - Deprecated convai_chars_per_minute and convai_asr_chars_per_minute fields in the response schema. These fields will now always return None.

Parameter Removals

  • Updated conversation token endpoint:
    • Get conversation token - Removed source and version query parameters. These were internal parameters not meant for public use and their removal does not affect functionality.
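
If existing client code still appends these two parameters, one way to clean up stored URLs is sketched below. The `strip_removed_params` helper, the example host, and the `agent_id` parameter are hypothetical and used only for illustration.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# The two query parameters named as removed in the entry above.
REMOVED_PARAMS = {"source", "version"}


def strip_removed_params(url: str) -> str:
    """Return the URL without the removed query parameters, keeping all others in order."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in REMOVED_PARAMS
    ]
    return urlunsplit(parts._replace(query=urlencode(kept)))


# Placeholder URL; agent_id is an illustrative parameter only.
print(strip_removed_params("https://example.com/token?agent_id=a1&source=sdk&version=1"))
```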

Conversational AI

  • Conversation token generation: Added new route to generate Conversation Tokens for WebRTC connections. Learn more
  • Expandable widget options: Our embeddable widget can now be customized to start in the expanded state and disable collapsing altogether.
  • Simplified operation IDs: We simplified the OpenAPI operation IDs for conversational AI endpoints to improve developer experience.

Workspaces

  • Simplified operation IDs: We simplified the operation IDs for our workspace endpoints to improve API usability.

SDK Releases

  • Python SDK v2.8.2: Released latest version with improvements and bug fixes. View release

NPM Packages

  • @elevenlabs/react-native@0.1.2: Enhanced React Native support
  • @elevenlabs/client@0.4.4: Client library improvements
  • @elevenlabs/react@0.4.5: React component updates

API

New Endpoints

Conversational AI

Updated Endpoints

Voice Management

  • List voices - Added voice_ids query parameter for filtering specific voices

Conversational AI Core

Operation ID Improvements

  • Conversational AI endpoints: Simplified operation IDs for better developer experience while maintaining full backward compatibility
  • Workspace endpoints: Streamlined operation IDs across all workspace-related endpoints to improve API usability

Workspaces

  • Service account API key management: Added comprehensive API endpoints for managing service account API keys, including creation, retrieval, updating, and deletion capabilities. See Service Accounts documentation.

Conversational AI

  • Post-call webhook migration: The post-call webhook format is being migrated so that webhook handlers can be auto-generated in the SDKs. This is not a breaking change, and no further action is required if your current handler accepts additional fields. Please see more information here.
  • Agent transfer improvements: Fixed the system variable system_agent_id so that it updates properly after agent-to-agent transfers, ensuring accurate conversation context tracking. Added a new system_current_agent_id variable for tracking the currently active agent. Learn more about dynamic variables.
  • Enhanced public agent page: Added text input functionality and dynamic variable support to the public talk-to-agent page. You can now pass dynamic variables via URL parameters (e.g., ?var_username=value) and use text input during voice conversations. See dynamic variables guide.
  • Voicemail detection: Added voicemail detection as a built-in tool for conversational AI agents to improve call handling. Learn about voicemail detection.
  • Conversation filtering: Added user_id query parameter to conversation list endpoint for filtering conversations by initiating user.
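
The URL-parameter form of dynamic variables mentioned above (`?var_username=value`) can be sketched as a small helper. The `with_dynamic_variables` function and the example page URL are hypothetical; only the `var_` prefix convention comes from the entry above.

```python
from typing import Mapping
from urllib.parse import urlencode


def with_dynamic_variables(page_url: str, variables: Mapping[str, str]) -> str:
    """Append dynamic variables to a URL using the var_<name>=<value> convention."""
    if not variables:
        return page_url
    # urlencode escapes values, so spaces and symbols are safe to pass.
    query = urlencode({f"var_{name}": value for name, value in variables.items()})
    separator = "&" if "?" in page_url else "?"
    return f"{page_url}{separator}{query}"


# Placeholder address; substitute your agent's public page URL.
print(with_dynamic_variables("https://example.com/talk", {"username": "Ada Lovelace"}))
```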

Speech to Text

  • Multi-channel transcription: Added use_multi_channel parameter to transcription endpoint for processing audio files with multiple speakers on separate channels. Supports up to 5 channels with per-channel transcription results. See multichannel guide.
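
A small pre-flight check can catch files that exceed the stated channel limit before uploading. This is a sketch only: the `multichannel_fields` helper is hypothetical, and sending the flag as the string "true" in a form field is an assumption about the request encoding.

```python
# Channel limit stated in the entry above.
MAX_CHANNELS = 5


def multichannel_fields(channel_count: int) -> dict:
    """Return form fields enabling multi-channel mode, or raise if the file has too many channels."""
    if not 1 <= channel_count <= MAX_CHANNELS:
        raise ValueError(
            f"expected between 1 and {MAX_CHANNELS} channels, got {channel_count}"
        )
    # Assumed encoding of the boolean flag as a form value.
    return {"use_multi_channel": "true"}


print(multichannel_fields(2))
```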

Studio

  • Caption support: Added caption functionality to Studio projects with new captions_enabled and caption_style properties for both podcasts and general projects. Learn more about Studio.

SDKs

API Schema Updates

New Endpoints

  • Service Account Management: Added 5 new endpoints for service account API key management:
    • GET /v1/service-accounts/{service_account_user_id}/api-keys - Retrieve service account API keys
    • POST /v1/service-accounts/{service_account_user_id}/api-keys - Create service account API key
    • DELETE /v1/service-accounts/{service_account_user_id}/api-keys/{api_key_id} - Delete service account API key
    • PATCH /v1/service-accounts/{service_account_user_id}/api-keys/{api_key_id} - Update service account API key
    • GET /v1/service-accounts - Get workspace service accounts
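
The five routes listed above can be collected into a lookup table so client code refers to operations by name. The operation names and the `route` helper are hypothetical; the methods and path templates are copied from the list.

```python
# Operation names are illustrative; methods and paths come from the list above.
ROUTES = {
    "list_accounts": ("GET", "/v1/service-accounts"),
    "list_keys": ("GET", "/v1/service-accounts/{service_account_user_id}/api-keys"),
    "create_key": ("POST", "/v1/service-accounts/{service_account_user_id}/api-keys"),
    "update_key": ("PATCH", "/v1/service-accounts/{service_account_user_id}/api-keys/{api_key_id}"),
    "delete_key": ("DELETE", "/v1/service-accounts/{service_account_user_id}/api-keys/{api_key_id}"),
}


def route(operation: str, **ids: str) -> tuple:
    """Return (method, path) for an operation, filling in the path parameters."""
    method, template = ROUTES[operation]
    # A missing ID raises KeyError, which surfaces mistakes early.
    return method, template.format(**ids)


print(route("delete_key", service_account_user_id="sa_1", api_key_id="key_9"))
```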

Removed Endpoints

  • Legacy Project Endpoints: Removed 22 deprecated project management endpoints as part of Studio API consolidation:
    • All /v1/projects/* endpoints (replaced by /v1/studio/projects/*)
    • Legacy Text to Voice endpoints (/v1/text-to-voice/create-voice-from-preview, /v1/text-to-voice/remixing-sessions/*)
    • Legacy ConvAI knowledge base endpoints
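
For callers still using the old project paths, a prefix rewrite along these lines may ease migration. It is a sketch that assumes sub-paths carry over unchanged under the new prefix, which this changelog does not guarantee; the `migrate_project_path` helper is hypothetical.

```python
LEGACY_PREFIX = "/v1/projects"
STUDIO_PREFIX = "/v1/studio/projects"


def migrate_project_path(path: str) -> str:
    """Rewrite a legacy /v1/projects path to its /v1/studio/projects equivalent."""
    # Match the prefix only on a path-segment boundary.
    if path == LEGACY_PREFIX or path.startswith(LEGACY_PREFIX + "/"):
        return STUDIO_PREFIX + path[len(LEGACY_PREFIX):]
    return path


print(migrate_project_path("/v1/projects/abc/chapters"))
```

Paths that already use the Studio prefix, or that merely begin with similar text, are returned unchanged.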

Updated Endpoints

Speech to Text

  • Multi-channel support: Updated /v1/speech-to-text endpoint:
    • Added use_multi_channel parameter for processing multi-speaker audio files
    • Modified response structure to include optional language_code, language_probability, text, and words properties

Conversational AI

  • Enhanced agent configuration: Updated agent creation and management endpoints:
    • Added voicemail detection to built-in tools
    • Improved RAG configuration with max_retrieved_rag_chunks_count parameter
    • Enhanced conversation token endpoint with source and version parameters
    • Added user_id filtering to conversations list endpoint

Studio Projects

  • Caption support: Updated Studio project endpoints to include:
    • captions_enabled property for enabling/disabling captions
    • caption_style property for global caption styling configuration

Text to Voice

  • Improved voice generation: Enhanced voice creation endpoints with:
    • loudness control (-1 to 1 range, 0 corresponds to -24 LUFS)
    • quality parameter for balancing output quality vs variety
    • guidance_scale parameter for controlling AI creativity vs prompt adherence

Conversational AI

  • Agent workspace overrides: Enhanced agent configuration with workspace-level overrides for better enterprise management and customization.
  • Agent API improvements: Updated agent creation and modification endpoints with enhanced configuration options, though these changes may break backward compatibility.

Dubbing

  • Dubbing endpoint access: Added new endpoint to list all available dubs.

API

New Endpoints

Updated Endpoints

Text to Speech

Voice Management

  • Updated Voice endpoints with backward compatible improvements:

Voice Creation

Dubbing

Workspace Management

Speech to Text

  • Updated Speech to Text endpoint:

Conversational AI

Updated Conversational AI endpoints:

Other Updates

Conversational AI

  • Azure OpenAI custom LLM support: Added support for Azure-hosted OpenAI models in custom LLM configurations. When using an Azure endpoint, a new required field for API version is now available in the UI.
  • Genesys output variables: Added support for output variables when using Genesys integrations, enabling better call analytics and data collection.
  • Gemini 2.5 Preview Models Deprecation: The models gemini-2.5-flash-preview-05-20 and gemini-2.5-flash-preview-04-17 have been deprecated in Conversational AI because Google is retiring them on 15th July. All agents using these models will automatically be switched to gemini-2.5-flash the next time they are used. No action is required.
  • WebRTC rollout: Began progressive rollout of WebRTC capabilities for improved connection stability and performance. WebRTC mode can be selected in the React SDK and is used in 11.ai.
  • Keypad touch tone: Fixed an issue affecting playing keypad touch tones on Twilio. See keypad touch tone documentation.
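
Although the Gemini preview model switch is automatic, teams that store model IDs in their own configuration may want to mirror it locally. The sketch below reproduces the mapping described above; the `resolve_model` helper is hypothetical and not part of any SDK.

```python
# Model IDs taken from the deprecation notice above.
DEPRECATED_PREVIEWS = frozenset(
    {"gemini-2.5-flash-preview-05-20", "gemini-2.5-flash-preview-04-17"}
)
REPLACEMENT = "gemini-2.5-flash"


def resolve_model(model_id: str) -> str:
    """Map a deprecated preview model ID to its replacement; leave other IDs untouched."""
    return REPLACEMENT if model_id in DEPRECATED_PREVIEWS else model_id


print(resolve_model("gemini-2.5-flash-preview-05-20"))
```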

Voices

  • Language collection navigation: Added quick navigation from language preview collections to view all available voices in that language, making it easier to explore voice options by language.

Text to Voice

  • Preview streaming: Added new streaming endpoint for Text to Voice previews, allowing real-time streaming of generated voice previews via /v1/text-to-voice/{generated_voice_id}/stream.
  • Enhanced voice design: Added stream_previews option to voice design endpoint, enabling streaming-only preview generation for improved performance.
  • Improved parameter controls: Enhanced loudness, quality, and guidance scale parameters with better control options for more precise voice generation.

Studio

  • Podcast customization: Added support for intro and outro text in podcast creation, along with custom instructions prompts for better style and tone control.

SDKs

API

New Endpoints

Updated Endpoints

Text to Voice

  • Create voice previews - Enhanced loudness, quality, and guidance_scale parameter descriptions
  • Design voice - Added stream_previews property for streaming-only preview generation

Studio

  • Create podcast - Added intro, outro, and instructions_prompt properties

Conversational AI

Conversational AI

  • HIPAA Compliance: Gemini 2.5 Flash is now available for HIPAA customers, providing enhanced AI capabilities while maintaining strict healthcare compliance standards.

  • Post-call Audio: Added support for returning call audio in post-call webhooks, enabling comprehensive conversation analysis and quality assurance workflows.

  • Enhanced Widget: Added additional text customization options including start chat button text, chatting status text, and input placeholders for text-only and new conversations.

  • Agent Transfers: Improved agent transfer capabilities with transfer delay configuration, custom transfer messages, and control over transferred agent first message behavior.

  • SIP Trunk Enhancements: Added support for separate inbound and outbound SIP trunk configurations with enhanced access control and transfer options.

Dubbing

  • API Schema Update: Updated our API documentation to explicitly require the target_language parameter for dubbing projects. This parameter has always been required - we’re just making it clearer in our docs. No code changes needed.

  • Duration Validation: Added validation to ensure calculated duration makes sense, preventing zero-credit charges for invalid audio uploads.

Speech to Text

  • Deterministic Sampling: Added seed parameter support for deterministic sampling, enabling reproducible speech-to-text results.

Forced Alignment

  • Confidence Scoring: Added confidence scoring to forced alignment through a loss field reported for each word and for the overall transcript.

Usage Analytics

  • Workspace Breakdown: Added reporting workspace ID breakdown for character usage statistics, providing detailed usage insights across workspaces.

SDKs

  • React Conversational AI SDK: Released v0.2.0 with support for Indian data residency and WebRTC mode for Conversational AI.
  • Python SDK: Released v2.6.1 with enhanced Conversational AI capabilities and bug fixes.
  • JavaScript SDK: Released v2.5.0 with improved Conversational AI SDK support and new features.

API

Deprecations

  • POST /v1/convai/phone-numbers/create has been deprecated in favor of POST /v1/convai/phone-numbers. Please note that migrating to the new endpoint requires a few adjustments:
    • Replace provider_config field with inbound_trunk and outbound_trunk for SIP trunk configurations
    • Update response parsing to handle the new trunk configuration structure

Schema Removals

  • Removed SIPTrunkConfigResponseModel, SIPTrunkCredentials, TransferToNumberToolConfig
  • Removed incomplete_expired and canceled subscription statuses

New Features

Enhanced SIP Trunk Support

  • SIP trunk configuration now uses separate inbound and outbound trunk configs instead of single configuration
  • Deprecated provider_config field in SIP trunk response from the new endpoint (replaced with inbound_trunk and outbound_trunk)
  • Inbound trunk access control with allowed addresses and phone numbers
  • SIP URI transfer destinations alongside phone number transfers
  • Transfer to number improvements (conference or SIP refer)

Agent Transfers

Conversation Enhancements

Widget Improvements

  • Additional text customization options:
    • Start chat button text
    • Chatting status text
    • Input placeholders for text-only and new conversations

API Improvements

Speech to Text

Forced Alignment

  • Added confidence scoring with a loss field for words and for the overall transcript in Forced Alignment

Usage Analytics

Tool Configuration

  • Client tool response timeout increased from 30 to 120 seconds

Workspace Resources

  • Added agent response tests resource type

Deprecations

  • Phone number provider_config field (use inbound_trunk/outbound_trunk instead)
  • phone_number field in transfer configurations (use transfer_destination instead)
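
A simple check like the following can flag the two deprecated fields in configuration payloads before they are sent. It is a sketch: the `deprecated_fields` helper is hypothetical, it inspects top-level keys only, and it does not know whether a given dictionary is a phone number or a transfer configuration, so apply it to the matching kind of payload.

```python
from typing import List

# Deprecated field -> replacement, as listed above.
REPLACEMENTS = {
    "provider_config": "inbound_trunk/outbound_trunk",
    "phone_number": "transfer_destination",
}


def deprecated_fields(config: dict) -> List[str]:
    """Return a warning for each deprecated top-level field present in config."""
    return [
        f"{field} is deprecated; use {replacement} instead"
        for field, replacement in REPLACEMENTS.items()
        if field in config
    ]


# Placeholder payload for illustration.
for warning in deprecated_fields({"provider_config": {}, "label": "Support line"}):
    print(warning)
```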

Text to Voice

  • Voice Design: Launched new Text to Voice Design with Eleven v3 for creating custom voices from text descriptions.

Speech to Text

  • Enhanced Diarization: Added diarization_threshold parameter to the Speech to Text endpoint. Fine-tune the balance between speaker accuracy and total speaker count by adjusting the threshold between 0.1 and 0.4.
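
Because the threshold is only meaningful between 0.1 and 0.4, validating it client-side avoids a round trip for out-of-range values. The `diarization_params` helper below is a hypothetical sketch, and treating both bounds as inclusive is an assumption.

```python
MIN_THRESHOLD = 0.1
MAX_THRESHOLD = 0.4


def diarization_params(threshold: float) -> dict:
    """Return request parameters with a validated diarization_threshold."""
    # Bounds are assumed to be inclusive.
    if not MIN_THRESHOLD <= threshold <= MAX_THRESHOLD:
        raise ValueError(
            f"diarization_threshold must be between {MIN_THRESHOLD} and {MAX_THRESHOLD}, got {threshold}"
        )
    return {"diarization_threshold": threshold}


print(diarization_params(0.25))
```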

Professional Voice Cloning

  • Background Noise Removal: Added remove_background_noise to clean up voice samples using audio isolation models for better quality training data.

Studio

Workspaces

  • Service Account Groups: Service accounts can now be added to workspace groups for better permission management and access control.

  • Workspace Authentication: Added support for workspace authentication connections, enabling secure webhook tool integrations with external services.

SDKs

  • Python SDK: Released v2.6.0 with latest API support and bug fixes.
  • JavaScript SDK: Released v2.5.0 with latest API support and bug fixes.
  • React Conversational AI SDK: Added WebRTC support in v0.2.0.

API

New Endpoints

Updated Endpoints

Speech to Text

Voice Management

  • Get voice sample audio - Added remove_background_noise query parameter and moved from request body to query parameters

Tools migration

Text to Speech

  • Audio tags automatic removal: Audio tags are now automatically removed when switching from v3 to v2 models, ensuring optimal compatibility and performance.

Conversational AI

  • Tools management UI: Added a new comprehensive tools management interface for creating, configuring, and managing tools across all agents in your workspace.
  • Streamlined agent creation: Introduced a new agent creation flow with improved user experience and better configuration options.
  • Agent duplication: Added the ability to duplicate existing agents, allowing you to quickly create variations of successful agent configurations.

SIP Trunking

Voices

  • Famous voice category: Added a new “famous” voice category to the voice library, expanding the available voice options for users.

Dubbing

  • CSV frame rate control: Added csv_fps parameter to control frame rate when parsing CSV files for dubbing projects, providing more precise timing control.

SDKs

  • ElevenLabs JavaScript SDK v2.4.0: Released with new Conversational AI SDK support for Node.js. View release notes
  • ElevenLabs Python SDK v2.5.0: Updated with enhanced Conversational AI capabilities. View release notes

API

New Endpoints

Conversational AI

Updated Endpoints

Conversational AI

  • Agent configuration:

    • Added built_in_tools configuration for system tools management
    • Deprecated inline tools configuration in favor of tool_ids for better tool management
  • Tool system:

    • Refactored tool configuration structure to use centralized tool management

Dubbing

SIP Trunking

Voice Library

  • Voice categories:
    • Updated voice response models to include “famous” as a new voice category option
    • Enhanced voice search and filtering capabilities