Changelog

June 23, 2025

Tools migration

Conversational AI tools migration: The way tools are handled in Conversational AI is being migrated. See the migration guide to understand what’s changing and how to migrate.

Text to Speech

Audio tags automatic removal: Audio tags are now automatically removed when switching from V3 to V2 models, ensuring optimal compatibility and performance.

Conversational AI

Tools management UI: Added a new comprehensive tools management interface for creating, configuring, and managing tools across all agents in your workspace.
Streamlined agent creation: Introduced a new agent creation flow with improved user experience and better configuration options.
Agent duplication: Added the ability to duplicate existing agents, allowing you to quickly create variations of successful agent configurations.

SIP Trunking

Inbound media encryption: Added support for configurable inbound media encryption settings for SIP trunk phone numbers, enhancing security options.

Voices

Famous voice category: Added a new “famous” voice category to the voice library, expanding the available voice options for users.

Dubbing

CSV frame rate control: Added csv_fps parameter to control frame rate when parsing CSV files for dubbing projects, providing more precise timing control.
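
To illustrate why the frame rate matters when parsing timing data, the sketch below converts a frame-based timecode to seconds. The HH:MM:SS:FF timecode layout and the function name are assumptions made for illustration, not the documented CSV format; only the csv_fps name comes from this entry.

```python
def timecode_to_seconds(timecode: str, csv_fps: float) -> float:
    """Convert an HH:MM:SS:FF timecode (assumed layout) to seconds."""
    if csv_fps <= 0:
        raise ValueError("csv_fps must be positive")
    hours, minutes, seconds, frames = (int(part) for part in timecode.split(":"))
    # The frame field only becomes a time offset once the frame rate is known.
    return hours * 3600 + minutes * 60 + seconds + frames / csv_fps


# The same frame count maps to different times at different frame rates.
print(timecode_to_seconds("00:01:02:12", csv_fps=24))
print(timecode_to_seconds("00:01:02:12", csv_fps=30))
```

A mismatch between the assumed and actual frame rate shifts every cue slightly, which is the kind of drift an explicit frame rate setting avoids.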

SDKs

ElevenLabs JavaScript SDK v2.4.0: Released with new Conversational AI SDK support for Node.js.
ElevenLabs Python SDK v2.5.0: Updated with enhanced Conversational AI capabilities.

API

New Endpoints

Conversational AI

Duplicate agent - Create a new agent by duplicating an existing one
Create tool - Add a new tool to the available tools in the workspace
List tools - Retrieve all tools available in the workspace
Get tool - Retrieve a specific tool configuration
Update tool - Update an existing tool configuration
Delete tool - Remove a tool from the workspace
Get tool dependent agents - List all agents that depend on a specific tool

Updated Endpoints

Conversational AI

Agent configuration:
- Added built_in_tools configuration for system tools management
- Deprecated inline tools configuration in favor of tool_ids for better tool management
Tool system:
- Refactored tool configuration structure to use centralized tool management

Dubbing

CSV processing:
- Create dubbing project - Added csv_fps parameter for custom frame rate control

SIP Trunking

Phone number creation:
- Create SIP trunk phone number - Added inbound_media_encryption parameter for security configuration

Voice Library

Voice categories:
- Updated voice response models to include “famous” as a new voice category option
- Enhanced voice search and filtering capabilities

June 17, 2025

Conversational AI

Dynamic variables in simulated conversations: Added support for dynamic variable population in simulated conversations, enabling more flexible and context-aware conversation testing scenarios.
MCP server integration: Introduced comprehensive support for Model Context Protocol (MCP) servers, allowing agents to connect to external tools and services through standardized protocols with configurable approval policies.
Burst pricing for extra concurrency: Added bursting capability for workspace call limits, automatically allowing up to 3x the configured concurrency limit during peak usage for overflow capacity.
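
As a rough illustration of the 3x rule, the sketch below classifies an incoming call against a configured concurrency limit. The function name, the "normal"/"burst"/"rejected" labels, and the assumption that calls beyond the burst ceiling are refused are all hypothetical; only the 3x multiplier comes from this entry.

```python
def classify_call(active_calls: int, configured_limit: int,
                  bursting_enabled: bool = True, burst_multiplier: int = 3) -> str:
    """Classify a new call given how many calls are already active."""
    if active_calls < configured_limit:
        return "normal"  # still within the configured concurrency limit
    if bursting_enabled and active_calls < configured_limit * burst_multiplier:
        return "burst"  # overflow capacity, up to 3x the configured limit
    return "rejected"  # assumed behavior once the ceiling is reached


# With a limit of 10, the burst ceiling is 30 concurrent calls.
for active in (5, 10, 29, 30):
    print(active, classify_call(active, configured_limit=10))
```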

Studio

JSON content initialization: Added support for initializing Studio projects with structured JSON content through the from_content_json parameter, enabling programmatic project creation with predefined chapters, blocks, and voice configurations.

Workspaces

Webhook management: Introduced workspace-level webhook management capabilities, allowing administrators to view, configure, and monitor webhook integrations across the entire workspace with detailed usage tracking and failure diagnostics.

API

New Endpoints

Conversational AI - MCP Servers

Create MCP server - Create a new MCP server configuration in the workspace
List MCP servers - Retrieve all MCP server configurations available in the workspace
Get MCP server - Retrieve a specific MCP server configuration from the workspace
Update MCP server approval policy - Update the approval policy configuration for an MCP server
Create MCP server tool approval - Add approval for a specific MCP tool when using per-tool approval mode
Delete MCP server tool approval - Remove approval for a specific MCP tool when using per-tool approval mode

Workspace

Get workspace webhooks - Retrieve all webhook configurations for the workspace with optional usage information

Updated Endpoints

Conversational AI

Agent simulation:
- Simulate conversation - Added dynamic_variables parameter for populating conversation context with runtime values
- Simulate conversation stream - Added dynamic_variables parameter for streaming conversation simulations
Agent configuration:
- Agent platform settings - Added bursting_enabled parameter to control burst pricing for call limits
WebSocket events:
- Enhanced ClientEvent enum to include mcp_connection_status for real-time MCP server monitoring
Conversation charging:
- Added is_burst indicator to conversation metadata for tracking burst pricing usage

Studio

Create Studio project - Added from_content_json parameter for JSON-based project setup

User Management

User profile:
- Get user - Deprecated can_use_delayed_payment_methods field in user response model

Subscription Management

Subscription status:
- Removed canceled and unpaid from available subscription status types, streamlining subscription state management

June 8, 2025

Text to Speech

Eleven v3 (alpha): Released Eleven v3 (alpha), our most expressive Text to Speech model, as a research preview.

Conversational AI

Custom voice settings in multi-voice: Added support for configuring individual voice settings per supported voice in multi-voice agents, allowing fine-tuned control over stability, speed, similarity boost, and streaming latency for each voice.
Silent transfer to human in Twilio: Added backend configuration support for silent (cold) transfer to human in the Twilio native integration, enabling seamless handoff without announcing the transfer to callers.
Batch calling retry and cancel: Added support for retrying outbound calls to phone numbers that did not respond during a batch call, along with the ability to cancel ongoing batch operations for better campaign management.
LLM pinning: Added support for versioned LLM models with explicit checkpoint identifiers.
Custom LLM headers: Added support for passing custom headers to custom LLMs.
Fixed issue in non-Latin languages: Fixed an issue causing some conversations in non-Latin-alphabet languages to fail.

SDKs

Python SDK v2.3.0: Released Python SDK v2.3.0
JavaScript SDK v2.2.0: Released JavaScript SDK v2.2.0

API

New Endpoints

Conversational AI

Batch Calling:
- Cancel batch call - Cancel a running batch call and set all recipients to cancelled status
- Retry batch call - Retry a batch call by setting completed recipients back to pending status
Knowledge Base RAG:
- Get document RAG indexes - Get information about all RAG indexes of a knowledge base document
- Delete document RAG index - Delete a specific RAG index for a knowledge base document
- RAG index overview - Get total size and information of RAG indexes used by knowledge base documents

Updated Endpoints

Conversational AI

Supported Voices:
- Agent configuration - Added optimize_streaming_latency, stability, speed, and similarity_boost parameters for per-voice TTS customization
Transfer to Human:
- Agent configuration - Added enable_client_message parameter to control whether a message is played to the client during transfer
Knowledge Base:
- Knowledge base documents now use supported_usages instead of prompt_injectable for better usage mode control
- RAG index creation now returns enhanced response model with usage information
Custom LLM:
- Agent configuration - Added request_headers parameter for custom header configuration
Widget Configuration:
- Agent platform settings - Added comprehensive styles configuration for widget appearance customization
LLM:
- Added support for versioned LLM models with explicit version identifiers

June 1, 2025

Conversational AI

Multi-voice support for agents: Enable conversational AI agents to dynamically switch between different voices during conversations for multi-character storytelling, language tutoring, and role-playing scenarios.
Claude Sonnet 4 support: Added Claude Sonnet 4 as a new LLM option for conversational agents, providing enhanced reasoning capabilities and improved performance.
Genesys Cloud integration: Introduced AudioHook Protocol integration for seamless connection with Genesys Cloud contact center platform.
Force delete knowledge base documents: Added force parameter to knowledge base document deletion, allowing removal of documents even when used by agents.
Multimodal widget: Added text input and text-only mode defaults for better user experience with improved widget configuration.

API

Updated Endpoints

Speech to Text

Create transcript - Added webhook parameter for asynchronous processing with webhook delivery

Conversational AI

Knowledge Base:
- Delete knowledge base document - Added force query parameter to delete documents regardless of agent dependencies
Widget:
- Widget configuration - Added text input and text-only mode support for multi-modality

May 26, 2025

Forced Alignment

Forced alignment improvements: Fixed a rare failure case in forced alignment processing to improve reliability.

Voices

Live moderated voices filter: Added include_live_moderated query parameter to the shared voices endpoint, allowing you to include or exclude voices that are live moderated.

Conversational AI

Secret dynamic variables: Added support for specifying dynamic variables as secrets with the secret__ prefix. Secret dynamic variables can only be used in webhook tool headers and are never sent to an LLM, enhancing security for sensitive data.
Skip turn system tool: Introduced a new system tool called skip_turn. When enabled, the agent will skip its turn if the user explicitly indicates they need a moment to think or perform an action (e.g., “just a sec”, “give me a minute”). This prevents turn timeout from being triggered during intentional user pauses. See the skip turn tool docs for more information.
Text input support: Added text input support in WebSocket connections via the “user_message” event with a text field. Also added “user_activity” event support to indicate typing or other UI activity, improving agent turn-taking when there’s interleaved text and audio input.
RAG chunk limit: Added ability to configure the maximum number of chunks collected during RAG retrieval, giving users more control over context window usage and costs.
Enhanced widget configuration: Expanded widget customization options to include text input and text-only mode.
LLM usage calculator: Introduced tools to calculate expected LLM token usage and costs for agents, helping with cost estimation and planning.
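
The secret__ prefix rule for dynamic variables can be pictured with a small sketch that separates secret variables from the rest before anything is shown to an LLM. The function name and the sample variable names are hypothetical; the platform applies this rule itself, so this only illustrates the naming convention.

```python
SECRET_PREFIX = "secret__"


def split_dynamic_variables(variables):
    """Separate secret dynamic variables from those that may reach an LLM."""
    llm_visible = {}
    header_only = {}
    for name, value in variables.items():
        # Names starting with secret__ are reserved for webhook tool headers.
        target = header_only if name.startswith(SECRET_PREFIX) else llm_visible
        target[name] = value
    return llm_visible, header_only


variables = {"user_name": "Ada", "secret__crm_token": "example-token"}
llm_visible, header_only = split_dynamic_variables(variables)
print(sorted(llm_visible))
print(sorted(header_only))
```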

Audio Native

Accessibility improvements: Enhanced accessibility for the AudioNative player with multiple improvements:
- Added aria-labels for all buttons
- Enabled keyboard navigation for all interactive elements
- Made progress bar handle focusable and keyboard-accessible
- Improved focus indicator visibility for better screen reader compatibility

API

New Endpoints

Added 3 new endpoints:
- Get Agent Knowledge Base Size - Returns the number of pages in the agent’s knowledge base.
- Calculate Agent LLM Usage - Calculates expected number of LLM tokens needed for the specified agent.
- Calculate LLM Usage - Returns a list of LLM models and the expected cost for using them based on the provided values.

Updated Endpoints

Voices

Get Shared Voices - Added include_live_moderated query parameter to GET /v1/shared-voices to filter voices by live moderation status.
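
A minimal sketch of building the request URL with the new query parameter follows. The base URL and the lowercase "true"/"false" encoding of the boolean are assumptions; only the path and parameter name come from this entry.

```python
from urllib.parse import urlencode

BASE_URL = "https://api.elevenlabs.io"  # assumed base URL


def shared_voices_url(include_live_moderated: bool) -> str:
    """Build the shared voices URL with the live moderation filter."""
    # Booleans are encoded as lowercase strings here (an assumption).
    query = urlencode({"include_live_moderated": str(include_live_moderated).lower()})
    return f"{BASE_URL}/v1/shared-voices?{query}"


print(shared_voices_url(False))
```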

Conversational AI

Agent Configuration:
- Enhanced system tools with new skip_turn tool configuration
- Improved RAG configuration with max_retrieved_rag_chunks_count parameter
Widget Configuration:
- Added support for text-only mode
Batch Calling:
- Batch call responses now include phone_provider field with default value “twilio”

Text to Speech

Voice Settings:
- Added quality parameter to voice settings for controlling audio generation quality
- Model response schema updated to include can_use_quality field

May 19, 2025

SDKs

SDKs V2: Released new v2 SDKs for both Python and JavaScript

Speech to Text

Speech to text logprobs: The Speech to Text response now includes a logprob field for word prediction confidence.
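
A log probability is easier to read once converted back to a probability. The sketch below assumes the logprob is a natural logarithm and that each word is a dictionary with "text" and "logprob" keys; both the word shape and the 0.5 threshold are illustrative assumptions rather than the documented response schema.

```python
import math


def logprob_to_probability(logprob: float) -> float:
    """Convert a natural-log probability to a value between 0 and 1."""
    return math.exp(logprob)


def low_confidence_words(words, threshold=0.5):
    """Return the text of words whose probability falls below the threshold."""
    return [w["text"] for w in words
            if logprob_to_probability(w["logprob"]) < threshold]


words = [{"text": "hello", "logprob": -0.05}, {"text": "wrold", "logprob": -2.3}]
print(low_confidence_words(words))
```

Flagging low-confidence words this way is one option for deciding which parts of a transcript to review by hand.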

Billing

Improved API error messages: Enhanced API error messages for subscriptions with failed payments. This provides clearer information if a failed payment has caused a user to reach their quota threshold sooner than expected.

Conversational AI

Batch calls: Released new batch calling functionality, which allows you to automate groups of outbound calls.
Increased evaluation criteria limit: The maximum number of evaluation criteria for agent performance evaluation has been increased from 5 to 10.
Human-readable IDs: Introduced human-readable IDs for key Conversational AI entities (e.g., agents, conversations). This improves usability and makes resources easier to identify and manage through the API and UI.
Unanswered call tracking: ‘Not Answered’ outbound calls are now reliably detected and visible in the conversation history.
LLM cost visibility in dashboard: The Conversational AI dashboard now displays the total and per-minute average LLM costs.
Zero retention mode (ZRM) for agents: Zero Retention Mode (ZRM) can now be enabled per agent.
Dynamic variables in headers: Added the option to set a dynamic variable as a header value for tools.
Customisable tool timeouts: Timeout durations can now be set individually per tool.

Workspaces

Simplified secret updates: Workspace secrets can now be updated more granularly using a PATCH request via the API, simplifying the management of individual secret values. For technical details, please see the API changes section below.

API

New Endpoints

Added 6 new endpoints:
- Get Signed Url - Get a signed URL to start a conversation with an agent that requires authorization.
- Simulate Conversation - Run a conversation between an agent and a simulated user.
- Simulate Conversation (Stream) - Run and stream a conversation simulation between an agent and a simulated user.
- Update Convai Workspace Secret - Update an existing secret for the Convai workspace.
- Submit Batch Call Request - Submit a batch call request to schedule calls for multiple recipients.
- Get All Batch Calls for Workspace - Retrieve all batch calls for the current workspace.

Updated Endpoints

Conversational AI

Agents & Conversations:
- Endpoint GET /v1/convai/conversation/get_signed_url (snake_case path) has been deprecated. Use the new GET /v1/convai/conversation/get-signed-url (kebab-case path) instead.
Phone Numbers:
- Get Phone Number Details - Response schema for GET /v1/convai/phone-numbers/{phone_number_id} updated to distinguish between Twilio and SIPTrunk provider details.
- Update Phone Number - Response schema for PATCH /v1/convai/phone-numbers/{phone_number_id} updated similarly for Twilio and SIPTrunk.
- List Phone Numbers - Response schema for GET /v1/convai/phone-numbers/ list items updated for Twilio and SIPTrunk providers.
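
For the signed URL deprecation listed above, client code that still calls the snake_case path can be migrated with a simple path rewrite. This is a sketch only: the helper name, the host, and the agent_id query parameter in the usage line are assumptions for illustration, while the two paths are taken from this entry.

```python
from urllib.parse import urlsplit, urlunsplit

DEPRECATED_PATH = "/v1/convai/conversation/get_signed_url"
CURRENT_PATH = "/v1/convai/conversation/get-signed-url"


def migrate_signed_url_path(url: str) -> str:
    """Rewrite the deprecated snake_case path to the kebab-case path."""
    parts = urlsplit(url)
    if parts.path == DEPRECATED_PATH:
        # Only the path changes; scheme, host, and query are preserved.
        parts = parts._replace(path=CURRENT_PATH)
    return urlunsplit(parts)


old = "https://api.elevenlabs.io/v1/convai/conversation/get_signed_url?agent_id=abc"
print(migrate_signed_url_path(old))
```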

Text To Speech

Text to Speech Endpoints - Default model_id changed from eleven_monolingual_v1 to eleven_multilingual_v2 for the following endpoints:
- POST /v1/text-to-speech/{voice_id}/stream
- POST /v1/text-to-speech/{voice_id}/stream/with-timestamps
- POST /v1/text-to-speech/{voice_id}
- POST /v1/text-to-speech/{voice_id}/with-timestamps
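
Because the default changed, requests that omit model_id may now produce different audio. One way to guard against that is to pin model_id explicitly in the request body, as in the sketch below. The "text" field name and the helper function are assumptions for illustration; the two model identifiers come from this entry.

```python
import json

NEW_DEFAULT_MODEL_ID = "eleven_multilingual_v2"
PREVIOUS_DEFAULT_MODEL_ID = "eleven_monolingual_v1"


def build_tts_body(text, model_id=None):
    """Build a JSON request body, pinning model_id only when one is given."""
    body = {"text": text}  # "text" is an assumed field name
    if model_id is not None:
        body["model_id"] = model_id
    return json.dumps(body)


# Omitting model_id relies on the server default; passing it pins the model.
print(build_tts_body("Hello"))
print(build_tts_body("Hello", model_id=PREVIOUS_DEFAULT_MODEL_ID))
```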

Voices

Get Shared Voices - Added include_custom_rates query parameter to GET /v1/shared-voices.
Schema Updates:
- LibraryVoiceResponseModel and VoiceSharingResponseModel now include an optional fiat_rate field (USD per 1000 credits).
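
Since fiat_rate is expressed in USD per 1000 credits, an estimated cost follows from simple proportion. The sketch below is illustrative: the function name is hypothetical, and returning None when the optional field is absent is a design choice made here, not documented behavior.

```python
def estimated_cost_usd(credits_used, fiat_rate):
    """Estimate cost in USD from credits and a rate given per 1000 credits."""
    if fiat_rate is None:
        return None  # the field is optional, so a rate may be missing
    return credits_used / 1000 * fiat_rate


print(estimated_cost_usd(2500, 0.30))
print(estimated_cost_usd(2500, None))
```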

May 12, 2025

Billing

Downgraded Plan Pricing Fix: Fixed an issue where customers with downgraded subscriptions were shown their current price instead of the correct future price.

Conversational AI

Edit Knowledge Base Document Names: You can now edit the names of knowledge base documents.
See: Knowledge Base
Conversation Simulation: Released a new endpoint that allows you to test an agent over text.

Studio

Export Paragraphs as Zip: Added support for exporting separated paragraphs in a zip file.
See: Studio

SDKs

Released new SDKs:
- ElevenLabs Python v1.58.1
- ElevenLabs JS v1.58.0

API

New Endpoints

Update metadata for a speaker
PATCH /v1/dubbing/resource/{dubbing_id}/speaker/{speaker_id}
Amend the metadata associated with a speaker, such as their voice. Both voice cloning and using voices from the ElevenLabs library are supported.
Search similar voices for a speaker
GET /v1/dubbing/resource/{dubbing_id}/speaker/{speaker_id}/similar-voices
Fetch the top 10 similar voices to a speaker, including IDs, names, descriptions, and sample audio.
Simulate a conversation
POST /v1/convai/agents/{agent_id}/simulate_conversation
Run a conversation between the agent and a simulated user.
Simulate a conversation (stream)
POST /v1/convai/agents/{agent_id}/simulate_conversation/stream
Stream a simulated conversation between the agent and a simulated user.
Handle outbound call via SIP trunk
POST /v1/convai/sip-trunk/outbound-call
Initiate an outbound call using SIP trunking.

Updated Endpoints

List conversations
GET /v1/convai/conversations
Added call_start_after_unix query parameter to filter conversations by start date.
Update knowledge base document
PATCH /v1/convai/knowledge-base/{documentation_id}
Now supports updating the name of a document.
Text to Speech endpoints
The default model for all TTS endpoints is now eleven_multilingual_v2 (was eleven_monolingual_v1).

Removed Endpoints

None.

May 5, 2025

Dubbing

Disable Voice Cloning: Added an option in the Dubbing Studio UI to disable voice cloning when uploading audio, aligning with the existing disable_voice_cloning API parameter.

Billing

Quota Exceeded Error: Improved error messaging for exceeding character limits. Users attempting to generate audio beyond their quota within a short billing window will now receive a clearer 401 unauthorized: This request exceeds your quota limit of... error message indicating the limit has been exceeded.

SDKs

Released new SDKs: Added ElevenLabs Python v1.58.0 and ElevenLabs JS v1.58.0 to fix a breaking change that had been mistakenly shipped.

April 28, 2025

Conversational AI

Custom Dashboard Charts: The Conversational AI Dashboard can now be extended with custom charts displaying the results of evaluation criteria over time. See the new GET and PATCH endpoints for managing dashboard settings.
Call History Filtering: Added the ability to filter the call history by start date using the new call_start_before_unix parameter in the List Conversations endpoint.
Server Tools: Added the option to make PUT requests in server tools.
Transfer to human: Added call forwarding functionality to support forwarding to operators. See the docs for details.
Language detection: Fixed an issue where the language detection system tool would trigger when a user replied “yes” in a non-English language.

Usage Analytics

Custom Aggregation: Added an optional aggregation_interval parameter to the Get Usage Metrics endpoint to control the interval over which to aggregate character usage (hour, day, week, month, or cumulative).
New Metric Breakdowns: The Usage Analytics section now supports additional metric breakdowns including minutes_used, request_count, ttfb_avg, and ttfb_p95, selectable via the new metric parameter in the Get Usage Metrics endpoint. Furthermore, you can now get a breakdown and filter by request_queue.

API

New Endpoints

Added 2 new endpoints for managing Conversational AI dashboard settings:
- Get Dashboard Settings - Retrieves custom chart configurations for the ConvAI dashboard.
- Update Dashboard Settings - Updates custom chart configurations for the ConvAI dashboard.

Updated Endpoints

Audio Generation (TTS, S2S, SFX, Voice Design)

Updated endpoints to support new output_format option pcm_48000:
- Text to Speech (POST /v1/text-to-speech/{voice_id})
- Text to Speech with Timestamps (POST /v1/text-to-speech/{voice_id}/with-timestamps)
- Text to Speech Stream (POST /v1/text-to-speech/{voice_id}/stream)
- Text to Speech Stream with Timestamps (POST /v1/text-to-speech/{voice_id}/stream/with-timestamps)
- Speech to Speech (POST /v1/speech-to-speech/{voice_id})
- Speech to Speech Stream (POST /v1/speech-to-speech/{voice_id}/stream)
- Sound Generation (POST /v1/sound-generation)
- Create Voice Previews (POST /v1/text-to-voice/create-previews)
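
When working with raw PCM output, it helps to relate byte counts to playback duration. The sketch below assumes pcm_48000 means 48,000 samples per second of 16-bit mono audio; the sample width and channel count are assumptions, so check them against the API reference before relying on the numbers.

```python
def pcm_duration_seconds(num_bytes, sample_rate=48_000,
                         bytes_per_sample=2, channels=1):
    """Compute the playback duration of a raw PCM buffer."""
    frame_size = bytes_per_sample * channels
    if num_bytes % frame_size:
        raise ValueError("buffer does not contain a whole number of frames")
    return num_bytes / frame_size / sample_rate


# Under these assumptions, one second of audio occupies 96,000 bytes.
print(pcm_duration_seconds(96_000))
```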

Usage Analytics

Updated usage metrics endpoint:
- Get Usage Metrics (GET /v1/usage/character-stats) - Added optional aggregation_interval and metric query parameters.
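
A minimal sketch of composing the new query parameters follows. The base URL and helper name are assumptions, and any other parameters the endpoint may require are omitted; the interval values and the ttfb_p95 metric name are taken from this changelog.

```python
from urllib.parse import urlencode

BASE_URL = "https://api.elevenlabs.io"  # assumed base URL
AGGREGATION_INTERVALS = {"hour", "day", "week", "month", "cumulative"}


def usage_metrics_url(aggregation_interval=None, metric=None):
    """Build the usage metrics URL with the optional new parameters."""
    params = {}
    if aggregation_interval is not None:
        if aggregation_interval not in AGGREGATION_INTERVALS:
            raise ValueError(f"unsupported interval: {aggregation_interval}")
        params["aggregation_interval"] = aggregation_interval
    if metric is not None:
        params["metric"] = metric
    query = f"?{urlencode(params)}" if params else ""
    return f"{BASE_URL}/v1/usage/character-stats{query}"


print(usage_metrics_url("week", "ttfb_p95"))
```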

Conversational AI

Updated conversation listing endpoint:
- List Conversations (GET /v1/convai/conversations) - Added optional call_start_before_unix query parameter for filtering by start date.
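
The start-date filter takes a Unix timestamp, so dates need converting before they go into the query string. The sketch below assumes the value is in whole seconds and that the base URL is as shown; the helper name is hypothetical. It also accepts call_start_after_unix, the companion parameter listed in the May 12, 2025 entry.

```python
from datetime import datetime, timezone
from urllib.parse import urlencode

BASE_URL = "https://api.elevenlabs.io"  # assumed base URL


def conversations_url(start_after=None, start_before=None):
    """Build a List Conversations URL filtered by call start time."""
    params = {}
    if start_after is not None:
        params["call_start_after_unix"] = int(start_after.timestamp())
    if start_before is not None:
        params["call_start_before_unix"] = int(start_before.timestamp())
    query = f"?{urlencode(params)}" if params else ""
    return f"{BASE_URL}/v1/convai/conversations{query}"


# Timezone-aware datetimes avoid ambiguity about the local offset.
march = datetime(2025, 3, 1, tzinfo=timezone.utc)
april = datetime(2025, 4, 1, tzinfo=timezone.utc)
print(conversations_url(start_after=march, start_before=april))
```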

Schema Changes

Conversational AI

Added detailed LLM usage and pricing information to conversation charging and history models.
Added tool_latency_secs to tool result schemas.
Added access_info to GET /v1/convai/agents/{agent_id}.

April 21, 2025

Professional Voice Cloning (PVC)

PVC API: Introduced a comprehensive suite of API endpoints for managing Professional Voice Clones (PVC). You can now programmatically create voices, add/manage/delete audio samples, retrieve audio/waveforms, manage speaker separation, handle verification, and initiate training. For a full list of new endpoints check the API changes summary below or read the PVC API reference here.

Speech to Text

Enhanced Export Options: Added options to include or exclude timestamps and speaker IDs when exporting Speech to Text results in segmented JSON format via the API.

Conversational AI

New LLM Models: Added support for the new GPT-4.1 models: gpt-4.1, gpt-4.1-mini, and gpt-4.1-nano.
VAD Score: Added a new client event that sends VAD scores to the client. See the reference for details.

Workspace

Member Management: Added a new API endpoint that allows administrators to delete workspace members.

API

New Endpoints

Added 16 new endpoints:
- Delete Member - Allows deleting workspace members.
- Create PVC Voice - Creates a new PVC voice.
- Edit PVC Voice - Edits PVC voice metadata.
- Add Samples To PVC Voice - Adds audio samples to a PVC voice.
- Update PVC Voice Sample - Updates a PVC voice sample (noise removal, speaker selection, trimming).
- Delete PVC Voice Sample - Deletes a sample from a PVC voice.
- Retrieve Voice Sample Audio - Retrieves audio for a PVC voice sample.
- Retrieve Voice Sample Visual Waveform - Retrieves the visual waveform for a PVC voice sample.
- Retrieve Speaker Separation Status - Gets the status of speaker separation for a sample.
- Start Speaker Separation - Initiates speaker separation for a sample.
- Retrieve Separated Speaker Audio - Retrieves audio for a specific separated speaker.
- Get PVC Voice Captcha - Gets the captcha for PVC voice verification.
- Verify PVC Voice Captcha - Submits captcha verification for a PVC voice.
- Run PVC Training - Starts the training process for a PVC voice.
- Request Manual Verification - Requests manual verification for a PVC voice.

Updated Endpoints

Speech to Text

Updated endpoint with changes:
- Create Forced Alignment Task - Added enabled_spooled_file parameter to allow streaming large files (POST /v1/forced-alignment).

Schema Changes

Conversational AI

GET conversation details: Added has_audio, has_user_audio, and has_response_audio boolean fields.

Dubbing

GET dubbing resource: Added status field to each render.