Tools migration

Text to Speech

  • Audio tags automatic removal: Audio tags are now automatically removed when switching from V3 to V2 models, ensuring optimal compatibility and performance.

Conversational AI

  • Tools management UI: Added a new comprehensive tools management interface for creating, configuring, and managing tools across all agents in your workspace.
  • Streamlined agent creation: Introduced a new agent creation flow with improved user experience and better configuration options.
  • Agent duplication: Added the ability to duplicate existing agents, allowing you to quickly create variations of successful agent configurations.

SIP Trunking

Voices

  • Famous voice category: Added a new “famous” voice category to the voice library, expanding the available voice options for users.

Dubbing

  • CSV frame rate control: Added csv_fps parameter to control frame rate when parsing CSV files for dubbing projects, providing more precise timing control.

SDKs

  • ElevenLabs JavaScript SDK v2.4.0: Released with new Conversational AI SDK support for Node.js. View release notes
  • ElevenLabs Python SDK v2.5.0: Updated with enhanced Conversational AI capabilities. View release notes

API

New Endpoints

Conversational AI

Updated Endpoints

Conversational AI

  • Agent configuration:

    • Added built_in_tools configuration for system tools management
    • Deprecated inline tools configuration in favor of tool_ids for better tool management
  • Tool system:

    • Refactored tool configuration structure to use centralized tool management

Dubbing

SIP Trunking

Voice Library

  • Voice categories:
    • Updated voice response models to include “famous” as a new voice category option
    • Enhanced voice search and filtering capabilities

Conversational AI

  • Dynamic variables in simulated conversations: Added support for dynamic variable population in simulated conversations, enabling more flexible and context-aware conversation testing scenarios.
  • MCP server integration: Introduced comprehensive support for Model Context Protocol (MCP) servers, allowing agents to connect to external tools and services through standardized protocols with configurable approval policies.
  • Burst pricing for extra concurrency: Added bursting capability for workspace call limits, automatically allowing up to 3x the configured concurrency limit during peak usage for overflow capacity.

Studio

  • JSON content initialization: Added support for initializing Studio projects with structured JSON content through the from_content_json parameter, enabling programmatic project creation with predefined chapters, blocks, and voice configurations.

Workspaces

  • Webhook management: Introduced workspace-level webhook management capabilities, allowing administrators to view, configure, and monitor webhook integrations across the entire workspace with detailed usage tracking and failure diagnostics.

API

New Endpoints

Conversational AI - MCP Servers

Workspace

  • Get workspace webhooks - Retrieve all webhook configurations for the workspace with optional usage information

Updated Endpoints

Conversational AI

  • Agent simulation:

  • Agent configuration:

  • WebSocket events:

    • Enhanced ClientEvent enum to include mcp_connection_status for real-time MCP server monitoring
  • Conversation charging:

    • Added is_burst indicator to conversation metadata for tracking burst pricing usage

Studio

User Management

  • User profile:
    • Get user - Deprecated can_use_delayed_payment_methods field in user response model

Subscription Management

  • Subscription status:
    • Removed canceled and unpaid from available subscription status types, streamlining subscription state management

Text to Speech

  • Eleven v3 (alpha): Released Eleven v3 (alpha), our most expressive Text to Speech model, as a research preview.

Conversational AI

  • Custom voice settings in multi-voice: Added support for configuring individual voice settings per supported voice in multi-voice agents, allowing fine-tuned control over stability, speed, similarity boost, and streaming latency for each voice.
  • Silent transfer to human in Twilio: Added backend configuration support for silent (cold) transfer to human in the Twilio native integration, enabling seamless handoff without announcing the transfer to callers.
  • Batch calling retry and cancel: Added support for retrying outbound calls to phone numbers that did not respond during a batch call, along with the ability to cancel ongoing batch operations for better campaign management.
  • LLM pinning: Added support for versioned LLM models with explicit checkpoint identifiers
  • Custom LLM headers: Added support for passing custom headers to custom LLMs
  • Fixed issue in non-latin languages: Fixed an issue causing some conversations in non latin alphabet languages to fail.

SDKs

API

New Endpoints

Conversational AI

  • Batch Calling:

    • Cancel batch call - Cancel a running batch call and set all recipients to cancelled status
    • Retry batch call - Retry a batch call by setting completed recipients back to pending status
  • Knowledge Base RAG:

Updated Endpoints

Conversational AI

  • Supported Voices:

    • Agent configuration - Added optimize_streaming_latency, stability, speed, and similarity_boost parameters for per-voice TTS customization
  • Transfer to Human:

    • Agent configuration - Added enable_client_message parameter to control whether a message is played to the client during transfer
  • Knowledge Base:

    • Knowledge base documents now use supported_usages instead of prompt_injectable for better usage mode control
    • RAG index creation now returns enhanced response model with usage information
  • Custom LLM:

  • Widget Configuration:

  • LLM:

Conversational AI

API

Updated Endpoints

Speech to Text

  • Create transcript - Added webhook parameter for asynchronous processing with webhook delivery

Conversational AI

Forced Aligment

  • Forced alignment improvements: Fixed a rare failure case in forced alignment processing to improve reliability.

Voices

  • Live moderated voices filter: Added include_live_moderated query parameter to the shared voices endpoint, allowing you to include or exclude voices that are live moderated.

Conversational AI

  • Secret dynamic variables: Added support for specifying dynamic variables as secrets with the secret__ prefix. Secret dynamic variables can only be used in webhook tool headers and are never sent to an LLM, enhancing security for sensitive data. Learn more.
  • Skip turn system tool: Introduced a new system tool called skip_turn. When enabled, the agent will skip its turn if the user explicitly indicates they need a moment to think or perform an action (e.g., “just a sec”, “give me a minute”). This prevents turn timeout from being triggered during intentional user pauses. See the skip turn tool docs for more information.
  • Text input support: Added text input support in websocket connections via “user_message” event with text field. Also added “user_activity” event support to indicate typing or other UI activity, improving agent turn-taking when there’s interleaved text and audio input.
  • RAG chunk limit: Added ability to configure the maximum number of chunks collected during RAG retrieval, giving users more control over context window usage and costs.
  • Enhanced widget configuration: Expanded widget customization options to include text input and text only mode.
  • LLM usage calculator: Introduced tools to calculate expected LLM token usage and costs for agents, helping with cost estimation and planning.

Audio Native

  • Accessibility improvements: Enhanced accessibility for the AudioNative player with multiple improvements:
    • Added aria-labels for all buttons
    • Enabled keyboard navigation for all interactive elements
    • Made progress bar handle focusable and keyboard-accessible
    • Improved focus indicator visibility for better screen reader compatibility

API

New Endpoints

Updated Endpoints

Voices

  • Get Shared Voices - Added include_live_moderated query parameter to GET /v1/shared-voices to filter voices by live moderation status.

Conversational AI

  • Agent Configuration:

    • Enhanced system tools with new skip_turn tool configuration
    • Improved RAG configuration with max_retrieved_rag_chunks_count parameter
  • Widget Configuration:

    • Added support for text-only mode
  • Batch Calling:

    • Batch call responses now include phone_provider field with default value “twilio”

Text to Speech

  • Voice Settings:
    • Added quality parameter to voice settings for controlling audio generation quality
    • Model response schema updated to include can_use_quality field

SDKs

Speech to Text

  • Speech to text logprobs: The Speech to Text response now includes a logprob field for word prediction confidence.

Billing

  • Improved API error messages: Enhanced API error messages for subscriptions with failed payments. This provides clearer information if a failed payment has caused a user to reach their quota threshold sooner than expected.

Conversational AI

  • Batch calls: Released new batch calling functionality, which allows you to automate groups of outbound calls.
  • Increased evaluation criteria limit: The maximum number of evaluation criteria for agent performance evaluation has been increased from 5 to 10.
  • Human-readable IDs: Introduced human-readable IDs for key Conversational AI entities (e.g., agents, conversations). This improves usability and makes resources easier to identify and manage through the API and UI.
  • Unanswered call tracking: ‘Not Answered’ outbound calls are now reliably detected and visible in the conversation history.
  • LLM cost visibility in dashboard: The Conversational AI dashboard now displays the total and per-minute average LLM costs.
  • Zero retention mode (ZRM) for agents: Allowed enabling Zero Retention Mode (ZRM) per agent.
  • Dynamic variables in headers: Added option of setting dynamic variable as a header value for tools
  • Customisable tool timeouts: Shipped setting different timeout durations per tool.

Workspaces

  • Simplified secret updates: Workspace secrets can now be updated more granularly using a PATCH request via the API, simplifying the management of individual secret values. For technical details, please see the API changes section below.

API

New Endpoints

Updated Endpoints

Conversational AI

  • Agents & Conversations:
    • Endpoint GET /v1/convai/conversation/get_signed_url (snake_case path) has been deprecated. Use the new GET /v1/convai/conversation/get-signed-url (kebab-case path) instead.
  • Phone Numbers:
    • Get Phone Number Details - Response schema for GET /v1/convai/phone-numbers/{phone_number_id} updated to distinct Twilio and SIPTrunk provider details.
    • Update Phone Number - Response schema for PATCH /v1/convai/phone-numbers/{phone_number_id} updated similarly for Twilio and SIPTrunk.
    • List Phone Numbers - Response schema for GET /v1/convai/phone-numbers/ list items updated for Twilio and SIPTrunk providers.

Text To Speech

  • Text to Speech Endpoints - Default model_id changed from eleven_monolingual_v1 to eleven_multilingual_v2 for the following endpoints:
    • POST /v1/text-to-speech/{voice_id}/stream
    • POST /v1/text-to-speech/{voice_id}/stream-with-timestamps
    • POST /v1/text-to-speech/{voice_id}
    • POST /v1/text-to-speech/{voice_id}/with-timestamps

Voices

  • Get Shared Voices - Added include_custom_rates query parameter to GET /v1/shared-voices.
  • Schema Updates:
    • LibraryVoiceResponseModel and VoiceSharingResponseModel now include an optional fiat_rate field (USD per 1000 credits).

Billing

  • Downgraded Plan Pricing Fix: Fixed an issue where customers with downgraded subscriptions were shown their current price instead of the correct future price.

Conversational AI

  • Edit Knowledge Base Document Names: You can now edit the names of knowledge base documents.
    See: Knowledge Base
  • Conversation Simulation: Released a new endpoint that allows you to test an agent over text

Studio

  • Export Paragraphs as Zip: Added support for exporting separated paragraphs in a zip file.
    See: Studio

SDKs

API

New Endpoints

  • Update metadata for a speaker
    PATCH /v1/dubbing/resource/{dubbing_id}/speaker/{speaker_id}
    Amend the metadata associated with a speaker, such as their voice. Both voice cloning and using voices from the ElevenLabs library are supported.

  • Search similar voices for a speaker
    GET /v1/dubbing/resource/{dubbing_id}/speaker/{speaker_id}/similar-voices
    Fetch the top 10 similar voices to a speaker, including IDs, names, descriptions, and sample audio.

  • Simulate a conversation
    POST /v1/convai/agents/{agent_id}/simulate_conversation
    Run a conversation between the agent and a simulated user.

  • Simulate a conversation (stream)
    POST /v1/convai/agents/{agent_id}/simulate_conversation/stream
    Stream a simulated conversation between the agent and a simulated user.

  • Handle outbound call via SIP trunk
    POST /v1/convai/sip-trunk/outbound-call
    Initiate an outbound call using SIP trunking.

Updated Endpoints

  • List conversations
    GET /v1/convai/conversations
    Added call_start_after_unix query parameter to filter conversations by start date.

  • Update knowledge base document
    PATCH /v1/convai/knowledge-base/{documentation_id}
    Now supports updating the name of a document.

  • Text to Speech endpoints
    The default model for all TTS endpoints is now eleven_multilingual_v2 (was eleven_monolingual_v1).

Removed Endpoints

  • None.

Dubbing

  • Disable Voice Cloning: Added an option in the Dubbing Studio UI to disable voice cloning when uploading audio, aligning with the existing disable_voice_cloning API parameter.

Billing

  • Quota Exceeded Error: Improved error messaging for exceeding character limits. Users attempting to generate audio beyond their quota within a short billing window will now receive a clearer 401 unauthorized: This request exceeds your quota limit of... error message indicating the limit has been exceeded.

SDKs

Conversational AI

  • Custom Dashboard Charts: The Conversational AI Dashboard can now be extended with custom charts displaying the results of evaluation criteria over time. See the new GET and PATCH endpoints for managing dashboard settings.
  • Call History Filtering: Added the ability to filter the call history by start date using the new call_start_before_unix parameter in the List Conversations endpoint. Try it here.
  • Server Tools: Added option of making PUT requests in server tools
  • Transfer to human: Added call forwarding functionality to support forwarding to operators, see docs here
  • Language detection: Fixed an issue where the language detection system tool would trigger on a user replying yes in non-English language.

Usage Analytics

  • Custom Aggregation: Added an optional aggregation_interval parameter to the Get Usage Metrics endpoint to control the interval over which to aggregate character usage (hour, day, week, month, or cumulative).
  • New Metric Breakdowns: The Usage Analytics section now supports additional metric breakdowns including minutes_used, request_count, ttfb_avg, and ttfb_p95, selectable via the new metric parameter in the Get Usage Metrics endpoint. Furthermore, you can now get a breakdown and filter by request_queue.

API

New Endpoints

  • Added 2 new endpoints for managing Conversational AI dashboard settings:

Updated Endpoints

Audio Generation (TTS, S2S, SFX, Voice Design)

Usage Analytics

  • Updated usage metrics endpoint:
    • Get Usage Metrics (GET /v1/usage/character-stats) - Added optional aggregation_interval and metric query parameters.

Conversational AI

  • Updated conversation listing endpoint:
    • List Conversations (GET /v1/convai/conversations) - Added optional call_start_before_unix query parameter for filtering by start date.

Schema Changes

Conversational AI

Professional Voice Cloning (PVC)

  • PVC API: Introduced a comprehensive suite of API endpoints for managing Professional Voice Clones (PVC). You can now programmatically create voices, add/manage/delete audio samples, retrieve audio/waveforms, manage speaker separation, handle verification, and initiate training. For a full list of new endpoints check the API changes summary below or read the PVC API reference here.

Speech to Text

  • Enhanced Export Options: Added options to include or exclude timestamps and speaker IDs when exporting Speech to Text results in segmented JSON format via the API.

Conversational AI

  • New LLM Models: Added support for new GPT-4.1 models: gpt-4.1, gpt-4.1-mini, and gpt-4.1-nano here
  • VAD Score: Added a new client event which sends VAD scores to the client, see reference here

Workspace

  • Member Management: Added a new API endpoint to allow administrators to delete workspace members here

API

New Endpoints

Updated Endpoints

Speech to Text

  • Updated endpoint with changes:

Schema Changes

Conversational AI

  • GET conversation details: Added has_audio, has_user_audio, has_response_audio boolean fields here

Dubbing

  • GET dubbing resource : Added status field to each render here