Text to speech

  • Opus format support: Added support for Opus format with 48kHz sample rate across multiple bitrates (32-192 kbps).
  • Improved websocket error handling: Updated TTS websocket API to return more accurate error codes (1011 for internal errors instead of 1008) for better error identification and SLA monitoring.

Conversational AI

  • Twilio outbound: Added ability to natively run outbound calls.
  • Post-call webhook override: Added ability to override post-call webhook settings at the agent level, providing more flexible configurations.
  • Large knowledge base document viewing: Enhanced the knowledge base interface to allow viewing the entire content of large RAG documents.
  • Added call SID dynamic variable: Added system__call_sid as a system dynamic variable to allow referencing the call ID in prompts and tools.

Studio

  • Actor Mode: Added Actor Mode in Studio, allowing you to use your own voice recordings to direct the way speech should sound in Studio projects.
  • Improved keyboard shortcuts: Updated keyboard shortcuts for viewing settings and editor shortcuts to avoid conflicts and simplified shortcuts for locking paragraphs.

Dubbing

  • Dubbing duplication: Made dubbing duplication feature available to all users.
  • Manual mode foreground generation: Added ability to generate foreground audio when using manual mode with a file and CSV.

Voices

  • Enhanced voice collections: Improved voice collections with visual upgrades, language-based filtering, navigation breadcrumbs, collection images, and mouse dragging for carousel navigation.
  • Locale filtering: Added locale parameter to shared voices endpoint for more precise voice filtering.

API

Updated Endpoints

Text to Speech

Audio Format

Conversational AI

Voices

  • Updated Voice endpoints:

Dubbing

  • Updated Dubbing endpoint:
    • Dub a video or audio file - Renamed beta feature use_replacement_voices_from_library parameter to disable_voice_cloning for clarity