Voices

Conversational AI

  • Native outbound calling: Added native outbound calling for Twilio-configured numbers, eliminating the need for complex setup configurations. Outbound calls are now visible in the Call History page.
  • Automatic language detection: Added new system tool for automatic language detection that enables agents to switch languages based on both explicit user requests (“Let’s talk in Spanish”) and implicit language in user audio.
  • Pronunciation dictionary improvements: Fixed phoneme tags in pronunciation dictionaries to work correctly with conversational AI.
  • Large RAG document viewing: Added ability to view the entire content of large RAG documents in the knowledge base.
  • Customizable widget controls: Updated UI to include an optional mute microphone button and made widget icons customizable via slots.

Sound Effects

  • Fractional duration support: Fixed an issue where users couldn’t enter fractional values (like 0.5 seconds) for sound effect generation duration.

Speech to Text

  • Repetition handling: Improved detection and handling of repetitions in speech-to-text processing.

Studio

  • Reader publishing fixes: Added support for mp3_44100_192 output format (high quality) so users below Publisher tier can export audio to Reader.

Mobile

  • Core app signup: Added signup endpoints for the new Core mobile app.

API

New Endpoints

Updated Endpoints

Conversational AI

  • Updated Conversational AI endpoints:
    • Create agent - Added mic_muting_enabled property for UI control and workspace_overrides property for workspace-specific configurations
    • Update agent - Added workspace_overrides property for customizing agent behavior per workspace
    • Get agent - Added workspace_overrides property to the response
    • Get widget - Added mic_muting_enabled property for controlling microphone muting in the widget UI
    • Get conversation - Added rag information to view knowledge base content used during conversations
    • Create phone number - Replaced generic structure with specific twilio phone number and sip trunk options
    • Compute RAG index - Removed force_reindex query parameter for more controlled indexing
    • List knowledge base documents - Changed response structure to support different document types
    • Get knowledge base document - Modified to return different response models based on document type

Text to Speech

Speech to Text

  • Updated Speech to Text endpoint:
    • Convert speech to text - Removed biased_keywords property from form data and improved internal repetition detection algorithm

Voice Management

Studio

Pronunciation Dictionary

  • Updated Pronunciation Dictionary endpoints: