Changelog

Voices

List Voices V2: Added a new V2 voice search endpoint with better search and additional filtering options

Conversational AI

Native outbound calling: Added native outbound calling for Twilio-configured numbers, eliminating the need for complex setup configurations. Outbound calls are now visible in the Call History page.
Automatic language detection: Added new system tool for automatic language detection that enables agents to switch languages based on both explicit user requests (“Let’s talk in Spanish”) and implicit language in user audio.
Pronunciation dictionary improvements: Fixed phoneme tags in pronunciation dictionaries to work correctly with conversational AI.
Large RAG document viewing: Added ability to view the entire content of large RAG documents in the knowledge base.
Customizable widget controls: Updated UI to include an optional mute microphone button and made widget icons customizable via slots.

Sound Effects

Fractional duration support: Fixed an issue where users couldn’t enter fractional values (like 0.5 seconds) for sound effect generation duration.

Speech to Text

Repetition handling: Improved detection and handling of repetitions in speech-to-text processing.

Studio

Reader publishing fixes: Added support for mp3_44100_192 output format (high quality) so users below Publisher tier can export audio to Reader.

Mobile

Core app signup: Added signup endpoints for the new Core mobile app.

API

View API changes

New Endpoints

Added 5 new endpoints:
- List voices (v2) - Enhanced voice search capabilities with additional filtering options
- Initiate outbound call - New endpoint for making outbound calls via Twilio integration
- Add pronunciation dictionary from rules - Create pronunciation dictionaries directly from rules without file upload
- Get knowledge base document content - Retrieve full document content from the knowledge base
- Get knowledge base document chunk - Retrieve specific chunks from knowledge base documents

Updated Endpoints

Conversational AI

Updated Conversational AI endpoints:
- Create agent - Added mic_muting_enabled property for UI control and workspace_overrides property for workspace-specific configurations
- Update agent - Added workspace_overrides property for customizing agent behavior per workspace
- Get agent - Added workspace_overrides property to the response
- Get widget - Added mic_muting_enabled property for controlling microphone muting in the widget UI
- Get conversation - Added rag information to view knowledge base content used during conversations
- Create phone number - Replaced generic structure with specific twilio phone number and sip trunk options
- Compute RAG index - Removed force_reindex query parameter for more controlled indexing
- List knowledge base documents - Changed response structure to support different document types
- Get knowledge base document - Modified to return different response models based on document type

Text to Speech

Updated Text to Speech endpoints:
- Convert text to speech - Made properties optional, including stability and similarity settings
- Stream text to speech - Made voice settings properties optional for more flexible streaming requests
- Convert with timestamps - Made settings optional and modified pronunciation_dictionary_locators property
- Stream with timestamps - Made voice settings properties optional for more flexible requests

Speech to Text

Updated Speech to Text endpoint:
- Convert speech to text - Removed biased_keywords property from form data and improved internal repetition detection algorithm

Voice Management

Updated Voice endpoints:
- Get voices - Updated voice settings properties in the response
- Get default voice settings - Made stability and similarity properties optional
- Get voice settings - Made numeric properties optional for more flexible configuration
- Edit voice settings - Made stability and similarity settings optional
- Create voice - Modified array properties to accept null values
- Create voice from preview - Updated voice settings model with optional properties

Studio

Updated Studio endpoints:
- Get project - Added version_rules_num to project metadata
- Get project snapshot - Removed status property
- Create pronunciation dictionaries - Modified pronunciation_dictionary_locators property and string properties to accept null values

Pronunciation Dictionary

Updated Pronunciation Dictionary endpoints:
- Get all pronunciation dictionaries - Added sort and sort_direction query parameters, plus latest_version_rules_num and integer properties to response
- Get pronunciation dictionary - Added latest_version_rules_num and integer properties to response
- Add from file - Added version_rules_num property to response for tracking rules quantity
- Add rules - Added version_rules_num to response for rules tracking
- Remove rules - Added version_rules_num to response for rules tracking