Voices
- List Voices V2: Added a new V2 voice search endpoint with better search and additional filtering options
Conversational AI
- Native outbound calling: Added native outbound calling for Twilio-configured numbers, eliminating the need for complex setup configurations. Outbound calls are now visible in the Call History page.
- Automatic language detection: Added new system tool for automatic language detection that enables agents to switch languages based on both explicit user requests (“Let’s talk in Spanish”) and implicit language in user audio.
- Pronunciation dictionary improvements: Fixed phoneme tags in pronunciation dictionaries to work correctly with conversational AI.
- Large RAG document viewing: Added ability to view the entire content of large RAG documents in the knowledge base.
- Customizable widget controls: Updated UI to include an optional mute microphone button and made widget icons customizable via slots.
Sound Effects
- Fractional duration support: Fixed an issue where users couldn’t enter fractional values (like 0.5 seconds) for sound effect generation duration.
Speech to Text
- Repetition handling: Improved detection and handling of repetitions in speech-to-text processing.
Studio
- Reader publishing fixes: Added support for mp3_44100_192 output format (high quality) so users below Publisher tier can export audio to Reader.
Mobile
- Core app signup: Added signup endpoints for the new Core mobile app.
API
View API changes
New Endpoints
- Added 5 new endpoints:
- List voices (v2) - Enhanced voice search capabilities with additional filtering options
- Initiate outbound call - New endpoint for making outbound calls via Twilio integration
- Add pronunciation dictionary from rules - Create pronunciation dictionaries directly from rules without file upload
- Get knowledge base document content - Retrieve full document content from the knowledge base
- Get knowledge base document chunk - Retrieve specific chunks from knowledge base documents
Updated Endpoints
Conversational AI
- Updated Conversational AI endpoints:
- Create agent - Added
mic_muting_enabled
property for UI control andworkspace_overrides
property for workspace-specific configurations - Update agent - Added
workspace_overrides
property for customizing agent behavior per workspace - Get agent - Added
workspace_overrides
property to the response - Get widget - Added
mic_muting_enabled
property for controlling microphone muting in the widget UI - Get conversation - Added rag information to view knowledge base content used during conversations
- Create phone number - Replaced generic structure with specific twilio phone number and sip trunk options
- Compute RAG index - Removed
force_reindex
query parameter for more controlled indexing - List knowledge base documents - Changed response structure to support different document types
- Get knowledge base document - Modified to return different response models based on document type
- Create agent - Added
Text to Speech
- Updated Text to Speech endpoints:
- Convert text to speech - Made properties optional, including
stability
andsimilarity
settings - Stream text to speech - Made voice settings properties optional for more flexible streaming requests
- Convert with timestamps - Made settings optional and modified
pronunciation_dictionary_locators
property - Stream with timestamps - Made voice settings properties optional for more flexible requests
- Convert text to speech - Made properties optional, including
Speech to Text
- Updated Speech to Text endpoint:
- Convert speech to text - Removed
biased_keywords
property from form data and improved internal repetition detection algorithm
- Convert speech to text - Removed
Voice Management
- Updated Voice endpoints:
- Get voices - Updated voice settings properties in the response
- Get default voice settings - Made
stability
andsimilarity
properties optional - Get voice settings - Made numeric properties optional for more flexible configuration
- Edit voice settings - Made
stability
andsimilarity
settings optional - Create voice - Modified array properties to accept null values
- Create voice from preview - Updated voice settings model with optional properties
Studio
- Updated Studio endpoints:
- Get project - Added
version_rules_num
to project metadata - Get project snapshot - Removed
status
property - Create pronunciation dictionaries - Modified
pronunciation_dictionary_locators
property and string properties to accept null values
- Get project - Added
Pronunciation Dictionary
- Updated Pronunciation Dictionary endpoints:
- Get all pronunciation dictionaries - Added
sort
andsort_direction
query parameters, pluslatest_version_rules_num
andinteger
properties to response - Get pronunciation dictionary - Added
latest_version_rules_num
andinteger
properties to response - Add from file - Added
version_rules_num
property to response for tracking rules quantity - Add rules - Added
version_rules_num
to response for rules tracking - Remove rules - Added
version_rules_num
to response for rules tracking
- Get all pronunciation dictionaries - Added