Speech to text
scribe_v1_experimental
: Launched a new experimental preview of the Scribe v1 model with improvements including improved performance on audio files with multiple languages, reduced hallucinations when audio is interleaved with silence, and improved audio tags. The new model is available via the API under the model namescribe_v1_experimental
Text to speech
- A-law format support: Added a-law format with 8kHz sample rate to enable integration with European telephony systems.
- Fixed quota issues: Fixed a database bug that caused some requests to be mistakenly rejected as exceeding their quota.
Conversational AI
- Document type filtering: Added support for filtering knowledge base documents by their type (file, URL, or text).
- Non-audio agents: Added support for conversational agents that don’t output audio but still send response transcripts and can use tools. Non-audio agents can be enabled by removing the audio client event.
- Improved agent templates: Updated all agent templates with enhanced configurations and prompts. See more about how to improve system prompts here.
- Fixed stuck exports: Fixed an issue that caused exports to be stuck for extended periods.
Studio
- Fixed volume normalization: Fixed issue with streaming project snapshots when volume normalization is enabled.
New API endpoints
- Forced alignment: Added new forced alignment endpoint for aligning audio with text, perfect for subtitle generation.
- Batch calling: Added batch calling endpoint for scheduling calls to multiple recipients
API
View API changes
New Endpoints
- Added Forced alignment endpoint for aligning audio with text
- Added dedicated endpoints for knowledge base document types:
Updated Endpoints
Text to Speech
- Added a-law format (8kHz) to all audio endpoints:
Voices
- Get voices - Added
collection_id
parameter for filtering voices by collection
Knowledge Base
- Get knowledge base - Added
types
parameter for filtering documents by type - General endpoint for creating knowledge base documents marked as deprecated in favor of specialized endpoints
User Subscription
- Get user subscription - Added
professional_voice_slots_used
property to track number of professional voices used in a workspace
Conversational AI
- Added
silence_end_call_timeout
parameter to set maximum wait time before terminating a call - Removed
/v1/convai/agents/{agent_id}/add-secret
endpoint (now handled by workspace secrets endpoints)