Speech to text

  • scribe_v1_experimental: Launched a new experimental preview of the Scribe v1 model with improvements including improved performance on audio files with multiple languages, reduced hallucinations when audio is interleaved with silence, and improved audio tags. The new model is available via the API under the model name scribe_v1_experimental

Text to speech

  • A-law format support: Added a-law format with 8kHz sample rate to enable integration with European telephony systems.
  • Fixed quota issues: Fixed a database bug that caused some requests to be mistakenly rejected as exceeding their quota.

Conversational AI

  • Document type filtering: Added support for filtering knowledge base documents by their type (file, URL, or text).
  • Non-audio agents: Added support for conversational agents that don’t output audio but still send response transcripts and can use tools. Non-audio agents can be enabled by removing the audio client event.
  • Improved agent templates: Updated all agent templates with enhanced configurations and prompts. See more about how to improve system prompts here.
  • Fixed stuck exports: Fixed an issue that caused exports to be stuck for extended periods.

Studio

  • Fixed volume normalization: Fixed issue with streaming project snapshots when volume normalization is enabled.

New API endpoints

  • Forced alignment: Added new forced alignment endpoint for aligning audio with text, perfect for subtitle generation.
  • Batch calling: Added batch calling endpoint for scheduling calls to multiple recipients

API

New Endpoints

Updated Endpoints

Text to Speech

Voices

  • Get voices - Added collection_id parameter for filtering voices by collection

Knowledge Base

  • Get knowledge base - Added types parameter for filtering documents by type
  • General endpoint for creating knowledge base documents marked as deprecated in favor of specialized endpoints

User Subscription

  • Get user subscription - Added professional_voice_slots_used property to track number of professional voices used in a workspace

Conversational AI

  • Added silence_end_call_timeout parameter to set maximum wait time before terminating a call
  • Removed /v1/convai/agents/{agent_id}/add-secret endpoint (now handled by workspace secrets endpoints)