Voices

  • New PVC flow: Added new flow for Professional Voice Clone creation, try it out here

Conversational AI

  • Agent-agent transfer: Added support for agent-to-agent transfers via a new system tool, enabling more complex conversational flows. See the Agent Transfer tool documentation for details.
  • Enhanced tool debugging: Improved how tool execution details are displayed in the conversation history for easier debugging.
  • Language detection fix: Resolved an issue regarding the forced calling of the language detection tool.

Dubbing

  • Render endpoint: Introduced a new endpoint to regenerate audio or video renders for specific languages within a dubbing project. This automatically handles missing transcriptions or translations. See the Render Dub endpoint.
  • Increased size limit: Raised the maximum allowed file size for dubbing projects to 1 GiB.

API

New Endpoints

Updated Endpoints

Pronunciation Dictionaries

Speech to Text

  • Updated Speech to Text endpoint (POST /v1/speech-to-text):
    • Added cloud_storage_url parameter to allow transcription directly from public S3 or GCS URLs (up to 2GB).
    • Made the file parameter optional; exactly one of file or cloud_storage_url must now be provided.

Speech to Speech

Conversational AI

Voices

AudioNative