Changelog

Voices

New PVC flow: Added new flow for Professional Voice Clone creation, try it out here

Agents Platform

Agent-agent transfer: Added support for agent-to-agent transfers via a new system tool, enabling more complex conversational flows. See the Agent Transfer tool documentation for details.
Enhanced tool debugging: Improved how tool execution details are displayed in the conversation history for easier debugging.
Language detection fix: Resolved an issue regarding the forced calling of the language detection tool.

Dubbing

Render endpoint: Introduced a new endpoint to regenerate audio or video renders for specific languages within a dubbing project. This automatically handles missing transcriptions or translations. See the Render Dub endpoint.
Increased size limit: Raised the maximum allowed file size for dubbing projects to 1 GiB.

API

View API changes

New Endpoints

Added render dub endpoint - Regenerate dubs for a specific language.

Updated Endpoints

Pronunciation Dictionaries

Updated the response for the GET /v1/pronunciation-dictionaries/{pronunciation_dictionary_id}/ endpoint and related components to include the permission_on_resource field.

Speech to Text

Updated Speech to Text endpoint (POST /v1/speech-to-text):
- Added cloud_storage_url parameter to allow transcription directly from public S3 or GCS URLs (up to 2GB).
- Made the file parameter optional; exactly one of file or cloud_storage_url must now be provided.

Speech to Speech

Added optional file_format parameter (pcm_s16le_16 or other) for lower latency with PCM input to POST /v1/speech-to-speech/{voice_id}

Agents Platform

Updated components to support agent-agent transfer tool

Voices

Updated GET /v1/voices/{voice_id} samples field to include optional trim_start and trim_end parameters.

AudioNative

Updated Get /v1/audio-native/{project_id}/settings to include status field (processing or ready).