Voices
- New PVC flow: Added a new flow for Professional Voice Clone creation. Try it out here.
Agents Platform
- Agent-agent transfer: Added support for agent-to-agent transfers via a new system tool, enabling more complex conversational flows. See the Agent Transfer tool documentation for details.
- Enhanced tool debugging: Improved how tool execution details are displayed in the conversation history for easier debugging.
- Language detection fix: Resolved an issue regarding the forced calling of the language detection tool.
Dubbing
- Render endpoint: Introduced a new endpoint to regenerate audio or video renders for specific languages within a dubbing project. This automatically handles missing transcriptions or translations. See the Render Dub endpoint.
- Increased size limit: Raised the maximum allowed file size for dubbing projects to 1 GiB.
API
New Endpoints
- Added render dub endpoint - Regenerate dubs for a specific language.
Updated Endpoints
Pronunciation Dictionaries
- Updated the response for the `GET /v1/pronunciation-dictionaries/{pronunciation_dictionary_id}/` endpoint and related components to include the `permission_on_resource` field.
Speech to Text
- Updated Speech to Text endpoint (`POST /v1/speech-to-text`):
  - Added `cloud_storage_url` parameter to allow transcription directly from public S3 or GCS URLs (up to 2 GB).
  - Made the `file` parameter optional; exactly one of `file` or `cloud_storage_url` must now be provided.
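The "exactly one of `file` or `cloud_storage_url`" rule can be checked on the client before a request is sent. The following is a minimal sketch of that check only. The helper name `build_stt_source` and the shape of its return value are hypothetical, and other request fields (authentication, model selection) are deliberately left out.

```python
from typing import Dict, Optional


def build_stt_source(
    file_path: Optional[str] = None,
    cloud_storage_url: Optional[str] = None,
) -> Dict[str, str]:
    """Return the single audio-source field for POST /v1/speech-to-text.

    Hypothetical helper: it only mirrors the changelog rule that exactly
    one of `file` or `cloud_storage_url` must be provided.
    """
    # Both missing or both present violates the "exactly one" rule.
    if (file_path is None) == (cloud_storage_url is None):
        raise ValueError("Provide exactly one of file or cloud_storage_url")

    if cloud_storage_url is not None:
        # Sent as an ordinary form field; the service fetches the audio itself.
        return {"cloud_storage_url": cloud_storage_url}

    # Caller is expected to open this path and upload it as the `file` part.
    return {"file": file_path}
```

A caller would merge the returned field into the rest of the multipart form, uploading the file contents when the key is `file` and passing the URL string through when the key is `cloud_storage_url`.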
Speech to Speech
- Added optional `file_format` parameter (`pcm_s16le_16` or `other`) to `POST /v1/speech-to-speech/{voice_id}` for lower latency with PCM input.
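One way a client might decide which `file_format` value to send is to inspect the input audio first. The sketch below assumes that `pcm_s16le_16` means 16-bit little-endian PCM, mono, at 16 kHz, and that the endpoint expects raw samples without a WAV header in that case; both readings are assumptions drawn from the value's name, not from the changelog. The helper `pick_file_format` is hypothetical.

```python
import io
import wave
from typing import Tuple


def pick_file_format(wav_bytes: bytes) -> Tuple[str, bytes]:
    """Choose a `file_format` value and payload for a WAV input.

    Assumption: `pcm_s16le_16` = 16-bit PCM, 16 kHz, mono, sent as raw
    samples. Anything else is passed through unchanged as `other`.
    """
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        is_target_pcm = (
            wav.getsampwidth() == 2          # 16-bit samples
            and wav.getframerate() == 16000  # 16 kHz
            and wav.getnchannels() == 1      # mono
        )
        if is_target_pcm:
            # Strip the WAV header and return only the sample data.
            return "pcm_s16le_16", wav.readframes(wav.getnframes())

    # Not matching PCM: send the original bytes and let the server decode.
    return "other", wav_bytes
```

The returned string would go in the `file_format` form field and the returned bytes in the audio upload.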
Agents Platform
- Updated components to support the agent-agent transfer tool.
Voices
- Updated the `samples` field of `GET /v1/voices/{voice_id}` to include optional `trim_start` and `trim_end` parameters.
AudioNative
- Updated `GET /v1/audio-native/{project_id}/settings` to include a `status` field (`processing` or `ready`).
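With a `status` field available, a client could poll the settings endpoint until a project finishes processing. The sketch below shows one possible polling loop. It assumes `status` appears as a top-level key in the parsed response, and `fetch_settings` is a hypothetical callable supplied by the caller that performs the actual `GET /v1/audio-native/{project_id}/settings` request and returns the decoded JSON.

```python
import time
from typing import Any, Callable, Mapping


def wait_until_ready(
    fetch_settings: Callable[[str], Mapping[str, Any]],
    project_id: str,
    interval_s: float = 2.0,
    max_attempts: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> Mapping[str, Any]:
    """Poll settings until `status` is `ready`, then return them.

    Assumption: the response is a mapping with a top-level `status` key
    whose value is either `processing` or `ready`.
    """
    for attempt in range(max_attempts):
        settings = fetch_settings(project_id)
        status = settings.get("status")
        if status == "ready":
            return settings
        if status != "processing":
            # Fail loudly on values the changelog does not mention.
            raise ValueError(f"Unexpected status: {status!r}")
        if attempt < max_attempts - 1:
            sleep(interval_s)
    raise TimeoutError(f"Project {project_id} not ready after {max_attempts} attempts")
```

Passing `sleep` and `fetch_settings` as parameters keeps the loop free of network and timing dependencies, so it can be exercised with stubs.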