Voices
- New PVC flow: Added a new flow for creating Professional Voice Clones (PVC).
Conversational AI
- Agent-agent transfer: Added support for agent-to-agent transfers via a new system tool, enabling more complex conversational flows. See the Agent Transfer tool documentation for details.
- Enhanced tool debugging: Improved how tool execution details are displayed in the conversation history for easier debugging.
- Language detection fix: Resolved an issue regarding the forced calling of the language detection tool.
Dubbing
- Render endpoint: Introduced a new endpoint to regenerate audio or video renders for specific languages within a dubbing project. This automatically handles missing transcriptions or translations. See the Render Dub endpoint.
- Increased size limit: Raised the maximum allowed file size for dubbing projects to 1 GiB.
API
New Endpoints
- Added render dub endpoint - Regenerate dubs for a specific language.
Updated Endpoints
Pronunciation Dictionaries
- Updated the response for the GET /v1/pronunciation-dictionaries/{pronunciation_dictionary_id}/ endpoint and related components to include the permission_on_resource field.
Speech to Text
- Updated Speech to Text endpoint (POST /v1/speech-to-text):
  - Added cloud_storage_url parameter to allow transcription directly from public S3 or GCS URLs (up to 2 GB).
  - Made the file parameter optional; exactly one of file or cloud_storage_url must now be provided.
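The "exactly one of file or cloud_storage_url" rule can be checked on the client before a request is sent. The following is a minimal sketch, not the official client: build_stt_source is a hypothetical helper name, it makes no network call, and how the returned field is attached to the request (multipart upload versus plain form value) is an assumption that this changelog does not specify.

```python
from typing import Optional


def build_stt_source(file_path: Optional[str] = None,
                     cloud_storage_url: Optional[str] = None) -> dict:
    """Return the single source field for POST /v1/speech-to-text.

    Hypothetical helper: enforces that exactly one of `file` or
    `cloud_storage_url` is supplied, as the updated endpoint requires.
    """
    provided = [v for v in (file_path, cloud_storage_url) if v is not None]
    if len(provided) != 1:
        raise ValueError("Provide exactly one of file or cloud_storage_url")
    if cloud_storage_url is not None:
        # Transcribe directly from a public S3 or GCS URL.
        return {"cloud_storage_url": cloud_storage_url}
    # Fall back to a local file path that the caller will upload.
    return {"file": file_path}
```

Passing both arguments, or neither, raises ValueError, so a misconfigured call fails locally instead of at the API.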
Speech to Speech
- Added optional file_format parameter (pcm_s16le_16 or other) to POST /v1/speech-to-speech/{voice_id} for lower latency with PCM input.
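As a rough illustration of the new parameter, the sketch below builds the request path and the optional form field while rejecting values other than the two listed above. build_sts_request is a hypothetical helper name; sending file_format as a form field, and omitting it entirely when unset, are assumptions rather than documented behavior.

```python
from typing import Optional, Tuple

# The two values named in this changelog entry.
ALLOWED_FILE_FORMATS = {"pcm_s16le_16", "other"}


def build_sts_request(voice_id: str,
                      file_format: Optional[str] = None) -> Tuple[str, dict]:
    """Return (path, form_fields) for a speech-to-speech call.

    Hypothetical helper: `file_format` is optional, so it is only
    included in the form fields when the caller sets it.
    """
    fields = {}
    if file_format is not None:
        if file_format not in ALLOWED_FILE_FORMATS:
            raise ValueError(f"Unsupported file_format: {file_format!r}")
        fields["file_format"] = file_format
    return f"/v1/speech-to-speech/{voice_id}", fields
```

For PCM input, a caller would pass file_format="pcm_s16le_16"; leaving the argument out keeps the request identical to one made before this change.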
Conversational AI
- Updated components to support the agent-agent transfer tool.
Voices
- Updated the samples field of GET /v1/voices/{voice_id} to include optional trim_start and trim_end parameters.
AudioNative
- Updated GET /v1/audio-native/{project_id}/settings to include a status field (processing or ready).
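One way to use the new status field is to poll the settings endpoint until the project reports ready. This is a sketch under stated assumptions: wait_until_ready and fetch_settings are hypothetical names, fetch_settings is supplied by the caller and is expected to perform the GET request and return the parsed JSON, and reading status from the top level of the response is an assumption.

```python
import time
from typing import Callable


def wait_until_ready(fetch_settings: Callable[[], dict],
                     interval_s: float = 2.0,
                     max_attempts: int = 30,
                     sleep: Callable[[float], None] = time.sleep) -> bool:
    """Poll until the settings response reports status == "ready".

    Returns True once the status is "ready", or False if
    `max_attempts` polls all return something else.
    """
    for attempt in range(max_attempts):
        # Assumption: "status" sits at the top level of the response.
        if fetch_settings().get("status") == "ready":
            return True
        # Do not sleep after the final attempt.
        if attempt < max_attempts - 1:
            sleep(interval_s)
    return False
```

Injecting sleep keeps the helper easy to test without real delays; the interval and attempt limit shown are arbitrary defaults, not recommended values.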