Text to Voice

  • Voice Design: Launched new Text to Voice Design with Eleven v3 for creating custom voices from text descriptions.

Speech to Text

  • Enhanced Diarization: Added diarization_threshold parameter to the Speech to Text endpoint. Fine-tune the balance between speaker accuracy and total speaker count by adjusting the threshold between 0.1 and 0.4.

Professional Voice Cloning

  • Background Noise Removal: Added remove_background_noise to clean up voice samples using audio isolation models for better quality training data.

Studio

Workspaces

  • Service Account Groups: Service accounts can now be added to workspace groups for better permission management and access control.

  • Workspace Authentication: Added support for workspace authentication connections, enabling secure webhook tool integrations with external services.

SDKs

  • Python SDK: Released v2.6.0 with latest API support and bug fixes.
  • JavaScript SDK: Released v2.5.0 with latest API support and bug fixes.
  • React Conversational AI SDK: Added WebRTC support in 0.2.0

API

New Endpoints

Updated Endpoints

Speech to Text

Voice Management

  • Get voice sample audio - Added remove_background_noise query parameter and moved from request body to query parameters