Professional Voice Cloning (PVC)

  • PVC API: Introduced a comprehensive suite of API endpoints for managing Professional Voice Clones (PVC). You can now programmatically create voices; add, manage, and delete audio samples; retrieve audio and waveforms; manage speaker separation; handle verification; and initiate training. For a full list of new endpoints, see the API changes summary below or the PVC API reference.

Speech to Text

  • Enhanced Export Options: Added options to include or exclude timestamps and speaker IDs when exporting Speech to Text results in segmented JSON format via the API.
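
To make the effect of these options concrete, the following is a minimal local sketch of what including or excluding timestamps and speaker IDs does to segmented output. The option names `include_timestamps` and `include_speakers`, the segment shape, and the `apply_export_options` helper are assumptions made for illustration, not the documented API schema; consult the Speech to Text API reference for the actual parameter names.

```python
from dataclasses import dataclass


@dataclass
class SegmentedJsonOptions:
    # Hypothetical option names; the real API parameters may differ.
    include_timestamps: bool = True
    include_speakers: bool = True


def apply_export_options(segments, options):
    """Return copies of the segments with the optional fields kept or dropped."""
    exported = []
    for segment in segments:
        item = {"text": segment["text"]}
        if options.include_timestamps:
            item["start"] = segment["start"]
            item["end"] = segment["end"]
        if options.include_speakers:
            item["speaker_id"] = segment["speaker_id"]
        exported.append(item)
    return exported


# Made-up sample segments, for illustration only.
segments = [
    {"text": "Hello there.", "start": 0.0, "end": 1.2, "speaker_id": "speaker_0"},
    {"text": "Hi!", "start": 1.3, "end": 1.8, "speaker_id": "speaker_1"},
]

# Exclude both timestamps and speaker IDs, leaving only the text.
text_only = apply_export_options(
    segments,
    SegmentedJsonOptions(include_timestamps=False, include_speakers=False),
)
```

With both options disabled, each exported segment carries only its text; with the defaults, the timing and speaker fields are kept.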

Conversational AI

  • New LLM Models: Added support for the new GPT-4.1 models: gpt-4.1, gpt-4.1-mini, and gpt-4.1-nano.
  • VAD Score: Added a new client event that sends voice activity detection (VAD) scores to the client.
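
A client might consume the VAD score event roughly as follows. This is a sketch under stated assumptions: the payload shape (`type` set to `vad_score`, with the value nested under `vad_score_event.vad_score`), the helper names, and the 0.5 threshold are all illustrative and should be checked against the client events reference.

```python
import json


def extract_vad_score(raw_message):
    """Return the VAD score from a client event, or None for any other event.

    Assumed (hypothetical) payload shape:
    {"type": "vad_score", "vad_score_event": {"vad_score": 0.87}}
    """
    event = json.loads(raw_message)
    if event.get("type") != "vad_score":
        return None
    payload = event.get("vad_score_event") or {}
    score = payload.get("vad_score")
    return float(score) if score is not None else None


def is_user_speaking(score, threshold=0.5):
    # The 0.5 threshold is an arbitrary illustrative value, not a recommended setting.
    return score is not None and score >= threshold
```

Returning `None` for unrelated events lets the same handler sit in a general message loop without raising on event types it does not know.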

Workspace

  • Member Management: Added a new API endpoint that allows administrators to delete workspace members.

API

New Endpoints

Updated Endpoints

Speech to Text

  • Updated endpoint with changes:

Schema Changes

Conversational AI

  • GET conversation details: Added the boolean fields has_audio, has_user_audio, and has_response_audio.
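
The new boolean fields can be read defensively, for example before attempting to fetch audio. The sketch below uses only the three field names listed above; the `describe_audio` helper, the sample dictionaries, and the assumption that `has_audio` is false whenever no track exists are illustrative.

```python
def describe_audio(conversation):
    """Summarize which audio tracks a conversation details response reports."""
    # Treat missing fields as False so older responses do not raise.
    if not conversation.get("has_audio", False):
        return "no audio"
    parts = []
    if conversation.get("has_user_audio", False):
        parts.append("user")
    if conversation.get("has_response_audio", False):
        parts.append("response")
    return " + ".join(parts) if parts else "audio (track unknown)"


# Made-up response fragments, for illustration only.
both_tracks = {"has_audio": True, "has_user_audio": True, "has_response_audio": True}
response_only = {"has_audio": True, "has_user_audio": False, "has_response_audio": True}
legacy = {}
```

Defaulting absent fields to `False` keeps the helper safe for responses that predate this schema change.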

Dubbing

  • GET dubbing resource: Added a status field to each render.
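
One possible use of the per-render status field is grouping renders by state. In this sketch only the `status` field name comes from the change above; the list-of-dicts shape, the `id` key, the status values, and the `group_renders_by_status` helper are hypothetical and may not match the real response.

```python
from collections import defaultdict


def group_renders_by_status(renders):
    """Map each status value to the list of render IDs that report it."""
    grouped = defaultdict(list)
    for render in renders:
        # Renders without a status (e.g. from older responses) land under "unknown".
        grouped[render.get("status", "unknown")].append(render["id"])
    return dict(grouped)


# Made-up renders with hypothetical status values, for illustration only.
renders = [
    {"id": "render_a", "status": "complete"},
    {"id": "render_b", "status": "processing"},
    {"id": "render_c", "status": "complete"},
    {"id": "render_d"},
]
by_status = group_renders_by_status(renders)
```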