Conversational AI

  • Default LLM update: Changed the default agent LLM from Gemini 1.5 Flash to Gemini 2.0 Flash for improved performance.
  • Fixed incorrect conversation abandons: Improved detection of conversation continuations, preventing premature abandons when users repeat themselves.
  • Twilio information in history: Added Twilio call details to conversation history for better tracking.
  • Knowledge base redesign: Redesigned the knowledge base interface.
  • System dynamic variables: Added system dynamic variables so that the time, conversation ID, caller ID and other system values can be used as dynamic variables in prompts and tools.
  • Twilio client initialisation: Added an agent-level override for the conversation initiation client data Twilio webhook.
  • RAG chunks in history: Added the chunks retrieved by RAG to the call transcripts in the history view.
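
To illustrate the idea behind system dynamic variables, here is a minimal local sketch of placeholder substitution. The platform performs the real substitution itself; the `{{name}}` syntax, the variable names and the `render_prompt` helper below are assumptions made for illustration only, not the documented API.

```python
import re
from datetime import datetime, timezone


def render_prompt(template: str, system_values: dict) -> str:
    """Replace {{name}} placeholders with system values.

    Placeholders with no matching value are left untouched so that
    missing variables stay visible instead of silently disappearing.
    """
    def substitute(match: re.Match) -> str:
        name = match.group(1)
        return str(system_values.get(name, match.group(0)))

    return re.sub(r"\{\{\s*([A-Za-z0-9_]+)\s*\}\}", substitute, template)


# Hypothetical variable names; the names exposed by the platform may differ.
system_values = {
    "conversation_id": "conv_123",
    "caller_id": "+15550100",
    "time_utc": datetime(2025, 3, 1, 12, 0, tzinfo=timezone.utc).isoformat(),
}

prompt = render_prompt(
    "Call from {{caller_id}} at {{time_utc}} (ref {{conversation_id}}).",
    system_values,
)
```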

Speech to Text

  • Reduced pricing: Reduced the pricing of our Scribe model.
  • Improved VAD detection: Enhanced Voice Activity Detection with better pause detection at segment boundaries and improved handling of silent segments.
  • Enhanced diarization: Improved speaker clustering with a better ECAPA model, symmetric connectivity matrix, and more selective speaker embedding generation.
  • Fixed ASR bugs: Resolved issues with VAD rounding, silence and clustering that affected transcription accuracy.

Studio

  • Disable publishing UI: Added ability to disable the publishing interface for specific workspace members to support enterprise workflows.
  • Snapshot API improvement: Modified endpoints for project and chapter snapshots to return an empty list instead of throwing errors when snapshots can’t be downloaded.
  • Disabled auto-moderation: Turned off automatic moderation based on Text to Speech generations in Studio.
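
Because the snapshot endpoints now return an empty list rather than an error when snapshots can't be downloaded, client code may need an explicit empty-list branch. The sketch below assumes the response has already been parsed into a list of dictionaries; the `created_at_unix` field and the `pick_latest_snapshot` helper are hypothetical names used only for illustration.

```python
from typing import Optional


def pick_latest_snapshot(snapshots: list) -> Optional[dict]:
    """Return the most recent snapshot, or None when the list is empty.

    An empty list is treated as a normal result ("nothing to download"),
    not as a failure, so no exception handling is needed for that case.
    """
    if not snapshots:
        return None
    # "created_at_unix" is an assumed field name; missing values sort first.
    return max(snapshots, key=lambda s: s.get("created_at_unix", 0))
```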

Workspaces

  • Fixed API key editing: Resolved an issue where editing workspace API keys would reset character limits to zero, causing the keys to stop working.
  • Optimized free subscriptions: Fixed an issue with refreshing free subscription character limits.

API

New Endpoints

Updated Endpoints

Dubbing

Project Management

  • Updated Project endpoints:
    • Add project - Made metadata, project_name, description nullable
    • Create podcast - Made title, description, author nullable
    • Get project - Made last_modified_at, created_at, project_name nullable
    • Add chapter - Made chapter_id, word_count, statistics nullable
    • Update chapter - Made content and blocks properties nullable

Conversational AI

  • Updated Conversational AI endpoints:
    • Update agent - Made conversation_config, platform_settings nullable and added workspace_overrides property
    • Create agent - Made agent_name, prompt, widget_config nullable and added workspace_overrides property
    • Add to knowledge base - Made document_name nullable
    • Get conversation - Added twilio_call_data model and made transcript, metadata nullable
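
Since `transcript` and `metadata` on the Get conversation response can now be null, and `twilio_call_data` may or may not be present, client code that reads these fields should tolerate missing values. The following is a minimal sketch that assumes the response has been parsed into a dictionary and that `transcript` is a list of turns; the `summarize_conversation` helper and the summary keys are hypothetical.

```python
from typing import Any


def summarize_conversation(payload: dict) -> dict:
    """Build a small summary that tolerates null or absent fields."""
    # `or` covers both a missing key and an explicit null (None) value.
    transcript = payload.get("transcript") or []
    metadata = payload.get("metadata") or {}
    twilio: Any = payload.get("twilio_call_data")

    return {
        "turns": len(transcript),
        "has_metadata": bool(metadata),
        "via_twilio": twilio is not None,
    }
```

For example, `summarize_conversation({"transcript": None, "metadata": None})` yields zero turns, no metadata and `via_twilio` set to `False`, rather than raising on the null fields.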

Text to Speech

Voice Management

  • Updated Voice endpoints:

Speech to Text

Other Updates