Default LLM update: Changed the default agent LLM from Gemini 1.5 Flash to Gemini 2.0 Flash for improved performance.
Fixed incorrect conversation abandons: Improved detection of conversation continuations, preventing premature abandons when users repeat themselves.
Twilio information in history: Added Twilio call details to conversation history for better tracking.
Knowledge base redesign: Redesigned the knowledge base interface.
System dynamic variables: Added system dynamic variables that expose the time, conversation ID, caller ID, and other system values for use in prompts and tools.
Twilio client initialisation: Added an agent-level override for the Twilio webhook that supplies conversation initiation client data.
RAG chunks in history: Added the chunks retrieved by RAG to the call transcripts in the history view.
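To illustrate the system dynamic variables entry above, here is a minimal sketch of how placeholder substitution in a prompt could work. The `{{name}}` placeholder syntax, the `system__` variable names, the sample values, and the `render_prompt` helper are all assumptions made for this example, not a description of the platform's implementation.

```python
import re
from datetime import datetime, timezone

# Matches placeholders such as {{system__caller_id}} (assumed syntax).
PLACEHOLDER = re.compile(r"\{\{\s*([A-Za-z_][A-Za-z0-9_]*)\s*\}\}")


def render_prompt(template: str, variables: dict[str, str]) -> str:
    """Replace each {{name}} placeholder with its value.

    Unknown names are left untouched so a later stage can fill them.
    """
    def substitute(match: re.Match) -> str:
        name = match.group(1)
        return variables.get(name, match.group(0))

    return PLACEHOLDER.sub(substitute, template)


# Hypothetical system values; real names and formats may differ.
system_variables = {
    "system__time_utc": datetime(2025, 3, 1, 12, 0, tzinfo=timezone.utc).isoformat(),
    "system__conversation_id": "conv_123",
    "system__caller_id": "+15550100",
}

prompt = render_prompt(
    "Call {{system__conversation_id}} from {{system__caller_id}} "
    "at {{system__time_utc}}. Hello {{user_name}}.",
    system_variables,
)
print(prompt)
```

In this sketch, user-defined placeholders such as `{{user_name}}` pass through unchanged, which keeps system values and caller-supplied values independent of each other.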
Speech to Text
Reduced pricing: Reduced the pricing of our Scribe model. See more here.
Improved VAD: Enhanced Voice Activity Detection with better pause detection at segment boundaries and improved handling of silent segments.
Enhanced diarization: Improved speaker clustering with a better ECAPA model, symmetric connectivity matrix, and more selective speaker embedding generation.
Fixed ASR bugs: Resolved issues with VAD rounding, silence, and clustering that affected transcription accuracy.
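The diarization entry above mentions a symmetric connectivity matrix. As a generic illustration only, the sketch below builds a k-nearest-neighbour connectivity matrix over speaker embeddings and then symmetrizes it, so that a link from segment i to segment j implies a link from j to i. The cosine similarity measure, the value of `k`, the toy embeddings, and all function names are assumptions; the changelog does not describe how Scribe builds its matrix.

```python
import math


def cosine(a, b):
    """Cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def knn_connectivity(embeddings, k):
    """Directed 0/1 matrix linking each embedding to its k most similar peers."""
    n = len(embeddings)
    conn = [[0] * n for _ in range(n)]
    for i in range(n):
        scored = sorted(
            ((cosine(embeddings[i], embeddings[j]), j) for j in range(n) if j != i),
            reverse=True,
        )
        for _, j in scored[:k]:
            conn[i][j] = 1
    return conn


def symmetrize(conn):
    """Keep an edge if it exists in either direction."""
    n = len(conn)
    return [[1 if conn[i][j] or conn[j][i] else 0 for j in range(n)] for i in range(n)]


# Toy two-dimensional "speaker embeddings" (illustrative values only).
embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
directed = knn_connectivity(embeddings, k=1)
symmetric = symmetrize(directed)
print(directed)
print(symmetric)
```

A nearest-neighbour graph is generally not symmetric on its own, because j can be i's closest neighbour without the reverse being true. Symmetrizing gives the clustering step an undirected graph to work with.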
ElevenCreative Studio
Disable publishing UI: Added ability to disable the publishing interface for specific workspace members to support enterprise workflows.
Snapshot API improvement: Modified endpoints for project and chapter snapshots to return an empty list instead of throwing errors when snapshots can’t be downloaded.
Disabled auto-moderation: Turned off automatic moderation based on Text to Speech generations in ElevenCreative Studio.
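For the snapshot API entry above, client code can treat an empty list as "no snapshots available" rather than catching an error. The sketch below is a hypothetical helper: the `created_at_unix` and `snapshot_id` field names and the sample records are assumptions made for illustration and are not taken from the API reference.

```python
from typing import Optional


def latest_snapshot(snapshots: list[dict]) -> Optional[dict]:
    """Return the most recent snapshot, or None when the list is empty."""
    if not snapshots:
        return None
    # "created_at_unix" is an assumed field name for this example.
    return max(snapshots, key=lambda snapshot: snapshot["created_at_unix"])


# Hypothetical response bodies.
empty_response: list[dict] = []
populated_response = [
    {"snapshot_id": "snap_a", "created_at_unix": 1700000000},
    {"snapshot_id": "snap_b", "created_at_unix": 1700000500},
]

print(latest_snapshot(empty_response))
print(latest_snapshot(populated_response))
```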
Workspaces
Fixed API key editing: Resolved an issue where editing workspace API keys would reset character limits to zero, causing the keys to stop working.
Optimized free subscriptions: Fixed an issue with refreshing free subscription character limits,