Forced Aligment
- Forced alignment improvements: Fixed a rare failure case in forced alignment processing to improve reliability.
Voices
- Live moderated voices filter: Added
include_live_moderated
query parameter to the shared voices endpoint, allowing you to include or exclude voices that are live moderated.
Conversational AI
- Secret dynamic variables: Added support for specifying dynamic variables as secrets with the
secret__
prefix. Secret dynamic variables can only be used in webhook tool headers and are never sent to an LLM, enhancing security for sensitive data. Learn more. - Skip turn system tool: Introduced a new system tool called skip_turn. When enabled, the agent will skip its turn if the user explicitly indicates they need a moment to think or perform an action (e.g., “just a sec”, “give me a minute”). This prevents turn timeout from being triggered during intentional user pauses. See the skip turn tool docs for more information.
- Text input support: Added text input support in websocket connections via “user_message” event with text field. Also added “user_activity” event support to indicate typing or other UI activity, improving agent turn-taking when there’s interleaved text and audio input.
- RAG chunk limit: Added ability to configure the maximum number of chunks collected during RAG retrieval, giving users more control over context window usage and costs.
- Enhanced widget configuration: Expanded widget customization options to include text input and text only mode.
- LLM usage calculator: Introduced tools to calculate expected LLM token usage and costs for agents, helping with cost estimation and planning.
Audio Native
- Accessibility improvements: Enhanced accessibility for the AudioNative player with multiple improvements:
- Added aria-labels for all buttons
- Enabled keyboard navigation for all interactive elements
- Made progress bar handle focusable and keyboard-accessible
- Improved focus indicator visibility for better screen reader compatibility
API
View API changes
New Endpoints
- Added 3 new endpoints:
- Get Agent Knowledge Base Size - Returns the number of pages in the agent’s knowledge base.
- Calculate Agent LLM Usage - Calculates expected number of LLM tokens needed for the specified agent.
- Calculate LLM Usage - Returns a list of LLM models and the expected cost for using them based on the provided values.
Updated Endpoints
Voices
- Get Shared Voices - Added
include_live_moderated
query parameter toGET /v1/shared-voices
to filter voices by live moderation status.
Conversational AI
-
Agent Configuration:
- Enhanced system tools with new
skip_turn
tool configuration - Improved RAG configuration with
max_retrieved_rag_chunks_count
parameter
- Enhanced system tools with new
-
Widget Configuration:
- Added support for text-only mode
-
Batch Calling:
- Batch call responses now include
phone_provider
field with default value “twilio”
- Batch call responses now include
Text to Speech
- Voice Settings:
- Added
quality
parameter to voice settings for controlling audio generation quality - Model response schema updated to include
can_use_quality
field
- Added