Conversational AI

  • Azure OpenAI custom LLM support: Added support for Azure-hosted OpenAI models in custom LLM configurations. When using an Azure endpoint, a new required field for API version is now available in the UI.
  • Genesys output variables: Added support for output variables when using Genesys integrations, enabling better call analytics and data collection.
  • Gemini 2.5 Preview Models Deprecation: Models gemini-2.5-flash-preview-05-20 and gemini-2.5-flash-preview-04-17 have been deprecated in Conversational AI as they are being deprecated on 15th July by Google. All agents using these models will automatically be transferred to gemini-2.5-flash the next time they are used. No action is required.
  • WebRTC rollout: Began progressive rollout of WebRTC capabilities for improved connection stability and performance. WebRTC mode can be selected in the React SDK and is used in 11.ai.
  • Keypad touch tone: Fixed an issue affecting playing keypad touch tones on Twilio. See keypad touch tone documentation.

Voices

  • Language collection navigation: Added quick navigation from language preview collections to view all available voices in that language, making it easier to explore voice options by language.

Text to Voice

  • Preview streaming: Added new streaming endpoint for Text to Voice previews, allowing real-time streaming of generated voice previews via /v1/text-to-voice/{generated_voice_id}/stream.
  • Enhanced voice design: Added stream_previews option to voice design endpoint, enabling streaming-only preview generation for improved performance.
  • Improved parameter controls: Enhanced loudness, quality, and guidance scale parameters with better control options for more precise voice generation.

Studio

  • Podcast customization: Added support for intro and outro text in podcast creation, along with custom instructions prompts for better style and tone control.

SDKs

API

New Endpoints

Updated Endpoints

Text to Voice

  • Create voice previews - Enhanced loudness, quality, and guidance_scale parameter descriptions
  • Design voice - Added stream_previews property for streaming-only preview generation

Studio

  • Create podcast - Added intro, outro, and instructions_prompt properties

Conversational AI