Introducing ElevenLabs Conversational AI 2.0

Conversational AI 2.0 launches with advanced features and enterprise readiness.


Conversational AI 2.0 is a significant evolution of our platform, designed to enable the creation of the most sophisticated, capable, and trustworthy voice agents in the world. Building on the foundation laid just five months ago, this release introduces major improvements and comprehensive enterprise readiness, marking a new era of communication and understanding.

Feature Area | Conversational AI v1 | Conversational AI v2
Interaction Flow | Basic Conversational API | State-of-the-Art Turn-Taking Model
Knowledge Access | N/A | Integrated RAG (Low Latency, High Privacy)
Multilingual | Manual Switching | Integrated Automatic Language Detection
Personas | Single Voice Per Agent | Multi-Character Switching within Single Agent
Enterprise Readiness | Standard Security | HIPAA Compliance, EU Residency, Enhanced Security & Reliability
Modality | Voice only | Voice-only, text-only, and voice+text
Telephony Support | Twilio inbound only | Full inbound+outbound support, with batch call scheduling and fully-fledged SIP trunking integration

Building more human-like interactions

At the heart of effective communication lies natural interaction flow. Conversational AI 2.0 introduces custom models specifically designed to make AI interactions smoother and more intuitive.

  • Natural turn-taking to understand the flow of conversation. Traditional voice systems often struggle with the rhythm of human dialogue, leading to awkward pauses or unnatural interruptions. Conversational AI 2.0 incorporates a state-of-the-art turn-taking model engineered to overcome this. The model analyzes conversational cues in real time, such as the hesitations "um" and "ah", so the agent can tell when to interrupt and when to wait. The result is a fluid, natural dialogue: in a customer service interaction, for example, the agent waits while a user finds information ("Oh, let me just double check. Um...") and then responds promptly. This capability improves the user experience, speeds up task completion, and makes interactions feel more genuinely conversational.
  • Multilingual communication with integrated language detection. Businesses need to communicate across language barriers. Conversational AI 2.0 integrates automatic language detection directly into the agent. This allows the AI to identify the language being spoken by the user and respond appropriately within the same interaction, enabling "seamless multilingual discussions" without requiring manual configuration or user prompts. This feature is invaluable for global enterprises aiming to provide consistent, high-quality service to diverse customer bases, opening doors to broader markets and more inclusive user experiences.
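
To make the turn-taking idea concrete, here is a deliberately simplified sketch. It is not the ElevenLabs turn-taking model, which is a custom model working on conversational cues; it is a toy heuristic that assumes a partial transcript and a silence duration are available, and the names `FILLERS`, `should_wait`, and `max_silence_ms` are hypothetical.

```python
import re

# Hypothetical filler words. A real turn-taking model learns such cues
# rather than matching a fixed list.
FILLERS = {"um", "uh", "ah", "er", "hmm"}


def should_wait(partial_transcript: str, silence_ms: int,
                max_silence_ms: int = 1500) -> bool:
    """Return True if the agent should keep listening instead of replying."""
    words = re.findall(r"[a-z']+", partial_transcript.lower())
    if not words:
        # Nothing has been said yet, so keep listening.
        return True
    ends_with_filler = words[-1] in FILLERS
    # A trailing filler suggests the user is still thinking, so tolerate
    # a pause twice as long before taking the turn.
    limit = max_silence_ms * 2 if ends_with_filler else max_silence_ms
    return silence_ms < limit
```

With these assumed thresholds, a two-second pause after "Oh, let me just double check. Um..." keeps the agent listening, while the same pause after a complete statement lets it respond.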

Knowledge and creativity unleashed

Beyond conversational fluency, intelligence and adaptability are key. Conversational AI 2.0 empowers agents with unprecedented knowledge access and creative flexibility.

  • Integrated RAG: knowledgeable agents, minimum latency, maximum privacy. Retrieval-Augmented Generation (RAG) allows AI models to access and incorporate information from external knowledge sources into their responses. ElevenLabs has integrated this capability directly into the voice agent architecture, enabling retrieval from your specific knowledge base. Crucially, this is achieved with minimum latency and maximum privacy. This unlocks powerful enterprise applications, such as medical assistants retrieving specific treatment guidelines instantly, or support agents accessing the latest product information from internal documentation.
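
For readers new to RAG, the retrieve-then-generate pattern can be sketched in a few lines. This is a generic illustration, not how ElevenLabs implements retrieval: it swaps in simple bag-of-words cosine similarity where a production system would typically use learned embeddings, and `KNOWLEDGE_BASE`, `retrieve`, and `build_prompt` are hypothetical names.

```python
import math
import re
from collections import Counter

# Hypothetical knowledge base; in practice these would be your own documents.
KNOWLEDGE_BASE = [
    "Refunds are issued within 14 days of a return request.",
    "The Model X headset supports Bluetooth 5.3 and USB-C charging.",
    "Support hours are 9am to 5pm on weekdays.",
]


def tokenize(text: str) -> Counter:
    """Lowercase the text and count its alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two token-count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, documents: list[str], top_k: int = 1) -> list[str]:
    """Return the top_k documents most similar to the query."""
    q = tokenize(query)
    ranked = sorted(documents, key=lambda d: cosine(q, tokenize(d)), reverse=True)
    return ranked[:top_k]


def build_prompt(query: str, documents: list[str]) -> str:
    """Place the retrieved context ahead of the question for the language model."""
    context = "\n".join(retrieve(query, documents))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"
```

Calling `build_prompt("How long do refunds take?", KNOWLEDGE_BASE)` yields a prompt whose context is the refund policy line, which is the text a language model would then ground its answer in.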

Streamlining operations

  • Multimodality: Engineering your agents to precisely match the behaviour you need can be challenging work. Doing it twice, once for text agents and once for voice agents, is even harder. ElevenLabs Conversational AI now supports multimodality, so you can create agents that communicate over text, voice, or both at the same time. Crucially, this means your agent only needs to be defined once, reducing the load on your engineering team.
  • Batch calls: Manual outbound calling limits how efficiently organizations can reach large audiences. ElevenLabs has developed Batch Calling for our Conversational AI platform to address this, enabling users to automate and scale their outbound voice communications. Batch Calling initiates multiple outbound calls simultaneously using your Conversational AI agents, which suits use cases such as sending alerts, conducting surveys, or delivering personalized messages to extensive contact lists with greater speed and consistency.
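
The general shape of a batch of simultaneous outbound calls can be sketched with Python's `asyncio`. This is only an illustration of the concurrency pattern under stated assumptions: `place_call` is a hypothetical stand-in that simulates a call rather than contacting any real telephony or ElevenLabs API, and `run_batch` and `max_concurrent` are likewise invented for this example.

```python
import asyncio


async def place_call(phone_number: str, message: str) -> dict:
    """Hypothetical stand-in for a real outbound-call request."""
    await asyncio.sleep(0.01)  # simulate network and call latency
    return {"to": phone_number, "status": "completed", "message": message}


async def run_batch(recipients: dict[str, str], template: str,
                    max_concurrent: int = 2) -> list[dict]:
    """Call every recipient, with at most max_concurrent calls in flight."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def call_one(number: str, name: str) -> dict:
        async with semaphore:
            # Personalize the message for each contact before calling.
            return await place_call(number, template.format(name=name))

    tasks = [call_one(number, name) for number, name in recipients.items()]
    # gather returns results in the same order as the tasks were given.
    return await asyncio.gather(*tasks)
```

A caller would run it with something like `asyncio.run(run_batch({"+15550000001": "Ana"}, "Hello {name}, your appointment is tomorrow."))`. Capping concurrency with a semaphore is one common design choice for staying within provider rate limits.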

Built for the enterprise: trust, security, and scalability

  • Full HIPAA Compliance: Essential for healthcare applications, ensuring patient data privacy and regulatory adherence, directly supporting use cases like the medical RAG example.
  • Enterprise-Grade Security: Implementing comprehensive security measures to protect data and ensure system integrity.
  • Third-Party Integrations: Designed for flexibility, allowing seamless connection with existing enterprise systems and workflows.
  • Optional EU Data Residency: Addressing data sovereignty requirements for organizations operating in or serving the European Union.
  • Industry-Leading Reliability: Engineered for high availability and consistent performance, ensuring agents are dependable for critical business functions.

These features demonstrate a commitment to providing a platform that enterprises can trust for mission-critical deployments.

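Conversational AI 2.0 is significantly better than 1.0

The release of Conversational AI 2.0 comes just four months after the initial version, demonstrating ElevenLabs' commitment to rapid innovation. While V1 set the standard for high-quality conversational voice, V2 makes major advances across multiple dimensions.

This rapid development cycle underscores our dedication to pushing the boundaries of voice AI and delivering value to users quickly.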

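The future is now: get started with Conversational AI 2.0

ElevenLabs Conversational AI 2.0 provides the tools to build truly intelligent, natural, and trustworthy voice agents. From improving customer service to enabling new forms of interactive content and streamlining access to enterprise knowledge, the possibilities are wide open. Explore our documentation, visit our developer portal, or contact our sales team to discover how Conversational AI 2.0 can transform your business.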
