ElevenLabs-hosted LLMs now available in Agents Platform

Launch faster, more capable, and more efficient voice agents using co-located open-source LLMs hosted by ElevenLabs.

hosted llms

Advancing real-time conversational performance

We’re introducing ElevenLabs-hosted LLMs in our agents platform, enabling faster, more capable, and more efficient voice agents.

By hosting open-source models directly within our infrastructure, we deliver ultra-low latency and reduced reasoning cost, and our customers can now deploy voice agents without relying on additional providers.

Built for reasoning and responsiveness

With GLM 4.5 Air, ElevenLabs Agents achieve top-tier reasoning accuracy and tool-calling performance at roughly one-third the cost of alternatives.

For lighter reasoning tasks, Qwen3-30b-a3b delivers sub-150ms Time To First Sentence, enabling fluid, natural dialogue experiences.

Comparing ElevenLabs-hosted LLMs with State of the Art proprietary models

comps

The benefits of co-located architecture

Our hosted LLMs operate alongside proprietary Speech to Text, Text to Speech, and turn-taking models within a single environment. This unified architecture reduces latency, improves reliability, and enhances data security.

Try it today

ElevenLabs-hosted LLMs are now available in Agents Platform.

Learn more about our LLM offering in our docs.

Explore articles by the ElevenLabs team

Product
workflows

Introducing Agent Workflows

Workflows, our visual editor for designing complex conversation flows in agents platform, is now live.

Product
ElevenLabs Agent Testing

Introducing Tests for ElevenLabs Agents

Ensure reliability and compliance with ElevenLabs Agents Testing. Run structured simulations for tool calls, human transfers, workflows, and guardrails. Integrate into CI/CD and ship agents with confidence.

ElevenLabs

Create with the highest quality AI Audio

Get started free

Already have an account? Log in