Skip to content

Text to Speech API - Up To 40% Faster Globally

TTS Faster

We've rolled out multi-region serving for our Text to Speech API. Requests now automatically route to the nearest backend (US, Netherlands, or Singapore) delivering faster time to first byte (TTFB) with no code changes required.

What changed

When you call api.elevenlabs.io, our infrastructure routes to the optimal backend based on your location:

  • Americas: US-Central
  • Europe, Middle East, Africa: Netherlands
  • Asia-Pacific: Singapore

You can verify your serving region via the x-region header in the API response.

Performance

With upgraded GPUs and an optimized inference stack, Flash v2.5 achieves 50ms model time to first byte, and with network routing improvements on top, that leads to large reductions in perceived latency.

Measured TTFB improvements across 11 global locations:

TTFB Reduction
Europe
~100-150ms
Southeast Asia
~150-200ms
India
~100-150ms
Japan
~50-80ms
Australia
~80-120ms

For most international developers, this represents a 20-40% reduction in perceived latency.

Why it matters

For voice agents and real-time applications, 150ms less latency means more natural conversations, better responsiveness, and a consistent experience for users regardless of geography. Combined with Flash v2.5's inference speed, this is the fastest agentic Text to Speech available.

Get started

No migration needed. If you're calling api.elevenlabs.io, global routing is already active.

To opt-out of the global routing and always use USA servers, use the api.us.elevenlabs.io base URL for your API requests.

See our latency optimization guide for additional best practices. Enterprise customers requiring regional data residency can contact sales.


Explore articles by the ElevenLabs team

ElevenLabs

Create with the highest quality AI Audio

Get started free

Already have an account? Log in