Burst pricing | ElevenLabs Documentation

Overview

Burst pricing allows your conversational AI agents to temporarily exceed your workspace’s subscription concurrency limit during high-demand periods. When enabled, your agents can handle up to 3 times your normal concurrency limit (capped at 300 concurrent calls), with excess calls charged at double the standard rate.

This feature helps prevent missed calls during traffic spikes while maintaining cost predictability for your regular usage patterns.

How burst pricing works

When burst pricing is enabled for an agent:

Normal capacity: Calls within your subscription limit are charged at standard rates
Burst capacity: Additional calls (up to 3x your limit or 300 concurrent calls, whichever is lower) are accepted but charged at 2x the normal rate
Over-capacity rejection: Calls exceeding the burst limit are rejected with an error

Capacity calculations

Subscription limit	Burst capacity	Maximum concurrent calls
10 calls	30 calls	30 calls
50 calls	150 calls	150 calls
100 calls	300 calls	300 calls
200 calls	300 calls	300 calls (capped)

Burst capacity is capped at 300 concurrent calls regardless of your subscription limit.
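
The rules and table above can be sketched as a small calculation. This is only an illustration of the arithmetic, not part of the ElevenLabs SDK: the names burst_capacity and classify_call are hypothetical, and the sketch assumes calls are counted in the order they arrive.

```python
BURST_MULTIPLIER = 3   # burst capacity is 3x the subscription limit
BURST_CAP = 300        # ...but never more than 300 concurrent calls


def burst_capacity(subscription_limit: int) -> int:
    """Maximum concurrent calls with burst pricing enabled (hypothetical helper)."""
    return min(subscription_limit * BURST_MULTIPLIER, BURST_CAP)


def classify_call(call_number: int, subscription_limit: int) -> str:
    """Classify the Nth concurrent call (1-indexed) as standard, burst, or rejected."""
    if call_number <= subscription_limit:
        return "standard"   # within the subscription limit, standard rate
    if call_number <= burst_capacity(subscription_limit):
        return "burst"      # accepted, charged at 2x the normal rate
    return "rejected"       # over the burst limit


if __name__ == "__main__":
    for limit in (10, 50, 100, 200):
        print(limit, burst_capacity(limit))
```

Running the loop reproduces the rows of the table, including the capped 200-call case.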

Cost implications

Burst pricing follows a tiered charging model, with a processing-priority trade-off for burst calls:

Within subscription limit: Standard per-minute rates apply
Burst calls: Charged at 2x the standard rate
Deprioritized processing: Burst calls receive lower priority for speech-to-text and text-to-speech processing

Example pricing scenario

For a workspace with a 20-call subscription limit:

Calls 1-20: Standard rate (e.g., $0.08/minute)
Calls 21-60: Double rate (e.g., $0.16/minute)
Calls 61+: Rejected
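
To make the scenario concrete, the following sketch estimates the combined per-minute charge for a given number of simultaneous call attempts. The function name per_minute_cost is hypothetical, the $0.08 rate is only the example figure used above, and the sketch assumes a subscription limit at or below the 300-call cap, since behavior for larger limits is not described here.

```python
from decimal import Decimal

BURST_MULTIPLIER = 3    # burst capacity is 3x the subscription limit
BURST_CAP = 300         # hard cap on concurrent calls
BURST_RATE_FACTOR = 2   # burst calls cost 2x the standard rate


def per_minute_cost(attempted_calls: int, subscription_limit: int,
                    standard_rate: Decimal) -> Decimal:
    """Estimate the total per-minute charge across all accepted calls."""
    ceiling = min(subscription_limit * BURST_MULTIPLIER, BURST_CAP)
    accepted = min(attempted_calls, ceiling)          # calls beyond the ceiling are rejected
    standard_calls = min(accepted, subscription_limit)
    burst_calls = accepted - standard_calls
    return (standard_calls * standard_rate
            + burst_calls * standard_rate * BURST_RATE_FACTOR)


if __name__ == "__main__":
    rate = Decimal("0.08")  # example rate only
    # 20 calls at $0.08 plus 40 burst calls at $0.16
    print(per_minute_cost(60, 20, rate))
```

Decimal is used instead of float so that the currency arithmetic stays exact.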

Burst calls are deprioritized and may experience higher latency for speech processing, similar to anonymous-tier requests.

Configuration

Burst pricing is configured per agent in the call limits settings.

Dashboard configuration

Navigate to your agent settings
Go to the Call Limits section
Enable the Burst pricing toggle
Save your agent configuration

API configuration

Burst pricing can also be enabled via the API by setting bursting_enabled in the agent's call limits, as shown in the example below:

from dotenv import load_dotenv
from elevenlabs.client import ElevenLabs
import os

load_dotenv()

elevenlabs = ElevenLabs(
    api_key=os.getenv("ELEVENLABS_API_KEY"),
)

# Update agent with burst pricing enabled
response = elevenlabs.conversational_ai.agents.update(
    agent_id="your-agent-id",
    agent_config={
        "platform_settings": {
            "call_limits": {
                "agent_concurrency_limit": -1,  # Use workspace limit
                "daily_limit": 1000,
                "bursting_enabled": True
            }
        }
    }
)