Burst pricing

Optimize call capacity with burst concurrency to handle traffic spikes.

Overview

Burst pricing allows your conversational AI agents to temporarily exceed your workspace’s subscription concurrency limit during high-demand periods. When enabled, your agents can handle up to 3 times your normal concurrency limit (capped at 300 concurrent calls), with excess calls charged at double the standard rate.

This feature helps prevent missed calls during traffic spikes while maintaining cost predictability for your regular usage patterns.

How burst pricing works

When burst pricing is enabled for an agent:

  1. Normal capacity: Calls within your subscription limit are charged at standard rates
  2. Burst capacity: Additional calls (up to 3x your limit or 300 concurrent calls, whichever is lower) are accepted but charged at 2x the normal rate
  3. Over-capacity rejection: Calls exceeding the burst limit are rejected with an error

Capacity calculations

Subscription limit    Burst capacity    Maximum concurrent calls
10 calls              30 calls          30 calls
50 calls              150 calls         150 calls
100 calls             300 calls         300 calls
200 calls             300 calls         300 calls (capped)

Burst capacity is capped at 300 concurrent calls regardless of your subscription limit.
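
The capacity rule and the three tiers described above can be sketched in a few lines of Python. This is an illustration only: the function and constant names (burst_capacity, classify_call, BURST_MULTIPLIER, BURST_HARD_CAP) are hypothetical and not part of any SDK, and the behavior for subscription limits above 300 is not covered by this page, so it is not modeled here.

```python
# Illustrative sketch; names are hypothetical, not part of any SDK.
BURST_MULTIPLIER = 3   # burst capacity is up to 3x the subscription limit
BURST_HARD_CAP = 300   # and never more than 300 concurrent calls


def burst_capacity(subscription_limit: int) -> int:
    """Return the maximum number of concurrent calls with bursting enabled."""
    return min(BURST_MULTIPLIER * subscription_limit, BURST_HARD_CAP)


def classify_call(position: int, subscription_limit: int) -> str:
    """Classify the Nth concurrent call (1-based) as standard, burst, or rejected."""
    if position <= subscription_limit:
        return "standard"   # charged at the standard rate
    if position <= burst_capacity(subscription_limit):
        return "burst"      # accepted, charged at 2x the standard rate
    return "rejected"       # over the burst limit


# Reproduce the rows of the capacity table.
for limit in (10, 50, 100, 200):
    print(f"{limit} calls -> burst capacity {burst_capacity(limit)} calls")
```

With a 20-call subscription limit, classify_call(21, 20) returns "burst" and classify_call(61, 20) returns "rejected", which matches the tiers listed earlier.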

Cost implications

Burst pricing follows a tiered charging model:

  • Within subscription limit: Standard per-minute rates apply
  • Burst calls: Charged at 2x the standard rate
  • Deprioritized processing: Burst calls receive lower priority for speech-to-text and text-to-speech processing

Example pricing scenario

For a workspace with a 20-call subscription limit:

  • Calls 1-20: Standard rate (e.g., $0.08/minute)
  • Calls 21-60: Double rate (e.g., $0.16/minute)
  • Calls 61+: Rejected

Burst calls are deprioritized and may experience higher latency for speech processing, similar to anonymous-tier requests.
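
To make the arithmetic in this scenario concrete, the following sketch estimates the per-minute cost for a given number of simultaneous calls. It assumes the example figures above (20-call limit, $0.08/minute standard rate); per_minute_cost is a hypothetical helper, not an API function, and it does not model subscription limits above 300 or the latency effects of deprioritization.

```python
from decimal import Decimal


def per_minute_cost(concurrent_calls, subscription_limit, standard_rate):
    """Return (cost per minute, rejected calls) for a number of simultaneous calls.

    Hypothetical helper for illustration; not part of any SDK.
    """
    cap = min(3 * subscription_limit, 300)           # burst capacity
    accepted = min(concurrent_calls, cap)            # calls beyond the cap are rejected
    standard_calls = min(accepted, subscription_limit)
    burst_calls = accepted - standard_calls          # charged at 2x the standard rate
    rejected_calls = concurrent_calls - accepted
    cost = standard_calls * standard_rate + burst_calls * 2 * standard_rate
    return cost, rejected_calls


rate = Decimal("0.08")  # example standard rate from the scenario above

# 35 concurrent calls: 20 at $0.08 + 15 at $0.16 = $4.00 per minute
cost, rejected = per_minute_cost(35, 20, rate)
print(cost, rejected)
```

Decimal is used instead of float so that the currency amounts add up exactly.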

Configuration

Burst pricing is configured per agent in the call limits settings.

Dashboard configuration

  1. Navigate to your agent settings
  2. Go to the Call Limits section
  3. Enable the Burst pricing toggle
  4. Save your agent configuration

API configuration

Burst pricing can also be configured via the API, as shown in the example below:

from dotenv import load_dotenv
from elevenlabs.client import ElevenLabs
import os

load_dotenv()

elevenlabs = ElevenLabs(
    api_key=os.getenv("ELEVENLABS_API_KEY"),
)

# Update agent with burst pricing enabled
response = elevenlabs.conversational_ai.agents.update(
    agent_id="your-agent-id",
    agent_config={
        "platform_settings": {
            "call_limits": {
                "agent_concurrency_limit": -1,  # Use workspace limit
                "daily_limit": 1000,
                "bursting_enabled": True
            }
        }
    }
)