Burst pricing
Overview
Burst pricing allows your conversational AI agents to temporarily exceed your workspace’s subscription concurrency limit during high-demand periods. When enabled, your agents can handle up to 3 times your normal concurrency limit (capped at 300 concurrent calls), with calls above the subscription limit charged at double the standard rate.
This feature helps prevent missed calls during traffic spikes while maintaining cost predictability for your regular usage patterns.
How burst pricing works
When burst pricing is enabled for an agent:
- Normal capacity: Calls within your subscription limit are charged at standard rates
- Burst capacity: Additional calls (up to 3x your limit or 300 concurrent calls, whichever is lower) are accepted but charged at 2x the normal rate
- Over-capacity rejection: Calls exceeding the burst limit are rejected with an error
Capacity calculations
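The capacity rule described above (3x the subscription limit or 300 concurrent calls, whichever is lower) can be sketched as a small calculation. The function and constant names below are illustrative only and are not part of any official SDK.

```python
BURST_MULTIPLIER = 3   # burst capacity is up to 3x the subscription limit
BURST_HARD_CAP = 300   # ...or 300 concurrent calls, whichever is lower


def burst_limit(subscription_limit: int) -> int:
    """Maximum concurrent calls accepted when burst pricing is enabled."""
    return min(BURST_MULTIPLIER * subscription_limit, BURST_HARD_CAP)


def classify_call(position: int, subscription_limit: int) -> str:
    """Classify the Nth concurrent call (1-based) as normal, burst, or rejected."""
    if position <= subscription_limit:
        return "normal"      # within the subscription limit, standard rate
    if position <= burst_limit(subscription_limit):
        return "burst"       # accepted, charged at 2x the standard rate
    return "rejected"        # beyond the burst limit


if __name__ == "__main__":
    for limit in (10, 20, 150):
        print(f"subscription limit {limit}: burst limit {burst_limit(limit)}")
```

With this sketch, a 20-call subscription yields a burst limit of 60 concurrent calls, while a 150-call subscription is held to the 300-call cap rather than 450.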
Cost implications
Burst pricing follows a tiered charging model, and burst calls are also processed at lower priority:
- Within subscription limit: Standard per-minute rates apply
- Burst calls: Charged at 2x the standard rate
- Deprioritized processing: Burst calls receive lower priority for speech-to-text and text-to-speech processing
Example pricing scenario
For a workspace with a 20-call subscription limit:
- Calls 1-20: Standard rate (e.g., $0.08/minute)
- Calls 21-60: Double rate (e.g., $0.16/minute)
- Calls 61+: Rejected
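To make the arithmetic in this scenario concrete, here is a rough sketch that estimates the per-minute cost at a given moment of concurrency. It assumes every accepted call is billed per minute at either the standard or the doubled rate; the function name and the $0.08 rate are taken from the example above or invented for illustration, and this is not an official billing formula.

```python
def per_minute_cost(concurrent_calls: int,
                    subscription_limit: int,
                    standard_rate: float) -> float:
    """Estimate the combined per-minute charge for the accepted calls."""
    # Burst ceiling: 3x the subscription limit or 300, whichever is lower.
    # The max() guard is an assumption so a very large subscription limit
    # is never reduced by the 300-call cap.
    ceiling = max(subscription_limit, min(3 * subscription_limit, 300))
    accepted = min(concurrent_calls, ceiling)   # calls beyond this are rejected
    normal = min(accepted, subscription_limit)  # billed at the standard rate
    burst = accepted - normal                   # billed at 2x the standard rate
    return normal * standard_rate + burst * 2 * standard_rate


if __name__ == "__main__":
    for calls in (15, 35, 60, 100):
        cost = per_minute_cost(calls, subscription_limit=20, standard_rate=0.08)
        print(f"{calls} concurrent calls: ${cost:.2f} per minute")
```

For the 20-call workspace, 35 concurrent calls cost about $4.00 per minute (20 at $0.08 plus 15 at $0.16). At 60 or more concurrent calls the estimate stays at about $8.00 per minute, because calls beyond the burst limit are rejected rather than billed.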
Burst calls are deprioritized and may experience higher latency for speech processing, similar to anonymous-tier requests.
Configuration
Burst pricing is configured per agent in the call limits settings.
Dashboard configuration
- Navigate to your agent settings
- Go to the Call Limits section
- Enable the Burst pricing toggle
- Save your agent configuration
API configuration
Burst pricing can also be configured via the API by updating the call limits settings of the agent.
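As a rough illustration of what such an update might look like, the sketch below only builds a JSON request body. The field names (`platform_settings`, `call_limits`, `bursting_enabled`) are assumptions made for this example and are not confirmed by this page, so check the API reference for the actual schema and endpoint before relying on them.

```python
import json


def build_burst_update(enabled: bool) -> dict:
    """Build a hypothetical agent-update body that toggles burst pricing."""
    # NOTE: all keys below are assumed names for illustration only.
    return {
        "platform_settings": {
            "call_limits": {
                "bursting_enabled": enabled,
            }
        }
    }


if __name__ == "__main__":
    # Print the body that would be sent with an agent update request.
    print(json.dumps(build_burst_update(True), indent=2))
```

Keeping the body construction in its own function makes it easy to send the same payload with whichever HTTP client or SDK your integration already uses.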