Flexible pricing for your needs

Flash / Turbo
Text to Speech
Price per 1K characters
- Ultra-low latency (~75ms)
- 32 languages supported
- 40,000 character limit

Multilingual v2 / v3
Text to Speech
Price per 1K characters
- Low latency (~250-300ms)
- High quality voice generation
- 32 languages supported
- 40,000 character limit

Scribe v1 / v2
Speech to Text
Price per hour
- Over 98% transcription accuracy
- Keyterm prompting
- 90+ languages supported
- Dynamic audio tagging

Scribe v2 Realtime
Speech to Text
Price per hour
- Low latency (~150ms)
- 90+ languages supported
- Precise word-level timestamps
- Realtime transcription

Music
Music Generation
Price per minute
- 5 minute duration limit
- Commercial use licensing on Starter+ plans
- 44.1kHz, 128-192kbps audio

Voice Isolator
Audio Processing
Price per minute
- Removes ambient sounds, reverb, and interference
- WAV, MP3, FLAC, OGG and AAC audio inputs
- Files up to 500MB/1 hour long

Voice Changer
Audio Processing
Price per minute
- Fast real-time processing
- 10,000+ human-like voices
- 70+ languages supported

Sound Effects
Audio Generation
Price per generation
- Generate custom sound effects
- Royalty-free
- MP3 (44.1kHz) or WAV (48kHz) output

Dubbing v1
Dubbing
Price per minute
- Automatic speaker detection
- 29 languages supported
- MP3, MP4, WAV, and MOV formats

Prices exclude all taxes, levies and duties.

Calculate your costs based on your usage needs

Pay as you go
Simple pricing based on usage, tailored to each of our flagship models.
- No commitment, cancel anytime
- Access to all ElevenLabs products and models
- Pay only for what you use
- Scales with your usage

Startup Grants Program
Build intelligent, real-time conversational AI agents into your new product or startup for free with an ElevenLabs Grant.
- 12 months free
- 33,000,000 characters
- High concurrency limits and improved support

Enterprise Plan
Custom solution for large organizations.
- Custom terms & assurance around DPA/SLAs
- Custom SSO & Priority support
- BAAs for HIPAA customers, and more

Model pricing

Text to Speech API
Generate speech from text with our high-quality models

| Price per 1K characters | Characters included |
| --- | --- |
| $0.05 | 20,000 |
| $0.05 | 120,000 |
| $0.05 | 440,000 |
| $0.05 | 1,980,000 |
| $0.05 | 5,980,000 |
| $0.05 | 19,800,000 |

| Price per 1K characters | Characters included |
| --- | --- |
| $0.10 | 10,000 |
| $0.10 | 60,000 |
| $0.10 | 220,000 |
| $0.10 | 990,000 |
| $0.10 | 2,990,000 |
| $0.10 | 9,900,000 |

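
The figures above pair a per-1K-character price with an included character allowance. As a rough sketch of how the two might combine, the Python below assumes that only characters beyond the allowance are billed, pro rata at the per-1K rate. That billing rule and the function name `tts_overage_cost` are assumptions made for illustration; this page does not state them.

```python
from decimal import Decimal


def tts_overage_cost(chars_used: int, chars_included: int, price_per_1k: Decimal) -> Decimal:
    """Estimate the cost of characters beyond the included allowance.

    Assumption (not stated on the pricing page): usage inside the
    allowance costs nothing extra, and overage is billed pro rata.
    """
    billable_chars = max(0, chars_used - chars_included)
    return Decimal(billable_chars) / Decimal(1000) * price_per_1k


# 150,000 characters against the 120,000-character row at $0.05 per 1K.
print(tts_overage_cost(150_000, 120_000, Decimal("0.05")))  # 1.50
```

`Decimal` is used instead of `float` so that currency amounts do not pick up binary rounding noise.
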
Speech to Text API
Transcribe audio in real time or in bulk

| Price per hour | Entity detection (per hour) | Keyterm prompting (per hour) | Hours included |
| --- | --- | --- | --- |
| $0.22 | $0.070 | $0.050 | 4 hours 30 minutes |
| $0.22 | $0.070 | $0.050 | 27 hours |
| $0.22 | $0.070 | $0.050 | 100 hours |
| $0.22 | $0.070 | $0.050 | 450 hours |
| $0.22 | $0.070 | $0.050 | 1,359 hours |
| $0.22 | $0.070 | $0.050 | 4,500 hours |

| Price per hour | Hours included |
| --- | --- |
| $0.39 | 2 hours 30 minutes |
| $0.39 | 15 hours |
| $0.39 | 56 hours |
| $0.39 | 254 hours |
| $0.39 | 767 hours |
| $0.39 | 2,538 hours |

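
The $0.22 rows list separate hourly prices for entity detection and keyterm prompting. A minimal sketch of an estimate, assuming those two prices are optional surcharges added on top of the base hourly rate and that audio is billed pro rata by duration (both are assumptions, as is the helper name `stt_cost`):

```python
from decimal import Decimal

# Hourly prices taken from the $0.22 rows of the Speech to Text pricing above.
BASE_PER_HOUR = Decimal("0.22")
ENTITY_DETECTION_PER_HOUR = Decimal("0.070")
KEYTERM_PROMPTING_PER_HOUR = Decimal("0.050")


def stt_cost(audio_seconds: int, entity_detection: bool = False,
             keyterm_prompting: bool = False) -> Decimal:
    """Estimate transcription cost for a given audio duration.

    Assumption: add-ons are additive per-hour surcharges and billing is
    pro rata by the second. Neither is confirmed by the pricing page.
    """
    rate = BASE_PER_HOUR
    if entity_detection:
        rate += ENTITY_DETECTION_PER_HOUR
    if keyterm_prompting:
        rate += KEYTERM_PROMPTING_PER_HOUR
    hours = Decimal(audio_seconds) / Decimal(3600)
    # Round to four decimal places for display; the precision is a presentational choice.
    return (rate * hours).quantize(Decimal("0.0001"))


# Ten hours of audio with both add-ons enabled.
print(stt_cost(10 * 3600, entity_detection=True, keyterm_prompting=True))  # 3.4000
```
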
Music
Turn text prompts into music

| Price per minute | Minutes included | Cost per Finetune |
| --- | --- | --- |
| $0.300 | 3 min | $1.50 |
| $0.300 | 16 min | $1.50 |
| $0.300 | 62 min | $1.50 |
| $0.300 | 304 min | $1.50 |
| $0.300 | 1,100 min | $1.50 |
| $0.300 | 4,800 min | $1.50 |

Voice Isolator
Remove background noise from audio

| Price per minute | Minutes included |
| --- | --- |
| $0.120 | 8.3 min |
| $0.120 | 50 min |
| $0.120 | 183 min |
| $0.120 | 825 min |
| $0.120 | 2,492 min |
| $0.120 | 8,250 min |

Voice Changer
Transform voice characteristics

| Price per minute | Minutes included |
| --- | --- |
| $0.120 | 8.3 min |
| $0.120 | 50 min |
| $0.120 | 183 min |
| $0.120 | 825 min |
| $0.120 | 2,492 min |
| $0.120 | 8,250 min |

Sound Effects
Generate sound effects from text descriptions

| Price per minute | Number of included generations |
| --- | --- |
| $0.120 | 8 |
| $0.120 | 150 |
| $0.120 | 605 |
| $0.120 | 3,000 |
| $0.120 | 9,000 |
| $0.120 | 30,000 |

Dubbing v1
Automatically dub audio and video content

| Price per minute (with watermark) | Included minutes (with watermark) | Price per minute (without watermark) | Included minutes (no watermark) |
| --- | --- | --- | --- |
| $0.33 | 2.53 min | $0.50 | |
| $0.33 | 18 min | $0.50 | 12 min |
| $0.33 | 67 min | $0.50 | 44 min |
| $0.33 | 300 min | $0.50 | 198 min |
| $0.33 | 906 min | $0.50 | 598 min |
| $0.33 | 3,000 min | $0.50 | 1,980 min |
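
The two per-minute dubbing prices above mean a fixed budget buys about 1.5 times as many watermarked minutes as unwatermarked ones, which is close to the ratio between the two included-minute figures (for example, 18 min versus 12 min). The sketch below works that out for a hypothetical budget; the helper name `minutes_for_budget` and the $6.00 figure are illustrative assumptions, not values from this page.

```python
from decimal import Decimal, ROUND_DOWN

# Per-minute dubbing prices from the pricing above.
WITH_WATERMARK_PER_MIN = Decimal("0.33")
WITHOUT_WATERMARK_PER_MIN = Decimal("0.50")


def minutes_for_budget(budget_usd: Decimal, price_per_minute: Decimal) -> Decimal:
    """Return how many minutes a budget covers, rounded down to 0.01 min."""
    return (budget_usd / price_per_minute).quantize(Decimal("0.01"), rounding=ROUND_DOWN)


budget = Decimal("6.00")  # hypothetical budget, chosen only for illustration
print(minutes_for_budget(budget, WITH_WATERMARK_PER_MIN))     # 18.18
print(minutes_for_budget(budget, WITHOUT_WATERMARK_PER_MIN))  # 12.00
```
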