
Discover Voice Design v3: create unique AI voices with ease. Describe the voice you want, get three options instantly, and start using your pick right away. Built for creators, businesses, and developers.
We're excited to announce Voice Design v3. The new release makes creating voices quicker, easier, and more intuitive than ever before.
Voice Design v3 is ideal for creators, businesses, and developers who need specific voices for their projects. Describe the voice you want, get three options instantly, choose your favorite, and start using it right away.
Give it a try and see how easy voice creation can be.
When we launched Voice Design, the goal was simple: let any creator generate a bespoke voice — no studio booking, no audio library deep dives. Version 3 takes that further, offering more control, faster iteration, and a smoother path from concept to final audio.
Projects often require dozens of voices: a calm narrator, an anxious side character, a few game NPCs, maybe even a talking raccoon. Searching for “something close enough” slows teams down.
Voice Design v3 removes that friction. Type a description, generate three candidates, choose one, and move on — all while paying only for the characters in your prompt, not per sample.
Hit Generate and v3 returns three distinct voices. Keep the one you like—it fills a voice slot—and discard the rest. No queue. No extra cost.
No dropdown forests. No hidden levers. Just results.
Below you’ll find the attributes our research team sees most often in top‑quality results:
| Attribute | Why it matters | Example keywords |
|---|---|---|
| Age | Sets vocal texture and pitch | child, teen, middle-aged, elderly |
| Accent/nationality | Grounds the character in place | thick Australian, light French, neutral American |
| Gender | Guides resonance | male, female, gender-neutral |
| Tone & emotion | Drives delivery | warm, assertive, anxious, joyful |
| Speed | Controls pacing without editing | fast, measured, languid |
| Guidance scale | Balances creativity vs. prompt fidelity | “guidance scale 10” (try 8–12 for accuracy, 3–5 for exploration) |
For a full matrix, see the Prompting guide in docs.
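If you assemble prompts in a script rather than typing them by hand, the attributes above can be combined into one short description. The sketch below is only an illustration: `VoiceBrief`, `build_prompt`, and `GUIDANCE_RANGES` are hypothetical names, not part of any ElevenLabs SDK, and the order in which attributes are joined is an assumption. The guidance-scale ranges are the ones suggested in the table.

```python
from dataclasses import dataclass
from typing import Optional

# Suggested guidance-scale ranges, taken from the attribute table.
# The dictionary itself is a hypothetical convenience, not an API object.
GUIDANCE_RANGES = {
    "accuracy": (8, 12),    # stay close to the prompt
    "exploration": (3, 5),  # allow more creative variation
}


@dataclass
class VoiceBrief:
    """Hypothetical container for the prompt attributes in the table."""
    age: Optional[str] = None
    gender: Optional[str] = None
    accent: Optional[str] = None
    tone: Optional[str] = None
    speed: Optional[str] = None


def build_prompt(brief: VoiceBrief) -> str:
    """Join whichever attributes are set into one short, plain description."""
    # Age and gender describe the speaker, so they lead the prompt.
    subject = " ".join(part for part in (brief.age, brief.gender) if part)
    parts = [f"{subject} voice" if subject else "voice"]
    # The remaining attributes are appended only when provided.
    if brief.accent:
        parts.append(f"{brief.accent} accent")
    if brief.tone:
        parts.append(f"{brief.tone} tone")
    if brief.speed:
        parts.append(f"{brief.speed} pace")
    return ", ".join(parts)


if __name__ == "__main__":
    brief = VoiceBrief(
        age="elderly",
        gender="male",
        accent="thick Australian",
        tone="warm",
        speed="measured",
    )
    prompt = build_prompt(brief)
    print(prompt)
    # The prompt's character count, since usage is described as per prompt character.
    print(len(prompt))
```

Paste the resulting string into the Voice Design prompt box as you would any hand-written description. Keeping each attribute optional lets you leave out anything you have no opinion on, which keeps the prompt short.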
The best prompts read like everyday speech — short, specific, and jargon-free. That principle echoes our own writing style: if a word can be cut, cut it.
Voice Design v3 is live in the ElevenLabs dashboard: Voices → My Voices → Add a new voice → Voice Design
Log in, type a prompt, and click Generate. Soon, you’ll have three new voices — only one of which existed in your head.
See you in the studio.