
Einführung von Eleven v3 (alpha) — das ausdrucksstärkste Text to Speech Modell
Eleven v3 ist das ausdrucksstärkste Text to Speech Modell
Discover Voice Design v3: create unique AI voices with ease. Describe your desired voice, get three instant options, and deploy for creators, businesses, and developers.
We're excited to announce Voice Design v3. The new release makes creating voices quicker, easier, and more intuitive than ever before.
Voice Design v3 is ideal for creators, businesses, and developers who need specific voices for their project. All you have to do is describe your voice, instantly get three options, choose your favorite, and start using it right away.
Give it a try and see how easy voice creation can be.
When we launched Voice Design, the goal was simple: let any creator generate a bespoke voice — no studio booking, no audio library deep dives. Version 3 takes that further, offering more control, faster iteration, and a smoother path from concept to final audio.
Projects often require dozens of voices: a calm narrator, an anxious side character, a few game NPCs, maybe even a talking raccoon. Searching for “something close enough” slows teams down.
Voice Design v3 removes that friction. Type a description, generate three candidates, choose one, and move on — all while paying only for the characters in your prompt, not per sample.
Hit Generate and v3 returns three distinct voices. Keep the one you like—it fills a voice slot—and discard the rest. No queue. No extra cost.
No dropdown forests. No hidden levers. Just results.
Below you’ll find the attributes our research team sees most often in top‑quality results:
Attribute | Why it matters | Example keywords |
---|---|---|
Age | Sets vocal texture and pitch | child, teen, middle-aged, elderly |
Accent/nationality | Grounds the character in place | thick Australian, light French, neutral American |
Gender | Guides resonance | male, female, gender-neutral |
Tone & emotion | Drives delivery | warm, assertive, anxious, joyful |
Speed | Controls pacing without editing | fast, measured, languid |
Guidance scale | Balances creativity vs. prompt fidelity | “guidance scale 10” (try 8–12 for accuracy, 3–5 for exploration) |
For a full matrix, see the Prompting guide in docs.
Die besten Eingaben lesen sich wie Alltagssprache – kurz, spezifisch und ohne Fachjargon. Dieses Prinzip spiegelt unseren eigenen Schreibstil wider: Wenn ein Wort gestrichen werden kann, streichen Sie es.
Voice Design v3 ist im ElevenLabs-Dashboard verfügbar:Voices → My Voices → Add a new voice → Voice Design
Einloggen, geben Sie eine Eingabe ein und klicken Sie auf Generieren. Bald haben Sie drei neue Stimmen – von denen nur eine in Ihrem Kopf existierte.
Wir sehen uns im Studio.
Eleven v3 ist das ausdrucksstärkste Text to Speech Modell
Create dynamic multi-character dialogue with Eleven v3 Audio Tags. Script overlapping voices, interruptions, and emotional shifts for natural, human-like AI conversations.