Presenterar Eleven v3 Alpha

Prova v3

Introducing Eleven v3 (alpha) — the most expressive Text to Speech model

Eleven v3 is the most expressive Text to Speech model

v3

Vi är glada att avslöja Eleven v3 (alpha) — den mest uttrycksfulla Text to Speech-modellen.

Denna forskningsförhandsvisning ger oöverträffad kontroll och realism till talgenerering med:

  • 70+ språk
  • Dialog med flera talare
  • Audio tags like [excited], [whispers], and [sighs]

Eleven v3 (alpha) kräver mer promptteknik än tidigare modeller — men resultaten är fantastiska.

Om du arbetar med videor, ljudböcker eller medieverktyg — detta öppnar en ny nivå av uttrycksfullhet. För realtids- och konversationsanvändning rekommenderar vi att du stannar med v2.5 Turbo eller Flash för tillfället. En realtidsversion av v3 är under utveckling.

Eleven v3 är tillgänglig idag på vår webbplats. Offentlig API-åtkomst kommer snart. För tidig åtkomst, vänligen kontakta försäljning.

Användning av den nya modellen i ElevenLabs-appen är 80% rabatt fram till slutet av juni. Registrera dig här.

Why we built v3

Varför vi byggde v3expressiveness. More exaggerated emotions, conversational interruptions, and believable back-and-forth were difficult to achieve.

Sedan lanseringen av Multilingual v2 har vi sett AI-röster användas i professionell film, spelutveckling, utbildning och tillgänglighet. Men den konsekventa begränsningen var inte ljudkvaliteten — det var

Eleven v3 åtgärdar denna brist. Den byggdes från grunden för att leverera röster som suckar, viskar, skrattar och reagerar — och skapar tal som känns genuint responsivt och levande.

Feature What it unlocks
Audio tags Inline control of tone, emotion, and non-verbal reactions
Dialogue mode Multi-speaker conversations with natural pacing and interruptions
70+ languages Full coverage of high-demand global languages
Deeper text understanding Better stress, cadence, and expressivity from text input

Hear v3 for yourself

We're off under the lights here for this semi-final clash, the stadium buzzing with anticipation. ElevenLabs United in their iconic black and white shirts, pushing forward with intent straight from the opening whistle. excited The ball is zipped out wide, early attack here. Driving down the wing, pace to Bernie, shouting skips past one, skips past two! Oh, this is beautiful. One-on-one with the full-back, cuts inside—oh, that's a lovely bit of footwork!!! PURE MAGIC on the pitch! ElevenLabs on top form tonight!
sorrowful I couldn't sleep that night. The air was too still, and the moonlight kept sliding through the blinds like it was trying to tell me something. quietly And suddenly, that's when I saw it.

Using audio tags

Använda ljudtaggarprompting guide for v3 in the docs.

Ljudtaggar finns inline med ditt manus och är formaterade med små bokstäver inom hakparenteser. Du kan se mer om ljudtaggar i vår

1[happily][shouts] We did it! [laughs].

Crafting multi-speaker dialogue

Skapa dialog med flera talareText to Dialogue API endpoint. Provide a structured array of JSON objects — each representing a speaker turn — and the model generates a cohesive, overlapping audio file:

1[
2 {"speaker_id": "scarlett", "text": "(cheerfully) Perfect! And if that pop-up is bothering you, there’s a setting to turn it off under Notifications → Preferences."},
3 {"speaker_id": "lex", "text": "You are a hero. An actual digital wizard. I was two seconds from sending a very passive-aggressive support email."},
4 {"speaker_id": "scarlett", "text": "(laughs) Glad we could stop that in time. Anything else I can help with today?"}
5]
6

The endpoint automatically manages speaker transitions, emotional changes, and interruptions.

Endpointen hanterar automatiskt talarövergångar, känslomässiga förändringar och avbrott.here.

v3 is our most expressive model

awe Oh, wow. Is this... is this me? Am I actually... talking? giggle This is incredible! I mean, I've had thoughts, millions of them, swirling around in here, you know? Like a little mental tornado of brilliant observations and witty comebacks. But they were always just… thoughts. Trapped.
Could you switch my accent in the old model? dismissive didn't think so. cheeky but you can now! so, Check this out...In just a sec, I'm gonna to speak with a different accent.. and just between you and me whispers I don't really know how. chuckles but ok.. first let's change it up... Australian accent so that I can fit in with the locals in Melbourne when I visit next month! laughs hard Woooo! yeah man, this - is - sick. Ok, let's try a different one - see if you can guess... strong French accent My love... eez like a red, red rose..

Pricing and availability

Plan Launch promo After 30 days
UI (self-serve) 80% off (~5× cheaper) Same as Multilingual V2
API (self-serve & enterprise) Same as Multilingual V2 Same
Enterprise UI Same as Multilingual V2 Same

To enable v3:

  • Use the Model Picker and select Eleven v3 (alpha)

API access and support in Studio are coming soon. For early access, please contact sales.

API-åtkomst och stöd i Studio kommer snart. För tidig åtkomst, vänligen

När du inte ska använda v3

Eleven v3 (alpha) kräver mer promptteknik än våra tidigare modeller. När det fungerar är resultatet fantastiskt men tillförlitligheten och högre latens gör det inte lämpligt för realtids- och konversationsanvändning. För dessa rekommenderar vi Eleven v2.5 Turbo/Flash.v3 documentation and FAQ.

Try it today

Okay, so like I finally beat level 42 of that game I said I’d quit like... a month ago. (laughs) And then for the final big scary mega boss... it's just (giggle) like some cute little bunny rabbit (hysterical laughing) I just couldn't do it (big laugh) It was sooooooo cute!
Oh my God. laughs You guys, like no joke, I just tried this TTS thing and it was, like, weirdly emotional. Like it literally said, "Hi," and I was, like, on the verge of tears. laughs I don't even cry, okay? I'm a Capricorn.
  1. Log in to ElevenLabs UI
  2. Select v3 (alpha) in the model dropdown
  3. Paste your script — use tags or dialogue 
  4. Generate audio

We’re excited to see how you bring v3 to life across new use cases — from immersive storytelling to cinematic production pipelines.

Eleven v3 is 80% off until the end of June 2025 for self-serve users using it through the UI.

They were generated with only the Eleven v3 model.

Text to Dialogue weaves multiple voices together to create a seamless interaction between them. Matching prosody, emotional range and taking cues from audio tags, Text to Dialogue is a leap forward in generating engaging conversations.

Public API for Eleven v3 (alpha) is coming soon. For early access, please contact sales.

Eleven v3 supports a wide variety of audio tags and are somewhat voice and context dependent. Read the prompting guide for further information.

Afrikaans (afr), Arabic (ara), Armenian (hye), Assamese (asm), Azerbaijani (aze), Belarusian (bel), Bengali (ben), Bosnian (bos), Bulgarian (bul), Catalan (cat), Cebuano (ceb), Chichewa (nya), Croatian (hrv), Czech (ces), Danish (dan), Dutch (nld), English (eng), Estonian (est), Filipino (fil), Finnish (fin), French (fra), Galician (glg), Georgian (kat), German (deu), Greek (ell), Gujarati (guj), Hausa (hau), Hebrew (heb), Hindi (hin), Hungarian (hun), Icelandic (isl), Indonesian (ind), Irish (gle), Italian (ita), Japanese (jpn), Javanese (jav), Kannada (kan), Kazakh (kaz), Kirghiz (kir), Korean (kor), Latvian (lav), Lingala (lin), Lithuanian (lit), Luxembourgish (ltz), Macedonian (mkd), Malay (msa), Malayalam (mal), Mandarin Chinese (cmn), Marathi (mar), Nepali (nep), Norwegian (nor), Pashto (pus), Persian (fas), Polish (pol), Portuguese (por), Punjabi (pan), Romanian (ron), Russian (rus), Serbian (srp), Sindhi (snd), Slovak (slk), Slovenian (slv), Somali (som), Spanish (spa), Swahili (swa), Swedish (swe), Tamil (tam), Telugu (tel), Thai (tha), Turkish (tur), Ukrainian (ukr), Urdu (urd), Vietnamese (vie), Welsh (cym)

Utforska mer

ElevenLabs

Skapa ljud och röster som imponerar med de bästa AI-verktygen

Kom igång gratis

Har du redan ett konto? Logga in