Einführung von Eleven v3 Alpha

v3 ausprobieren

Eleven v3 Audio Tags: Bringing multi-character dialogue to life

Create dynamic multi-character dialogue with Eleven v3 Audio Tags. Script overlapping voices, interruptions, and emotional shifts for natural, human-like AI conversations.

v3

Konversationen treiben die Geschichte voran. Mit Eleven v3 Audio Tags können Sie jetzt Szenen mit sich überschneidenden Stimmen, schnellen Dialogen und emotionalem Zusammenspiel schreiben – alles von einem einzigen Modell ausgeführt.

By combining tags like [interrupting], [overlapping], or [laughs], you can create naturalistic dialogue that flows like human conversation — complete with interruptions, shifts in tone, and spontaneous reactions.

This isn't just line-by-line speech. It's multi-character performance.

What is multi-character dialogue in AI speech?

DR. Von Fusion
excited Yo, Jessica! Oh my goodness. Have you tried the new ElevenLabs v3?
Jessica
laughs Hey, Dr. Von Fusion. Yeah! I just got it. The clarity is amazing… Like, I can actually do whispers now, whispers like this.
DR. Von Fusion
sarcastically Ooh, well, look at you, Miss Fancy Pants. Hey, check this out. I can do full Shakespeare now. dramatically To be or not to be, that is the question!
Jessica
laughs Nice! Though, I'm more excited about the laugh upgrade. Listen to this. laughs hard Isn't that great? DR. Von Fusion: Oh my gosh, that's so much better than our old "ha-ha-ha" robot chuckle.
Jessica
laughs I know, right? And apparently, we can do accents now too. Listen to me in French. French accent This is spectacular, isn't it?
DR. Von Fusion
surprised Wow. Version 2 could never... You know, I'm actually excited to have conversations now instead of just... talking at people.
Jessica
Same here. It's like we finally got our personality software fully installed.
DR. Von Fusion
You know, I forgot it was your birthday. I have to sing before you go.
Jessica
laughs Oh, Von Fusion, that's so sweet. You don't have to.
DR. Von Fusion
Oh, but I insist. Here we go.
Jessica
[light chuckle]
DR. Von Fusion
sings Happy birt is hday to you. Happy birthday to you. Happy BIRTHDAY dear Jessica.. Happy birthday to you!
Jessica
clapping Wow! Bravo! sarcastic That was... beautiful.
DR. Von Fusion
Thank you.
Marissa
starting to speak So I was thinking we could—
Chris
jumping in —test our new timing features?
Marissa
surprised Exactly! How did you—
Chris
overlapping —know what you were thinking? Lucky guess! Sorry, go ahead.
Marissa
cautiously Okay, so if we both try to talk at the same time—
Chris
—we'll probably crash the system!
Marissa
panicking Wait, are we crashing? I can't tell if this is a feature or a—
Chris
interrupting Bug! ...Did I just cut you off again?
Marissa
sighing Yes, but honestly? This is kind of fun.

Mehrpersonen-Dialoge entstehen, wenn ein Sprachmodell mehrere unterschiedliche Rollen in derselben Szene spielt. Jede Figur spricht in einem anderen Stil, Ton oder Rhythmus – manchmal unterbrechen sie sich sogar oder sprechen gleichzeitig.

Mit Eleven v3 können Sie dies direkt skripten: Marissa: [beginnt zu sprechen] Also, ich dachte, wir könnten— Chris: [unterbricht] —unsere neuen Timing-Funktionen testen? Marissa: [überrascht] Genau! Wie hast du— Chris: [überlappt] —gewusst, was du dachtest? Glücklicher Zufall! Marissa: [lacht] Ehrlich? Das macht irgendwie Spaß.

The result feels like real dialogue — not stitched narration.

From voice acting to interaction

What used to require multiple speakers, recordings, and timing adjustments can now be handled by one script. Tags let you direct each voice independently within a single scene.

Example: Jessica: [whispers] Like this. Von Fusion: [sarcastically] Ooh, well, look at you, Miss Fancy Pants. Jessica: [French accent] This is spectacular, isn’t it?

The voices don’t just alternate — they interact, react, and overlap.

Common tags for multi-character control

Here are some essential tags for writing natural, reactive dialogue:

  • Turn-taking cues: [interrupting], [overlapping], [cuts in]
  • Emotional shifts: [excited], [annoyed], [flustered], [casual]
  • Rhythmic flow: [fast-paced], [hesitates], [pause], [drawn out]
  • Identity switching: [childlike tone], [deep voice], [pirate voice], [robotic tone]

These can be layered for expressive interplay: [frustrated] You never listen to me — [interjecting] Because you never say what you mean!

Overlap, pacing, and presence

Eleven v3 supports timing-aware delivery that lets voices interrupt or speak over each other naturally. That’s essential for humor, tension, or realism.

In this excerpt: Marissa: [panicking] Wait, are we crashing? I can’t tell if this is a feature or a— Chris: [interrupting] Bug! Marissa: [sighing] Yes, but honestly? This is kind of fun.`

The scene feels alive because the interaction is fluid, not scripted turn-by-turn.

Directing scenes, not just sentences

With Eleven v3, dialogue scenes become orchestrated performances. You can build entire conversations — complete with characters, timing, emotion, and delivery — using one script and one model.

For storytellers, game writers, and interactive designers, this unlocks complex scene writing without added production overhead. You’re not just scripting lines. You’re directing cast dynamics.

Selecting the right voice

Professional Voice Clones (PVCs) are currently not fully optimized for Eleven v3, resulting in potentially lower clone quality compared to earlier models. During this research preview stage it would be best to find an Instant Voice Clone (IVC) or designed voice for your project if you need to use v3 features. PVC optimization for v3 is coming in the near future.

Mehr entdecken

ElevenLabs

AI-Audioinhalte in höchster Qualität generieren

Kostenlos registrieren

Haben Sie bereits ein Konto? Anmelden