
Eleven v3 Audio Tags: Precision delivery control for AI speech
Fine-grained control over timing, rhythm, and emphasis with Eleven v3 Audio Tags. Transform flat delivery into dynamic, performative content.
Introducing Eleven v3 (alpha)
Try v3Proving the power of speaking to viewers in their own language
In 2023, YouTube introduced its Multi-Language Audio (MLA), letting creators add multiple language tracks that adjust automatically to viewers’ preferences. By September 2024, “hundreds of thousands” of creators gained access, allowing their content to reach more viewers.
In June 2024, we worked with Drew Binsky to explore this feature. Using our Dubbing Studio, Drew released videos in Spanish, Portuguese, Arabic, German, and Italian. The results were clear: dubbed videos drew in up to 1 million extra views from non-English-speaking regions for mature videos (17% of total views) and already make up a significant share of views for newer videos with 300k views (23% of total views) of a total 1.3M views to date.
Dubbed videos show an incremental spike in viewership starting at the end of June. During June and July, videos released with AI dub reached on average 2.6M views per video, while non-dubbed content averaged 1.1M views. YouTube’s MLA rollout was timely, offering creators the chance to upload multiple audio tracks. With our dubbing tool, Drew already had his content localized, making it a perfect fit for this new feature.
For Drew, this let him stay true to his style while reaching audiences worldwide. For us, it reinforced a simple truth: meeting viewers in their own language can turn good stories into global ones.
Dubbing Studio lets you smoothly localize content across 29 languages while keeping the unique tone and style of each speaker. Whether you’re uploading from YouTube, TikTok, or any other platform, our tool maintains your voice and delivery style across languages, allowing you to reach global audiences without losing the essence of your style. For more control, you can manually edit translations, adjust voice settings, and fine-tune audio to make sure words match the on-screen action.
Our work with Drew reinforced some key takeaways for making AI dubbing work.
If you’re interested in learning more about Dubbing Studio, visit the page, and read our guide on how to make the most of it.
Fine-grained control over timing, rhythm, and emphasis with Eleven v3 Audio Tags. Transform flat delivery into dynamic, performative content.
Create dynamic multi-character dialogue with Eleven v3 Audio Tags. Script overlapping voices, interruptions, and emotional shifts for natural, human-like AI conversations.