Voice remixing

Learn how to transform and enhance existing voices by modifying their attributes including gender, accent, style, pacing, audio quality, and more.

Overview

ElevenLabs voice remixing is available on the core platform and via the API. It transforms an existing voice by letting you modify its core attributes while keeping the characteristics that make it recognizable. This is particularly useful for adapting voices to different contexts, creating variations for different characters, or improving or otherwise changing the audio quality of existing voice profiles.

As an example, here is an original voice:

And here is a remixed version, switching to a San Francisco accent:

Usage

The voice remixing model allows you to iteratively transform voices you own by adjusting multiple attributes through natural language prompts and customizable settings.

Key Features

  • Attribute Modification: Change gender, accent, speaking style, pacing, and audio quality of any voice you own
  • Iterative Editing: Continue refining voices based on previously remixed versions
  • Script Flexibility: Use default scripts or input custom scripts with v3 model audio tags like [laughs] or [whispers]
  • Prompt Strength Control: Adjust remix intensity from low to high for precise control over transformations

Remixing parameters

Prompt Strength

Voice remixing offers varying degrees of prompt strength to control how much your voice transforms:

  • Low: Subtle changes that maintain most of the original voice characteristics
  • Medium: Balanced transformation that modifies key attributes while preserving voice identity
  • High: Strong adherence to the remix prompt; may significantly change the tonality of the original voice
  • Max: Full transformation of the voice, at the cost of losing the original voice identity
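
In client code, the four levels can be modelled as an enumeration so that an invalid value is rejected before any request is made. The following is a minimal sketch only: `PromptStrength`, `RemixRequest`, `parse_strength`, and the payload field names are hypothetical placeholders for illustration, not the ElevenLabs API schema, and no network call is made.

```python
from dataclasses import dataclass
from enum import Enum


class PromptStrength(Enum):
    # Level names follow the list above; the string values are an assumption.
    LOW = "low"
    MEDIUM = "medium"
    HIGH = "high"
    MAX = "max"


@dataclass(frozen=True)
class RemixRequest:
    """Hypothetical container for one remix attempt (not an official schema)."""

    voice_id: str
    prompt: str
    strength: PromptStrength = PromptStrength.MEDIUM

    def to_payload(self) -> dict:
        # Field names are placeholders; consult the API reference for the real ones.
        return {
            "voice_id": self.voice_id,
            "prompt": self.prompt,
            "prompt_strength": self.strength.value,
        }


def parse_strength(label: str) -> PromptStrength:
    """Turn user input such as 'High' into a PromptStrength, or raise ValueError."""
    try:
        return PromptStrength(label.strip().lower())
    except ValueError:
        allowed = ", ".join(s.value for s in PromptStrength)
        raise ValueError(
            f"unknown prompt strength {label!r}; expected one of: {allowed}"
        ) from None
```

A reasonable workflow under this sketch is to start at a low strength and move up only if the remix does not follow the prompt closely enough, since higher levels trade away more of the original voice.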

Script Options

  • Default Scripts: Pre-configured scripts optimized for voice remixing
  • Custom Scripts: Input your own text with support for v3 model audio tags such as:
    • [laughs] - Add laughter
    • [whispers] - Convert to whispered speech
    • [sighs] - Add sighing
    • Additional emotion and style tags are also supported and can help shape the voice
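
When preparing a custom script, it can help to see which audio tags it contains and what text will actually be spoken. The sketch below is an illustration, assuming tags are lowercase words in square brackets as in the examples above; `extract_tags` and `strip_tags` are hypothetical helper names, and the pattern is not an official grammar for v3 tags.

```python
import re

# Assumed tag shape: lowercase words inside square brackets, e.g. [laughs].
TAG_PATTERN = re.compile(r"\[([a-z][a-z ]*)\]")

# Only the tags named in this document; other supported tags exist.
DOCUMENTED_TAGS = {"laughs", "whispers", "sighs"}


def extract_tags(script: str) -> list[str]:
    """Return the tag names in the order they appear in the script."""
    return TAG_PATTERN.findall(script)


def strip_tags(script: str) -> str:
    """Return the spoken text only, with tags removed and whitespace collapsed."""
    return " ".join(TAG_PATTERN.sub(" ", script).split())


script = "[whispers] I have a secret. [laughs] Just kidding!"
tags = extract_tags(script)
spoken = strip_tags(script)
# Tags not listed in this document are worth double-checking before use.
unlisted = [tag for tag in tags if tag not in DOCUMENTED_TAGS]
```

Because pricing depends on the length of the test script, a helper like `strip_tags` also gives a quick way to inspect the spoken text separately from the tags; whether tags count toward the billed length is not stated here.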

Supported Voice Formats

Input

  • Any cloned voice that you personally own (Instant Voice Clone or Professional Voice Clone)
  • Voices created through our Voice Design product

Output

  • Full-quality v3 voice model that remains backward compatible with all other models
  • Iteratively editable voice that can be further remixed

Key facts

  • Pricing: Based on the length of the test script used during the remixing process
  • Eligible voices: Only voices you own or have explicit permissions for can be remixed
  • Original voice: Remixing creates a new voice variant; the original voice is not modified
  • Iteration limit: No limit on iterative remixing; each remix can serve as the base for the next
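
The last two points (each remix produces a new voice, and any remix can be the base for the next) describe a chain of variants. A minimal local model of that chain is sketched below, purely for illustration: `Voice`, `remix`, and `lineage` are hypothetical names, the generated ids are made up, and `remix` here is a stand-in that performs no real voice processing or API call.

```python
from __future__ import annotations

import itertools
from dataclasses import dataclass
from typing import Optional

# Counter used only to give each illustrative remix a distinct id.
_remix_counter = itertools.count(1)


@dataclass(frozen=True)
class Voice:
    voice_id: str
    description: str
    parent: Optional[Voice] = None  # None for an original (non-remixed) voice


def remix(base: Voice, prompt: str) -> Voice:
    """Local stand-in for a remix: returns a new Voice and leaves `base` untouched."""
    new_id = f"{base.voice_id}-r{next(_remix_counter)}"
    return Voice(voice_id=new_id, description=prompt, parent=base)


def lineage(voice: Voice) -> list[str]:
    """Walk back to the original voice and return ids from oldest to newest."""
    chain = []
    node: Optional[Voice] = voice
    while node is not None:
        chain.append(node.voice_id)
        node = node.parent
    return chain[::-1]


original = Voice("narrator", "original cloned voice")
slower = remix(original, "slower pacing")
accented = remix(slower, "San Francisco accent")
```

Keeping a reference to the parent makes it easy to return to an earlier variant if a later remix drifts too far from the voice you started with.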