Using pronunciation dictionaries
Overview
Pronunciation dictionaries allow you to customize how your AI agent pronounces specific words or phrases. This is particularly useful for:
- Correcting pronunciation of names, places, or technical terms
- Ensuring consistent pronunciation across conversations
- Customizing regional pronunciation variations
ElevenLabs supports both IPA and CMU alphabets.
Phoneme tags only work with eleven_flash_v2, eleven_turbo_v2 & eleven_monolingual_v1 models.
If you use phoneme tags with other models, they will silently skip the word.
Phoneme tags (IPA/CMU) only work for English. For other languages, use Alias tags instead, which replace words with alternative spellings or phrases that produce the desired pronunciation.
Quickstart
Create an API key
Create an API key in the dashboard here, which you’ll use to securely access the API.
Store the key as a managed secret and pass it to the SDKs either as a environment variable via an .env file, or directly in your app’s configuration depending on your preference.
Create a pronunciation dictionary file
In this example, we will create a pronunciation dictionary file for the word tomato.
This rule will use the “IPA” alphabet and update the pronunciation for tomato and Tomato with a different pronunciation. PLS files are case sensitive which is why we include it both with and without a capital “T”.
You can use AI tools like Claude or ChatGPT to help generate IPA or CMU notations for specific words.
Next steps
To learn more about pronunciation dictionaries, please refer to the API reference.