Using pronunciation dictionaries
How-to guide · Assumes you have completed the ElevenAPI quickstart.
Overview
Pronunciation dictionaries allow you to customize how your AI agent pronounces specific words or phrases. This is particularly useful for:
- Correcting pronunciation of names, places, or technical terms
- Ensuring consistent pronunciation across conversations
- Customizing regional pronunciation variations
ElevenLabs supports both IPA and CMU alphabets.
Phoneme tags only work with eleven_flash_v2 & eleven_monolingual_v1 models. If you use phoneme
tags with other models, they will silently skip the word.
Phoneme tags (IPA/CMU) only work for English. For other languages, use Alias tags instead, which replace words with alternative spellings or phrases that produce the desired pronunciation.
Quickstart
This guide assumes you have set up your API key and SDK. Complete the quickstart first if you haven’t.
Create a pronunciation dictionary file
In this example, we will create a pronunciation dictionary file for the word tomato.
This rule will use the “IPA” alphabet and update the pronunciation for tomato and Tomato with a different pronunciation. PLS files are case sensitive which is why we include it both with and without a capital “T”.
You can use AI tools like Claude or ChatGPT to help generate IPA or CMU notations for specific words.