For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Connect
BlogHelp CenterAPI PricingSign up
OverviewElevenCreativeElevenAgentsElevenAPIReception AIAPI referenceChangelog
OverviewElevenCreativeElevenAgentsElevenAPIReception AIAPI referenceChangelog
  • Get started
    • Quickstart
    • Agents Quickstart
    • Choosing the right model
  • Tutorials
    • Text to Speech
    • Speech to Text
    • Speech Engine
    • Music
    • Text to Dialogue
    • Voice Changer
    • Voice Isolator
    • Dubbing
    • Sound effects
    • Forced Alignment
  • Concepts
    • Understanding audio streaming
    • Understanding latency
    • Voice cloning
  • How-to guides
  • Reference
    • Libraries & SDKs
    • Errors
    • Agent tooling
    • Webhooks
    • Zero Retention Mode
    • Breaking changes policy
    • UI components
    • Example projects
    • Next.js template
    • Showcase
  • Private deployment
    • Overview
LogoLogo
Login
Login
Connect
BlogHelp CenterAPI PricingSign up
On this page
  • Using the Forced Alignment API
  • Next steps
Tutorials

Forced Alignment quickstart

Learn how to use the Forced Alignment API to align text to audio.
Was this page helpful?
Previous

Understanding audio streaming

Why streaming audio generation is different from streaming files, and what that means for your application.
Next
Built with

This guide will show you how to use the Forced Alignment API to align text to audio.

Using the Forced Alignment API

1

Create an API key

Create an API key in the dashboard here, which you’ll use to securely access the API.

Store the key as a managed secret and pass it to the SDKs either as a environment variable via an .env file, or directly in your app’s configuration depending on your preference.

.env
1ELEVENLABS_API_KEY=<your_api_key_here>
2

Install the SDK

We’ll also use the dotenv library to load our API key from an environment variable.

1pip install elevenlabs
2pip install python-dotenv
3

Make the API request

Create a new file named example.py or example.mts, depending on your language of choice and add the following code:

1# example.py
2import os
3from io import BytesIO
4from elevenlabs.client import ElevenLabs
5import requests
6from dotenv import load_dotenv
7
8load_dotenv()
9
10elevenlabs = ElevenLabs(
11 api_key=os.getenv("ELEVENLABS_API_KEY"),
12)
13
14audio_url = (
15 "https://storage.googleapis.com/eleven-public-cdn/audio/marketing/nicole.mp3"
16)
17response = requests.get(audio_url)
18audio_data = BytesIO(response.content)
19
20# Perform the text-to-speech conversion
21transcription = elevenlabs.forced_alignment.create(
22 file=audio_data,
23 text="With a soft and whispery American accent, I'm the ideal choice for creating ASMR content, meditative guides, or adding an intimate feel to your narrative projects."
24)
25
26print(transcription)
4

Execute the code

1python example.py

You should see the transcript of the audio file with exact timestamps printed to the console.

Next steps

Speech to Text

Transcribe audio to text without requiring an existing transcript

Text to Speech

Generate the audio from text to use with forced alignment

API reference

Explore all Forced Alignment parameters and response formats