Create Forced Alignment
Force align an audio file to text. Use this endpoint to get the timing information for each character and word in an audio file based on a provided text transcript.
Headers
xi-api-key
Request
This endpoint expects a multipart form containing a file.
file
The file to align. All major audio formats are supported. The file size must be less than 1GB.
text
The text to align with the audio. The input text can be in any format, however diarization is not supported at this time.
enabled_spooled_file
If true, the file will be streamed to the server and processed in chunks. This is useful for large files that cannot be loaded into memory. The default is false.
Response
Successful Response
characters
List of characters with their timing information.
words
List of words with their timing information.