Create Forced Alignment

Force align an audio file to text. Use this endpoint to get the timing information for each character and word in an audio file based on a provided text transcript.

Headers

xi-api-keystringRequired

Request

This endpoint expects a multipart form containing a file.
filefileRequired

The file to align. All major audio formats are supported. The file size must be less than 1GB.

textstringRequired

The text to align with the audio. The input text can be in any format, however diarization is not supported at this time.

enabled_spooled_filebooleanOptionalDefaults to false

If true, the file will be streamed to the server and processed in chunks. This is useful for large files that cannot be loaded into memory. The default is false.

Response

Successful Response

characterslist of objects

List of characters with their timing information.

wordslist of objects

List of words with their timing information.

Errors