Stitching multiple requests
This guide shows you how to maintain voice prosody across multiple text chunks/generations.
This guide shows you how to maintain voice prosody across multiple text chunks/generations.
How-to guide ยท Assumes you have completed the ElevenAPI quickstart.
When converting a large body of text into audio, you may encounter abrupt changes in prosody from one chunk to another. This can be particularly noticeable when converting text that spans multiple paragraphs or sections. In order to maintain voice prosody over multiple chunks, you can use the Request Stitching feature.
This feature allows you to provide context on what has already been generated and what will be generated in the future, helping to maintain a consistent voice and prosody throughout the entire text.
eleven_v3 model.Hereโs an example without Request Stitching:
And the same example with Request Stitching:
Request Stitching is easiest when using the ElevenLabs SDKs.
This guide assumes you have set up your API key and SDK. Complete the quickstart first if you havenโt.
In order to use the request IDs of a previous request for conditioning it needs to have processed completely. In case of streaming this means the audio has to be read completely from the response body.
The difference depends on the model, voice and voice settings used.
The request IDs should be no older than two hours.
Yes, unless you are an enterprise user with increased privacy requirements.