Voice design
Create a voice from a text prompt.
Headers
xi-api-key
Query parameters
output_format
The output format of the generated audio.
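As a rough sketch of how the header and query parameter above fit together, the snippet below builds the URL and headers without sending anything. The endpoint URL is a hypothetical placeholder, the `Content-Type` header is an assumption based on the request being an object, and `mp3_44100_128` is only an assumed example value for `output_format`.

```python
from urllib.parse import urlencode

# Hypothetical placeholder: substitute the real endpoint URL from the API reference.
ENDPOINT_URL = "https://api.example.com/voice-design"


def build_url_and_headers(api_key, output_format=None):
    """Return (url, headers) for a voice design request."""
    headers = {
        "xi-api-key": api_key,  # header name taken from the Headers section
        "Content-Type": "application/json",  # assumed, since the body is an object
    }
    url = ENDPOINT_URL
    if output_format is not None:
        # output_format travels in the query string, not in the request body
        url = f"{url}?{urlencode({'output_format': output_format})}"
    return url, headers


# "mp3_44100_128" is an assumed example value, not taken from this page.
url, headers = build_url_and_headers("YOUR_API_KEY", output_format="mp3_44100_128")
print(url)
```

Keeping the key in a header and the format in the query string mirrors the way the two are listed separately here.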
Request
This endpoint expects an object.
voice_description
Description to use for the created voice.
text
Text to generate. Its length must be between 100 and 1000 characters.
auto_generate_text
Whether to automatically generate text suitable for the voice description.
loudness
Controls the volume level of the generated voice. -1 is quietest and 1 is loudest; 0 corresponds to roughly -24 LUFS.
quality
Higher quality results in better voice output but less variety.
seed
Random number that controls the voice generation. The same seed with the same inputs produces the same voice.
guidance_scale
Controls how closely the AI follows the prompt. Lower numbers give the AI more freedom to be creative, while higher numbers force it to stick more closely to the prompt. High numbers can cause the voice to sound artificial or robotic. We recommend using longer, more detailed prompts at a lower guidance scale.
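The request fields above can be assembled into a single object. The following is a minimal sketch, not an official client: the helper name `build_voice_design_body` is hypothetical, the text length is assumed to be counted in characters, the loudness range of -1 to 1 is inferred from its description, and omitted fields are assumed to fall back to server-side defaults. No limits are checked for `quality`, `seed`, or `guidance_scale`, because this page does not state any.

```python
import json


def build_voice_design_body(voice_description, text=None, auto_generate_text=False,
                            loudness=None, quality=None, seed=None,
                            guidance_scale=None):
    """Assemble the request object, checking only the limits this page states."""
    # Length is assumed to be measured in characters.
    if text is not None and not 100 <= len(text) <= 1000:
        raise ValueError("text length must be between 100 and 1000")
    # Range inferred from "-1 is quietest, 1 is loudest".
    if loudness is not None and not -1 <= loudness <= 1:
        raise ValueError("loudness must be between -1 and 1")

    body = {
        "voice_description": voice_description,
        "auto_generate_text": auto_generate_text,
    }
    optional = {
        "text": text,
        "loudness": loudness,
        "quality": quality,
        "seed": seed,
        "guidance_scale": guidance_scale,
    }
    # Leave out anything the caller did not set (assumed to use server defaults).
    body.update({key: value for key, value in optional.items() if value is not None})
    return body


body = build_voice_design_body(
    "A calm, middle-aged narrator with a warm tone",
    auto_generate_text=True,
    loudness=0,
    seed=42,
)
print(json.dumps(body, indent=2))
```

Fixing `seed` while keeping every other input unchanged is how a caller would reproduce the same voice across requests.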
Response
Successful Response
previews
The previews of the generated voices.
text
The text used to preview the voices.
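A successful response can be unpacked along these lines. This is a sketch under assumptions: the helper name `parse_voice_design_response` is hypothetical, `previews` is assumed to be a JSON array, and the `id` field in the sample payload is made up for illustration, since this page does not list the fields of a preview.

```python
import json


def parse_voice_design_response(raw):
    """Return (previews, text) from a JSON response body."""
    data = json.loads(raw)
    previews = data.get("previews")
    text = data.get("text")
    # Check only the two top-level fields documented in the Response section.
    if not isinstance(previews, list) or not isinstance(text, str):
        raise ValueError("response must contain a 'previews' list and a 'text' string")
    return previews, text


# Illustrative payload; the "id" field inside each preview is hypothetical.
sample = (
    '{"previews": [{"id": "example-1"}, {"id": "example-2"}],'
    ' "text": "Sample preview text."}'
)
previews, text = parse_voice_design_response(sample)
print(len(previews), text)
```

Each preview is treated as an opaque object here, so the sketch keeps working whatever fields the real previews carry.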