Convert Text to Voice

POST
/v1/text-to-voice/create-previews

Generate voices from a single text prompt.

Query parameters

output_formatenumOptionalDefaults to mp3_44100_192

Output format of the generated audio. Must be one of: mp3_22050_32 - output format, mp3 with 22.05kHz sample rate at 32kbps. mp3_44100_32 - output format, mp3 with 44.1kHz sample rate at 32kbps. mp3_44100_64 - output format, mp3 with 44.1kHz sample rate at 64kbps. mp3_44100_96 - output format, mp3 with 44.1kHz sample rate at 96kbps. mp3_44100_128 - default output format, mp3 with 44.1kHz sample rate at 128kbps. mp3_44100_192 - output format, mp3 with 44.1kHz sample rate at 192kbps. Requires you to be subscribed to Creator tier or above. pcm_16000 - PCM format (S16LE) with 16kHz sample rate. pcm_22050 - PCM format (S16LE) with 22.05kHz sample rate. pcm_24000 - PCM format (S16LE) with 24kHz sample rate. pcm_44100 - PCM format (S16LE) with 44.1kHz sample rate. Requires you to be subscribed to Pro tier or above. ulaw_8000 - μ-law format (sometimes written mu-law, often approximated as u-law) with 8kHz sample rate. Note that this format is commonly used for Twilio audio inputs.

Request

This endpoint expects an object.
voice_descriptionstringRequired>=20 characters<=1000 characters

Description to use for the created voice.

textstringRequired>=100 characters<=1000 characters

Text to generate, text length has to be between 100 and 1000.

auto_generate_textbooleanOptional

Whether to automatically generate a text suitable for the voice description.

Response

Successful Response

previewslist of objects
textstring