Auto-regenerate is live in Projects
Our long-form text editor now lets you regenerate faulty fragments, adjust playback speed, and provide quality feedback.
In November, we announced our newest and fastest model, which generates speech at ≈400 ms latency (plus network latency) and is more than twice as fast as our V1 models.
Unfortunately, users found that it struggled to pronounce long numbers. Listen to this generation of "The current stock price for NVIDIA is $867.49.":
Today, we released improved number pronunciation for our Turbo v2 model. Here's the pronunciation after the change:
Thank you to all the users who submitted the feedback that inspired this fix. Please continue to share areas where our models can be improved.