Overview
Elevate your projects with this guide to Voiceover Studio.
Similar to the Dubbing Studio, the new Voiceover Studio gives users an opportunity to create their own interactive content, but with a little more freedom. Voiceover Studio combines the audio timeline with our Sound Effects feature, giving you the ability to write a dialogue between any number of speakers, choose those speakers, and intertwine your own creative sound effects anywhere you like.
Creating a Voiceover
To begin, click “Create a new voiceover”. Here you can upload a video or audio file, or create your Voiceover from scratch. After that, it’s as simple as pressing “Create voiceover” - you can name your Voiceover before or after it’s created. Once in the Studio, you will notice it looks very similar to a Dubbing Studio project - and it is - with some notable additions. Let’s briefly revisit the layout.
Timeline
On the bottom half of your screen, you will see the audio timeline. This is a linear representation of your Voiceover project. Each row represents a track, and on the far left section you have the track information for voiceover or SFX tracks. In the middle, you can create the clips that represent when a voice is speaking or a SFX is playing. On the right-hand side, you have the settings for the currently selected clip.
Speaker Cards
In Dubbing Studio, the AI creates the Speaker Cards automatically - in Voiceover Studio, you get to create these on your own! Because of this, your Voiceover Project screen will begin blank after creation, and you will need to first add Tracks and Clips.
Adding Tracks
There are three types of tracks you can add in the studio: Voiceover tracks, SFX tracks and uploaded audio.
- Voiceover Tracks: Voiceover tracks create new Speakers. You can click and add clips on the timeline wherever you like. After creating a clip, start writing your desired text on the speaker cards above and click “Generate”. Similar to Dubbing Studio, you will also see a little cogwheel on each Speaker track - simply click on it to adjust the voice settings or replace any speaker with a voice directly from your VoiceLab - including your own Professional Voice Clone if you have created one.
- SFX Tracks: Add a SFX track, then click anywhere on that track to create a SFX clip. Similar to our independent SFX feature, simply start writing your prompt in the Speaker card above and click “Generate” to create your new SFX audio. You can lengthen or shorten SFX clips and move them freely around your timeline to fit your project - make sure to press the “stale” button if you do so.
- Uploaded Audio: Add an audio track including background music or sound effects. It’s best to avoid uploading audio with speakers, as any speakers in this track will not be detected, so you won’t be able to translate or correct them.
Track Features
Once you’ve created a new Voiceover Track, you will see on the left-hand side of each track that you have a few options. You can also click directly on “New Voiceover Speaker” to rename it to keep yourself more organized.
Click the cog to open the Track Voice Settings. This is where you can change the voice and model used for this Voiceover Track, and adjust the voice settings. If you make changes here before generating audio for the track, the audio will generate with the settings you choose. If you change settings after audio has already been generated for the track, that audio will be labelled “Stale”, and you will need to regenerate it, either by clicking the regenerate icon to regenerate a specific clip, or “Generate Stale Audio” to regenerate all the stale audio in your Voiceover project.
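The stale-audio behaviour described above can be pictured with a small model. This is only an illustrative sketch, not how Voiceover Studio is implemented: the `Clip` and `Track` classes and the `generate_stale_audio` function are hypothetical names invented for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Clip:
    text: str
    has_audio: bool = False  # True once audio has been generated for the clip
    stale: bool = False      # True when the audio no longer matches the track settings


@dataclass
class Track:
    voice: str
    clips: list[Clip] = field(default_factory=list)

    def change_voice(self, new_voice: str) -> None:
        """Change the track voice; clips that already have audio become stale."""
        self.voice = new_voice
        for clip in self.clips:
            if clip.has_audio:
                clip.stale = True


def generate_stale_audio(tracks: list[Track]) -> int:
    """Regenerate every stale clip and return how many were regenerated."""
    regenerated = 0
    for track in tracks:
        for clip in track.clips:
            if clip.stale:
                clip.stale = False  # stand-in for the real audio generation step
                regenerated += 1
    return regenerated


# One clip already has audio, the other has not been generated yet.
track = Track(voice="Narrator", clips=[Clip("Hello", has_audio=True), Clip("Draft line")])
track.change_voice("Another voice")
print([clip.stale for clip in track.clips])  # [True, False]
print(generate_stale_audio([track]))         # 1
```

Only the clip that already had audio is marked stale, which mirrors the point above: settings changed before generation simply apply to the first generation.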
By clicking the small Headphones icon on either a Speaker or SFX track, you can “solo” that track, which mutes all other tracks on playback. If you want to delete a track, simply click the three small dots next to the Headphones icon on the track.
Key Differences from Dubbing Studio
If you chose not to upload a video when you created your Voiceover project, then the entire timeline is yours to work with and there are no time constraints. This differs from Dubbing Studio as it gives you a lot more freedom to create what you want and adjust the timing more easily.
When you Add a Voiceover Track, you will instantly be able to create clips on your timeline. Once you create a Voiceover clip, begin by writing in the Speaker Card above. After generating that audio, you will notice your clip on the timeline will automatically adjust its length based on the text prompt - this is called “Dynamic Generation” (covered in more detail under Dynamic Duration below). This option is also available in Dubbing Studio by right-clicking specific clips, but because syncing is more important with dubbed videos, the default generation type there is “Fixed Generation,” meaning the clips’ lengths are not affected.
Credit Costs
Voiceover Studio does not deduct credits to create your initial project. Credits are deducted every time material is generated. Similar to Speech-Synthesis, credit costs for Voiceover Clips are based on the length of the text prompt. SFX clips will deduct 80 credits per generation.
If you choose to Dub (translate) your Voiceover Project into different languages, this will also cost additional credits depending on how much material needs to be generated. The cost is 1 credit per character for the translation, plus the cost of generating the new audio for the additional languages.
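As a rough worked example of the costs above, the sketch below adds up a project estimate. The 80 credits per SFX generation and 1 credit per character for translation come from this section. The speech rate of 1 credit per character is an assumption made for illustration, as is the idea that translated text is about as long as the original. The function name `estimate_credits` is hypothetical.

```python
SFX_CREDITS_PER_GENERATION = 80   # stated in this section
TRANSLATION_CREDITS_PER_CHAR = 1  # stated in this section
SPEECH_CREDITS_PER_CHAR = 1       # ASSUMPTION: the real rate may differ by model and plan


def estimate_credits(clip_texts, sfx_generations=0, target_languages=0,
                     speech_rate=SPEECH_CREDITS_PER_CHAR):
    """Estimate credits for generating every clip once, plus optional dubbing."""
    chars = sum(len(text) for text in clip_texts)
    speech = chars * speech_rate
    sfx = sfx_generations * SFX_CREDITS_PER_GENERATION
    # Each extra language pays for translation plus newly generated audio.
    # ASSUMPTION: the translated text is roughly as long as the original.
    dubbing = target_languages * (chars * TRANSLATION_CREDITS_PER_CHAR + chars * speech_rate)
    return speech + sfx + dubbing


script = ["Hello there!", "Welcome to the show."]  # 12 + 20 = 32 characters
print(estimate_credits(script, sfx_generations=2, target_languages=1))  # 256
```

Here 32 credits go to speech, 160 to the two SFX generations, and 64 to the single dubbed language. Regenerating a clip deducts credits again, so the real total grows with each retry.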
Translating and Exporting
Similar to Dubbing Studio, after you’ve finished creating your Tracks and Clips and you’ve arranged them on the Timeline, you can click the “plus” icon on the bottom of the page to Dub your Voiceover into different languages. Click to add the desired language(s), and then make sure to generate by pressing “Generate Stale Audio” on the bottom right.
To export your Voiceover Project, simply click “Export” in the bottom right and choose your desired file type. Once the file has been generated, it will be available for download.
Uploading Scripts
With Voiceover Studio, you have the option to upload a script for your project as a CSV file. You can either include speaker name and line, or speaker name, line, start time and end time.
Sample format, speaker and line
Sample format, speaker, line, start time and end time.
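To make the two layouts concrete, here is a minimal sketch that builds both variants with Python's standard `csv` module. The column names (`speaker`, `line`, `start_time`, `end_time`) and the use of seconds for times are assumptions for illustration. Check the sample files in the product for the exact headers and time format it expects.

```python
import csv
import io


def build_script_csv(rows, with_times=False):
    """Return CSV text for a list of script rows given as dicts."""
    # ASSUMPTION: these header names are illustrative, not confirmed by the product.
    fields = ["speaker", "line"]
    if with_times:
        fields += ["start_time", "end_time"]
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    for row in rows:
        writer.writerow({name: row[name] for name in fields})
    return buffer.getvalue()


rows = [
    {"speaker": "Alice", "line": "Hi, are you ready?", "start_time": 0.0, "end_time": 2.5},
    {"speaker": "Bob", "line": "Almost!", "start_time": 3.0, "end_time": 4.0},
]
print(build_script_csv(rows))                   # speaker and line only
print(build_script_csv(rows, with_times=True))  # speaker, line, start and end time
```

Using the `csv` module rather than joining strings by hand matters here because dialogue often contains commas. The writer quotes such lines automatically so they stay in a single column.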
Once your script has been imported, make sure to assign voices to each speaker before you generate the audio. To do this, click the cog icon in the information for each track, on the left.
If you don’t specify start and end times for your clips, Voiceover Studio will estimate how long each clip will be, and distribute them along your timeline.
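How Voiceover Studio makes this estimate is not described here. Purely to illustrate the idea, the sketch below guesses each clip's length from its character count and places the clips one after another. The speaking rate of 15 characters per second, the half-second gap, and the function name `layout_clips` are all assumptions.

```python
def layout_clips(lines, chars_per_second=15.0, gap_seconds=0.5):
    """Return (start, end) times in seconds for each line, placed in script order."""
    placements = []
    cursor = 0.0
    for text in lines:
        # ASSUMPTION: duration grows linearly with the number of characters.
        duration = len(text) / chars_per_second
        placements.append((round(cursor, 2), round(cursor + duration, 2)))
        cursor += duration + gap_seconds  # leave a short pause before the next clip
    return placements


print(layout_clips(["Hello there!", "Welcome to the show."]))
```

Whatever the real heuristic is, the placements are only a starting point: once audio is generated, the actual clip lengths replace the estimates.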
Dynamic Duration
By default, Voiceover Studio uses Dynamic Duration, which means that the length of the clip will vary depending on the text input and the voice used. This ensures that the audio sounds as natural as possible, but it means that the length of the clip might change after the audio has been generated. You can easily reposition your clips along the timeline once they have been generated to get a natural sounding flow. If you click “Generate Stale Audio”, or use the generate button on the clip, the audio will be generated using Dynamic Duration.
This also applies if you do specify the start and end time for your clips. The clips will generate based on the start time you specify, but if you use the default Dynamic Duration, the end time is likely to change once you generate the audio.
Fixed Duration
If you need the clip to remain the length specified, you can choose to generate with Fixed Duration instead. To do this, right-click the clip and select “Generate Audio Fixed Duration”. This will adjust the length of the generated audio to fit the specified length of the clip. This could lead to the audio sounding unnaturally quick or slow, depending on the length of your clip.
To generate multiple clips at once, use shift + click to select several clips for a speaker, then right-click one of them and select “Generate Audio Fixed Duration” for all selected clips.
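The trade-off between the two modes comes down to simple arithmetic, sketched below. The function names are hypothetical, and the sketch does not claim to show how the Studio fits audio internally. It only shows why a clip much shorter or longer than the natural reading time sounds rushed or slow.

```python
def dynamic_end_time(start_seconds, natural_seconds):
    """Dynamic Duration: the start stays put, the end follows the natural audio length."""
    return start_seconds + natural_seconds


def fixed_duration_speed(natural_seconds, clip_seconds):
    """Fixed Duration: speed factor needed to fit the natural-length audio into the clip."""
    if natural_seconds <= 0 or clip_seconds <= 0:
        raise ValueError("durations must be positive")
    return natural_seconds / clip_seconds


# A line that would naturally take 6 seconds, placed in a clip starting at 10 seconds.
print(dynamic_end_time(10.0, 6.0))     # 16.0: the clip now ends at 16 seconds
print(fixed_duration_speed(6.0, 4.0))  # 1.5: squeezed into 4 seconds, 1.5x faster
print(fixed_duration_speed(6.0, 8.0))  # 0.75: stretched over 8 seconds, slower
```

A factor close to 1.0 means the clip length already suits the text. The further it drifts from 1.0, the more likely the result is to sound unnatural, so it can help to resize the clip or edit the text before generating with Fixed Duration.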