Dubbing Studio
Fine-grained control over your dubs.
Fine-grained control over your dubs.


Speaker cards show the original transcription and translation (if you add one) of dialogue from the source video. You can click ‘Transcribe Audio’ to retranscribe the original speech, or click the arrow to re-translate an existing transcription.
Both transcriptions and translations can be edited freely - just click inside a speaker card and start typing to edit the text.
You can see the name of each speaker in the top left of the speaker card. To change the name of a speaker or reassign a clip to a different speaker, you’ll need to use the Timeline.
The timeline contains many important elements of Dubbing Studio, covered in more detail in different sections below:
There are 3 main ways to navigate the timeline:

NOTE: By default, all regenerations in Dubbing Studio are Fixed Generations, which means that the system will keep the duration of the clip fixed regardless of how much text it contains. This can lead to speech speeding up or slowing down significantly if you adjust the length of a clip without changing the text, or if you add/remove a large number of words to a clip.
Consider a clip with the phrase ‘I’m doing well.’ If that clip were set to last 10 seconds and the audio were generated using Fixed Generations, the speech would sound slow and drawn out.
Alternatively, you can use Dynamic Generations by right clicking a segment and selecting it from the options. This will attempt to adjust the length of the clip to the length of the text and make the audio sound more natural.
But be careful – using Dynamic Generations could affect sync and timing in your videos. If, for example, you select Dynamic Generation for a clip with many words in it, and there is not enough room before the next clip for it to properly expand, the audio may not generate properly.

Stale audio refers to audio that needs to be regenerated for one of many reasons (clip length changes, settings changes, transcription/translation changes, etc). You can regenerate stale clips individually or click ‘Generate Stale Audio’ to bulk generate all stale audio clips.
You can right click a clip and select ‘Clip History’ to view previous generations and select the one that sounds best.

As you split and merge clips, the speaker cards above the timeline will update to reflect these changes.
To reassign a clips to a different speaker, click the segment and drag it to another track.

Use the action buttons at the bottom of the timeline to add new audio tracks

To select the voice that will be used to generate audio on a specific speaker track, click the settings cog icon on the left side of the timeline near the speaker name.
There are 3 main types of voices to choose from in Dubbing Studio:
You can also create, save, and reuse a voice from a specific clip by right clicking the clip and selecting ‘Create Voice from Selection.‘
You can set voice settings at two levels:
Track Level - changes will apply across all clips in the track, which can help with stability and consistency.
Clip Level - changes will only apply to a specific clip. To set clip-level settings, use the panel on the right side of the timeline. Disable the ‘inherit track settings’ toggle and configure your desired settings.

Click ‘Export’ in the bottom right of Dubbing Studio to open the export menu.
Dubbing Studio currently supports the following export formats:
Make sure you select the correct language when exporting.
In cases where you already have an accurate dubbing script prepared and want to ensure your Dubbing Studio project sticks to your exact clips and speaker assignment, you can use the Manual Dub option during creation.
To create a Manual Dub, you’ll need:
The CSV file must strictly follow the predefined format in order to be processed correctly. Please see below for samples in the three supported timecodes: