Add text-to-speech voiceover

Generate professional voiceovers from text. Step-by-step using the AI agent or manual panel.


Add narration without recording. Type your script, pick a voice, and get professional audio added directly to your timeline.


What you'll learn

  • Generate voiceover from text
  • Choose from multiple voices
  • Adjust speaking speed
  • Place on timeline

Step 1: Open the TTS panel

  1. Click "TTS" tab in the left sidebar
    • Located between "Sounds" and "Text"
  2. See the text input — large text box ready for input

Step 2: Enter your script

  1. Click in the text box
  2. Type your script (up to 4,096 characters)
  3. Add punctuation — commas and periods help with natural pauses

Example:

"Welcome to this tutorial. Today, I'll show you how to quickly edit videos using AI."


Step 3: Choose a voice

  1. Click "Voice" dropdown
  2. Select a voice:
    • Alloy — Neutral, versatile (default)
    • Echo — Warm, conversational
    • Fable — British, storytelling
    • Onyx — Deep, authoritative
    • Nova — Enthusiastic, bright
    • Shimmer — Clear, informative

Step 4: Set speed

  1. Click "Speed" dropdown or slider
  2. Select:
    • 0.8x — Slower, instructional
    • 1.0x — Normal (default)
    • 1.2x — Quick, energetic
    • 1.5x — Fast

Match to content:

  • Tutorials → 0.9-1.0x
  • TikTok/Reels → 1.2-1.5x
  • Professional → 1.0x

Step 5: Generate audio

  1. Click "Generate" button
  2. Wait for processing — progress bar shows status
  3. Hear a preview — audio plays when ready

Step 6: Add to timeline

  1. Click "Add to timeline"
  2. Voiceover appears on a new audio track
  3. Name shows — starts with "VO " + first few words (e.g., "VO Welcome to...")

Using the Agent instead

For quick generation:

"Generate voiceover: 'Welcome to the tutorial.'"

With voice:

"Generate TTS with Nova voice: 'Check out this amazing feature.'"

With speed:

"Generate voiceover at 1.2x: 'Let's get started.'"


Editing the voiceover

Trim

  1. Click the audio clip on the timeline
  2. Drag edges — left edge for start, right for end
  3. Clips ripple if ripple mode is on

Split

  1. Press Space to pause at split point
  2. Press S — splits clip at playhead
  3. Delete or move half as needed

Adjust volume

  1. Hover over clip — volume line appears
  2. Drag up/down — adjust dB (decibels)
  3. Or use Properties → Audio → Volume

Add fades

  1. Hover over clip corners
  2. Drag the fade handle — top-left for fade in, top-right for fade out
  3. Drag distance = fade duration

Long scripts

For scripts over 4,096 characters:

  1. Split into chunks
  2. Generate first part
  3. Add to timeline
  4. Generate second part
  5. Place after first

Or use Agent:

"Generate voiceover for [first paragraph]" "Now generate for [second paragraph]"


Tips

  • Chunk long content — easier editing control
  • Use punctuation — commas = small pause, periods = big pause
  • Preview voices — generate short test first
  • Match to platform — slower for education, faster for social
  • Mix levels — voice should be louder than background music
  • Add fades — smooth start/end always sounds better

Troubleshooting

| Issue | Fix | |-------|-----| | Generate button grayed | Check text isn't empty; verify TTS config | | Preview won't play | Try adding to timeline anyway; sometimes preview bugs | | Voice sounds wrong | Regenerate with different voice | | Too fast/slow | Change speed and regenerate | | Long scripts fail | Split into <4,096 character chunks | | Audio is quiet | Use Properties panel → Audio → Volume |


See also

Community