Name: 1. Choosing a Technical Approach
Author: inclusionAI

1. Choosing a Technical Approach

Burn hard subtitles from UTF-8 SRT files using moviepy 2.x with CJK-capable system fonts; tune font size, placement, stroke, and encode settings (bitrate or CRF) to avoid oversized outputs. Documents ffprobe/ffmpeg workflows for inspection, encoding, and batch jobs; troubleshooting for fonts, bitrate, and pacing. Covers voiceover with edge-tts (voice selection, rate/volume/pitch), matching narration length to video with atempo/apad, and multi-scene pacing with breathing room. Targets moviepy 2.x and Python 3.x on macOS, Linux, and Windows.

inclusionAI1,181 estrellas10 abr 2026

Ocupación
Categorías: Medios

1. Choosing a Technical Approach

Recommended: Python moviepy + CJK fonts

Tools: moviepy 2.x
Fonts: System CJK fonts (e.g. STHeiti, Songti, PingFang)
Pros: Cross-platform, supports Chinese, easy styling control
Cons: Slower processing (~40s for an 80s video)

Alternative: FFmpeg + libass (requires rebuild)

Tools: FFmpeg with libass support
Pros: Fast processing
Cons: Requires rebuilding FFmpeg; complex setup

2. Core Code Template

#!/usr/bin/env python3
import re
from moviepy import VideoFileClip, TextClip, CompositeVideoClip

def parse_srt(srt_file):
    """Parse an SRT subtitle file."""
    with open(srt_file, 'r', encoding='utf-8') as f:
        content = f.read()
    
    blocks = content.strip().split('\n\n')
    subtitles = []
    
    for block in blocks:
        lines = block.strip().split('\n')
        if len(lines) >= 3:
            time_line = lines[1]
            match = re.match(r'(\d{2}):(\d{2}):(\d{2}),(\d{3}) --> (\d{2}):(\d{2}):(\d{2}),(\d{3})', time_line)
            if match:
                start_h, start_m, start_s, start_ms, end_h, end_m, end_s, end_ms = match.groups()
                start_time = int(start_h) * 3600 + int(start_m) * 60 + int(start_s) + int(start_ms) / 1000
                end_time = int(end_h) * 3600 + int(end_m) * 60 + int(end_s) + int(end_ms) / 1000
                text = '\n'.join(lines[2:])
                subtitles.append(((start_time, end_time), text))
    
    return subtitles

def make_textclip(txt, font_path, font_size=40):
    """Create a subtitle text clip."""
    return TextClip(
        text=txt,
        font_size=font_size,             # Tune for resolution
        color='white',
        font=font_path,                  # CJK-capable font path
        stroke_color='black',
        stroke_width=2.5,
        method='caption',
        size=(1100, None),               # 1100px width, auto height
        text_align='center'
    )

def add_subtitles(video_path, srt_path, output_path, font_path, font_size=40, bottom_margin=100):
    """Burn hard subtitles into a video."""
    video = VideoFileClip(video_path)
    subtitles = parse_srt(srt_path)
    
    subtitle_clips = []
    for (start, end), text in subtitles:
        txt_clip = make_textclip(text, font_path, font_size)
        txt_clip = txt_clip.with_start(start).with_end(end)
        # Position: pixels from bottom (avoids wrapped lines past the lower edge)
        txt_clip = txt_clip.with_position(('center', video.h - bottom_margin))
        subtitle_clips.append(txt_clip)
    
    final_video = CompositeVideoClip([video] + subtitle_clips)
    
    # Important: cap bitrate to avoid huge files
    # Prefer checking source bitrate first, then ~1.2–1.5× that value
    final_video.write_videofile(
        output_path,
        codec='libx264',
        audio_codec='aac',
        fps=video.fps,
        preset='medium',
        bitrate='600k',      # Tune to source (often 400–800k)
        threads=4
    )
    
    video.close()

# Example usage
if __name__ == '__main__':
    add_subtitles(
        video_path='input_video.mp4',
        srt_path='subtitles.srt',
        output_path='output_video_with_subtitles.mp4',
        font_path='/System/Library/Fonts/STHeiti Medium.ttc',  # macOS
        font_size=40,        # e.g. 40px for 1280×720
        bottom_margin=100    # 100px from bottom
    )

1. Choosing a Technical Approach

inclusionAI1,181 estrellas10 abr 2026

Ocupación
Categorías: Medios

1. Choosing a Technical Approach

Recommended: Python moviepy + CJK fonts

Tools: moviepy 2.x

Fonts: System CJK fonts (e.g. STHeiti, Songti, PingFang)

Pros: Cross-platform, supports Chinese, easy styling control

Cons: Slower processing (~40s for an 80s video)

Alternative: FFmpeg + libass (requires rebuild)

Tools: FFmpeg with libass support

Pros: Fast processing

Cons: Requires rebuilding FFmpeg; complex setup

2. Core Code Template

#!/usr/bin/env python3 import re from moviepy import VideoFileClip, TextClip, CompositeVideoClip def parse_srt(srt_file): """Parse an SRT subtitle file.""" with open(srt_file, 'r', encoding='utf-8') as f: content = f.read() blocks = content.strip().split('\n\n') subtitles = [] for block in blocks: lines = block.strip().split('\n') if len(lines) >= 3: time_line = lines[1] match = re.match(r'(\d{2}):(\d{2}):(\d{2}),(\d{3}) --> (\d{2}):(\d{2}):(\d{2}),(\d{3})', time_line) if match: start_h, start_m, start_s, start_ms, end_h, end_m, end_s, end_ms = match.groups() start_time = int(start_h) * 3600 + int(start_m) * 60 + int(start_s) + int(start_ms) / 1000 end_time = int(end_h) * 3600 + int(end_m) * 60 + int(end_s) + int(end_ms) / 1000 text = '\n'.join(lines[2:]) subtitles.append(((start_time, end_time), text)) return subtitles def make_textclip(txt, font_path, font_size=40): """Create a subtitle text clip.""" return TextClip( text=txt, font_size=font_size, # Tune for resolution color='white', font=font_path, # CJK-capable font path stroke_color='black', stroke_width=2.5, method='caption', size=(1100, None), # 1100px width, auto height text_align='center' ) def add_subtitles(video_path, srt_path, output_path, font_path, font_size=40, bottom_margin=100): """Burn hard subtitles into a video.""" video = VideoFileClip(video_path) subtitles = parse_srt(srt_path) subtitle_clips = [] for (start, end), text in subtitles: txt_clip = make_textclip(text, font_path, font_size) txt_clip = txt_clip.with_start(start).with_end(end) # Position: pixels from bottom (avoids wrapped lines past the lower edge) txt_clip = txt_clip.with_position(('center', video.h - bottom_margin)) subtitle_clips.append(txt_clip) final_video = CompositeVideoClip([video] + subtitle_clips) # Important: cap bitrate to avoid huge files # Prefer checking source bitrate first, then ~1.2–1.5× that value final_video.write_videofile( output_path, codec='libx264', audio_codec='aac', fps=video.fps, preset='medium', bitrate='600k', # Tune to source (often 400–800k) threads=4 ) video.close() # Example usage if __name__ == '__main__': add_subtitles( video_path='input_video.mp4', srt_path='subtitles.srt', output_path='output_video_with_subtitles.mp4', font_path='/System/Library/Fonts/STHeiti Medium.ttc', # macOS font_size=40, # e.g. 40px for 1280×720 bottom_margin=100 # 100px from bottom )

Resolution	Recommended size	Notes
1280×720	40px	HD
1920×1080	60px	Full HD
3840×2160	120px	4K

Voice ID	Gender	Style / use case	Notes
`zh-CN-YunxiNeural`	Male	Storytelling, explainers, short video	Very natural, bright
`zh-CN-YunjianNeural`	Male	Sports, explainers, fast pace	Energetic, punchy
`zh-CN-YunyangNeural`	Male	News, professional	Mature, steady
`zh-CN-XiaoxiaoNeural`	Female	News, fiction, general	Warm, natural

1. Choosing a Technical Approach

1. Choosing a Technical Approach

Recommended: Python moviepy + CJK fonts

Alternative: FFmpeg + libass (requires rebuild)

2. Core Code Template

1. Choosing a Technical Approach

1. Choosing a Technical Approach

Recommended: Python moviepy + CJK fonts

Alternative: FFmpeg + libass (requires rebuild)

2. Core Code Template

3. Key Parameter Settings

3.1 Font choice (critical)

3.2 Font size by resolution

3.3 Position

3.4 Bitrate (avoid oversized files)

4. Common Issues and Fixes

Issue 1: Subtitles show as boxes

Issue 2: Output file size explodes

Issue 3: Wrapped lines extend past the bottom

Issue 4: Subtitles look faint or unclear

5. End-to-End Workflow

Step 1: Prepare the subtitle file

Step 2: Inspect the source video

Step 3: Tune parameters from video metadata

Step 4: Run the burn-in script

Step 5: Validate output

6. Performance Tips

6.1 Speed

6.2 Smaller files

6.3 Quality (CRF)

7. Quick Checklist

8. Reference Commands

List CJK-capable fonts

Video info

Subtitle format / encoding

Batch processing

9. Troubleshooting

moviepy import fails

Font path not found

Encode appears stuck

10. Best Practices Summary

Suggested defaults (1280×720)

Quality vs size

Use cases

11. Voiceover with edge-tts

11.1 Approach

11.2 Example Chinese voices

11.3 Minimal Python example

11.4 Matching audio length to video (FFmpeg)

11.5 Smoother multi-scene narration

Songsee

Video Frames

Gifgrep

Qqbot Media

Camsnap

Openai Whisper Api