Convert narration audio plus a slide deck into a narrated video. Use when the user has audio-only `mp4/m4a/mp3/wav` and a `ppt/pptx/pdf` deck, and needs slide images, transcript extraction, slide timing planning, or final `mp4` rendering with `whisper-cpp` and `ffmpeg`.
Use this skill when the source video has narration audio but no usable slide visuals, and the final deliverable should be a slide-based lecture video.
Resolve bundled scripts relative to this skill directory. If the runtime has already opened this `SKILL.md`, prefer paths like `scripts/extract_slide_outline.py` and `scripts/render_from_timing_csv.py` instead of machine-specific absolute paths.
1. **Inventory inputs.**
   - Audio: `mp4`/`m4a`/`mp3`/`wav`.
   - Deck: `ppt`/`pptx`, `pdf`, and any pre-rendered slide images.
   - Prefer a `pdf` or image directory for rendering. Treat `pptx` as the source of slide text and as a fallback for export.
2. **Prepare tools.**
   - `ffmpeg`, `ffprobe`, `pdftoppm`.
   - `whisper-cli` from `whisper-cpp` plus a multilingual model such as `ggml-small.bin`.
   - If only a `pptx` exists and no `pdf`/images exist, prefer Keynote or PowerPoint export on macOS. Use `soffice` only as a fallback because profile or rendering issues are common.
3. **Produce slide images.**
   - If a `pdf` exists, render it to images:

     ```bash
     pdftoppm -png -r 200 "$PDF" "$OUTDIR/slide"
     ```

   - If only a `pptx` exists, export it to `pdf` or slide images with Keynote or PowerPoint, then continue from the `pdf`.
   - Name the images `slide-01.png`, `slide-02.png`, ...
4. **Extract slide text.**
   ```bash
   python3 scripts/extract_slide_outline.py \
     --pptx "$PPTX" \
     --out "$WORKDIR/slide_outline.csv"
   ```
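The outline CSV's exact columns are defined by the bundled script; a minimal reader sketch, assuming a hypothetical `slide,title,body` layout, shows how the per-slide text feeds timing analysis:

```python
import csv
import io

# Hypothetical slide_outline.csv layout; the bundled script defines the real columns.
sample = io.StringIO(
    "slide,title,body\n"
    "1,Welcome,Agenda and speaker intro\n"
    "2,Architecture,Services and data flow\n"
)

# Index rows by slide number; titles and body text give keywords
# to match against the transcript when planning slide timings.
outline = {int(row["slide"]): row for row in csv.DictReader(sample)}
print(outline[2]["title"])  # Architecture
```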
5. **Extract clean audio for ASR.**
   - If the source is `mp4`, extract a 16 kHz mono wav:

     ```bash
     ffmpeg -y -i "$AUDIO_MP4" -ar 16000 -ac 1 -c:a pcm_s16le "$WORKDIR/audio.wav"
     ```

   - If the source is `wav`/`mp3`/`m4a`, convert it to the same mono wav form if needed.
6. **Transcribe with `whisper-cli`.**
   ```bash
   whisper-cli -ng \
     -m "$MODEL" \
     -f "$WORKDIR/audio.wav" \
     -l zh \
     -ocsv -osrt -of "$WORKDIR/transcript"
   ```
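The resulting `transcript.csv` can be parsed for the timing step. A sketch, assuming the common `whisper-cpp` CSV layout of `start,end,text` with millisecond timestamps (check your build's actual header):

```python
import csv
import io

# Sample rows in the shape whisper-cpp's -ocsv output typically takes
# (times in milliseconds, text quoted); verify against your build.
sample = io.StringIO(
    "start,end,text\n"
    '0,14800," Welcome everyone, today we cover the system architecture."\n'
    '14800,99500," Let us start with an overview of the services."\n'
)

# Convert to (start_sec, end_sec, text); segment boundaries are the
# natural candidates for slide-change times in slide_timings.csv.
segments = [
    (int(row["start"]) / 1000.0, int(row["end"]) / 1000.0, row["text"].strip())
    for row in csv.DictReader(sample)
]
print(segments[1][0])  # 14.8
```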
   - Use `transcript.csv` for downstream parsing; `transcript.srt` is useful for manual review.
   - Use `-ng` to force CPU mode.
7. **Build `slide_timings.csv`.**
   - Use these columns:

     ```
     slide,start_sec,end_sec,duration_sec,reason
     1,0.000,15.000,15.000,opening title and agenda
     2,15.000,100.000,85.000,architecture overview starts here
     ```

   - Ensure `duration_sec = end_sec - start_sec` for every row.
   - Ensure the final `end_sec` matches the audio duration or is within a small tolerance.
8. **Render the final video.**
   ```bash
   python3 scripts/render_from_timing_csv.py \
     --images "$SLIDE_IMAGES_DIR" \
     --timings "$WORKDIR/slide_timings.csv" \
     --audio "$WORKDIR/audio.wav" \
     --output "$OUT_VIDEO"
   ```
   The script generates an ffconcat file, validates timing continuity, and calls `ffmpeg` to encode the final `mp4`.
9. **Verify and iterate.**
   - Check the output duration and audio sync with `ffprobe`.
   - If slide boundaries are off, adjust `slide_timings.csv` and rerun the render script.
   - Keep iterating on `slide_timings.csv` until the boundaries match the narration.

Install dependencies on macOS if missing:
```bash
brew install ffmpeg poppler whisper-cpp
```
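Whether the tools are already present can be checked before installing. A small sketch (the `missing_tools` helper is hypothetical; `whisper-cli` is the binary name the `whisper-cpp` formula installs):

```python
import shutil

def missing_tools(tools, which=shutil.which):
    """Return the subset of tools not found on PATH (which is injectable for testing)."""
    return [t for t in tools if which(t) is None]

# Binaries this workflow shells out to.
required = ["ffmpeg", "ffprobe", "pdftoppm", "whisper-cli"]
print(missing_tools(required))  # anything listed here still needs installing
```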
Typical multilingual model download:
```bash
mkdir -p .models
curl -L 'https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-small.bin' -o .models/ggml-small.bin
```
Bundled scripts:

- `scripts/extract_slide_outline.py`: extract slide text from `pptx` into CSV or JSON for timing analysis.
- `scripts/render_from_timing_csv.py`: validate a timing CSV, generate an ffconcat file, and render the final video with `ffmpeg`.
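The timing-continuity rules above (contiguous segments, `duration_sec = end_sec - start_sec`, final `end_sec` near the audio duration) can be sketched as follows; the bundled script's exact checks and tolerances may differ:

```python
import csv
import io

def check_timings(rows, audio_dur, tol=0.5):
    """Collect violations of the slide_timings.csv rules (hypothetical tolerances)."""
    problems = []
    prev_end = 0.0
    for r in rows:
        start, end, dur = (float(r[k]) for k in ("start_sec", "end_sec", "duration_sec"))
        if abs(start - prev_end) > 1e-3:
            problems.append(f"slide {r['slide']}: gap/overlap at {start}")
        if abs((end - start) - dur) > 1e-3:
            problems.append(f"slide {r['slide']}: duration_sec mismatch")
        prev_end = end
    if abs(prev_end - audio_dur) > tol:
        problems.append(f"final end_sec {prev_end} != audio duration {audio_dur}")
    return problems

# The example rows from the timing step above validate cleanly.
sample = io.StringIO(
    "slide,start_sec,end_sec,duration_sec,reason\n"
    "1,0.000,15.000,15.000,opening title and agenda\n"
    "2,15.000,100.000,85.000,architecture overview starts here\n"
)
print(check_timings(list(csv.DictReader(sample)), audio_dur=100.0))  # []
```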