This subskill is part of the self-contained video-study-notes skill. Its helper scripts live under subskills/media-transcribe/scripts/.

Use this companion skill when a local audio or video file needs transcription, especially when neither an upstream download nor a local sidecar subtitle file provides usable subtitle text.

Prefer the bundled Python script subskills/media-transcribe/scripts/transcribe_audio.py. Run it from the skill-local .venv managed by uv. If the input is a video file and the parent workflow wants a deterministic copy under <project_root>/audio/, first use scripts/prepare_audio.py.
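As a rough illustration of the invocation described above, the following sketch builds a `uv run` command for the bundled script. The script path comes from the text; the single positional input argument and the list of video suffixes are assumptions, since the script's actual command-line interface is not documented here.

```python
from __future__ import annotations

from pathlib import Path

# Script path taken from the text above.
TRANSCRIBE_SCRIPT = Path("subskills/media-transcribe/scripts/transcribe_audio.py")

# Assumed set of video suffixes; adjust to whatever the parent workflow handles.
VIDEO_SUFFIXES = {".mp4", ".mkv", ".mov", ".webm", ".flv"}


def is_video(path: Path) -> bool:
    """Return True when the file suffix looks like a video container."""
    return path.suffix.lower() in VIDEO_SUFFIXES


def build_transcribe_command(media: Path) -> list[str]:
    """Build the argv list that runs the script inside the uv-managed environment.

    Assumption: the script accepts the input file as one positional argument.
    """
    return ["uv", "run", "python", str(TRANSCRIBE_SCRIPT), str(media)]
```

The resulting list can be passed to `subprocess.run(..., check=True)` from the skill directory, so that uv picks up the skill-local .venv.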

When to use

an upstream probe reports no regular subtitle tracks
a local video has no usable sidecar subtitle text file
the source only exposes danmaku (bullet-comment) XML, not subtitles, and the user wants spoken-content transcription
the user already has a downloaded audio/video file and wants text or subtitles
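The conditions above can be checked mechanically before falling back to transcription. This is a minimal sketch, assuming sidecar subtitles share the video's base name and use one of a few common suffixes; the function names, the suffix list, and the "non-empty text" test for usability are all hypothetical choices, not part of the skill.

```python
from __future__ import annotations

from pathlib import Path

# Assumed sidecar subtitle suffixes; the skill itself does not list them.
SUBTITLE_SUFFIXES = (".srt", ".vtt", ".ass")


def find_usable_sidecar(video: Path) -> Path | None:
    """Return the first sidecar subtitle file next to the video that has text."""
    for suffix in SUBTITLE_SUFFIXES:
        candidate = video.with_suffix(suffix)
        if not candidate.is_file():
            continue
        text = candidate.read_text(encoding="utf-8", errors="ignore")
        if text.strip():  # treat empty or whitespace-only files as unusable
            return candidate
    return None


def needs_transcription(video: Path, subtitle_track_count: int = 0) -> bool:
    """Decide whether to transcribe, given the upstream probe's track count."""
    if subtitle_track_count > 0:
        return False
    return find_usable_sidecar(video) is None
```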

Default approach

Prefer a local audio file as input. If the caller only has a local video file, first create an audio copy in an explicit format with scripts/prepare_audio.py, as described above.
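The deterministic audio copy under <project_root>/audio/ mentioned earlier could be named along the following lines. This is only a sketch: the helper name is hypothetical, and the `.m4a` default is a placeholder assumption, because the skill's actual default format is not stated here.

```python
from __future__ import annotations

from pathlib import Path


def audio_copy_path(project_root: Path, video: Path, suffix: str = ".m4a") -> Path:
    """Map a video file to a stable audio path under <project_root>/audio/.

    The same video name always yields the same target, so repeated runs
    can reuse an existing copy instead of extracting the audio again.
    """
    return project_root / "audio" / (video.stem + suffix)
```

A caller would check whether the returned path already exists and only run the audio preparation step when it does not.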
