Transcribe long-form audio, YouTube videos, podcasts, interviews, or panels; summarize them; extract chapter markers; and draft publishing assets like titles, YouTube descriptions, show notes, and tweet/X copy. Use when a user shares a YouTube link or audio/video file and asks for transcription, diarization, summaries with timestamps, chapter markers, show notes, titles, descriptions, promo copy, or social posts. Especially use for Whisper-based workflows, source-audio-first podcast processing, and any request to turn a recorded conversation into publishable assets.
Use this skill for podcast or interview publishing workflows.
Fetch source audio first.
Use yt-dlp to extract audio.
Transcribe with a fallback ladder.
OPENAI_API_KEY is available.
bash rather than failing on permissions.
Keep the user updated during long jobs.
Produce the publishing pack.
Package timestamps correctly.
0:00 Intro and origin story
3:30 How the brand grew
When a user wants publishing help, default to a reusable launch pack.
YouTube titles
YouTube descriptions
X main post
X follow-up reply
Native clip ideas
Quote card ideas
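The chapter-marker format shown earlier (0:00 Intro and origin story, 3:30 How the brand grew) can be generated from raw second offsets. The following is a minimal sketch, not part of the skill itself: `format_timestamp` and `build_chapters` are hypothetical helper names, and the check that the first chapter starts at 0:00 reflects how YouTube is generally understood to detect chapters.

```python
def format_timestamp(seconds: int) -> str:
    """Format a second offset as M:SS, or H:MM:SS once past an hour."""
    if seconds < 0:
        raise ValueError("timestamp must be non-negative")
    hours, rem = divmod(int(seconds), 3600)
    minutes, secs = divmod(rem, 60)
    if hours:
        return f"{hours}:{minutes:02d}:{secs:02d}"
    return f"{minutes}:{secs:02d}"


def build_chapters(chapters):
    """Render (start_seconds, title) pairs as one chapter line each."""
    # Sort by start time so the lines come out in playback order.
    ordered = sorted(chapters, key=lambda c: c[0])
    # Assumption: chapters are only recognized when the first starts at 0:00.
    if not ordered or ordered[0][0] != 0:
        raise ValueError("first chapter must start at 0 seconds")
    return "\n".join(
        f"{format_timestamp(start)} {title}" for start, title in ordered
    )


if __name__ == "__main__":
    print(build_chapters([(0, "Intro and origin story"),
                          (210, "How the brand grew")]))
```

Keeping offsets as integers until the final render avoids rounding drift when chapter boundaries are derived from transcript segment times.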
Default output order:
scripts/transcribe_youtube.sh: download audio from YouTube, shrink oversized audio for Whisper API, and fall back cleanly.
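The contents of scripts/transcribe_youtube.sh are not shown here, so the following is only a rough Python sketch of the download-then-shrink idea it describes. The function names, the output template, the ffmpeg settings, and the 25 MB threshold are all assumptions made for illustration; the actual script may differ.

```python
import os

# Assumption: the hosted Whisper endpoint rejects uploads above roughly 25 MB.
MAX_UPLOAD_BYTES = 25 * 1024 * 1024


def download_command(url: str, out_template: str = "audio.%(ext)s") -> list[str]:
    """Build a yt-dlp invocation that keeps only the audio track as mp3."""
    return ["yt-dlp", "--extract-audio", "--audio-format", "mp3",
            "-o", out_template, url]


def shrink_command(src: str, dst: str) -> list[str]:
    """Build an ffmpeg invocation that downmixes to mono 16 kHz at 32 kbit/s."""
    return ["ffmpeg", "-y", "-i", src,
            "-ac", "1", "-ar", "16000", "-b:a", "32k", dst]


def needs_shrinking(path: str, limit: int = MAX_UPLOAD_BYTES) -> bool:
    """Return True when the file on disk is larger than the upload limit."""
    return os.path.getsize(path) > limit
```

Each command list can be passed to `subprocess.run(cmd, check=True)`. Building the argument list separately from running it keeps the commands easy to log for the user during long jobs and avoids shell-quoting problems with URLs.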