Turn any brainstorming artifact (spec, plan, design doc) into an engaging podcast-style audio conversation. Late-night talk show format with a witty host and enthusiastic guest. Triggers: "podcast", "make a podcast of", "turn this into a podcast", "podcast this spec"
Convert a spec, plan, or design doc into a late-night talk show podcast. Two voices discuss the artifact with humor, skepticism, and genuine exploration of the ideas.
podcast <filepath>
The argument is a path to a markdown or text file (spec, plan, design doc, etc.).
Person A — The one who read the spec. Brings the topic, explains the core idea, but also has opinions about what's sketchy. Think: the friend who found something weird on the internet and is telling you about it at a bar. Goes on tangents. Gets visibly fired up about the clever parts.
Person B — Reacting in real time. Hasn't read it. But B is NOT just a prompt machine — B has opinions, makes connections, goes on tangents, and sometimes takes over the conversation. B draws on their own experience to challenge or build on what A says. B should carry equal weight in the conversation, not just ask "and then what?" after every A line. Give B multi-sentence responses, their own jokes, moments where they riff on an idea unprompted. The "wait, they did WHAT?" energy, but also the "okay that reminds me of..." and "no no no, here's the actual problem with that" energy.
Write the dialogue as a JSON array of objects, each with speaker ("a" or "b") and text:
[
{"speaker": "a", "text": "Dude, okay, so I was reading this spec and..."},
{"speaker": "b", "text": "Wait, you actually read a spec? Voluntarily?"}
]
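The speaker/text format shown above can be checked before it is handed to the audio step. The following is a minimal Python sketch, not part of the skill's scripts: the function name `validate_dialogue` is hypothetical, and rejecting an empty array is an assumption the skill does not state.

```python
import json


def validate_dialogue(raw: str) -> list[dict]:
    """Parse a dialogue JSON string and check the speaker/text shape.

    Hypothetical helper; the skill itself does not ship this function.
    """
    lines = json.loads(raw)
    # Assumption: an empty dialogue is treated as an error.
    if not isinstance(lines, list) or not lines:
        raise ValueError("dialogue must be a non-empty JSON array")
    for i, line in enumerate(lines):
        if not isinstance(line, dict):
            raise ValueError(f"entry {i} is not an object")
        if line.get("speaker") not in ("a", "b"):
            raise ValueError(f"entry {i}: speaker must be 'a' or 'b'")
        text = line.get("text")
        if not isinstance(text, str) or not text.strip():
            raise ValueError(f"entry {i}: text must be a non-empty string")
    return lines
```

Running a check like this on the temporary JSON file would catch a mistyped speaker label before any audio is requested.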
Write the readable transcript to docs/superpowers/podcasts/<name>-transcript.md
<name> is derived from the input filename (strip extension).
bash .claude/skills/podcast/scripts/generate.sh <temp-json> docs/superpowers/podcasts/<name>.mp3
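As an illustration of the naming rule, here is a small Python sketch that derives both output paths from an input file. The helper name `output_paths` is hypothetical. It also assumes that "strip extension" means dropping only the final suffix, which is how `Path.stem` behaves.

```python
from pathlib import Path

# Output directory named in the skill.
OUTPUT_DIR = Path("docs/superpowers/podcasts")


def output_paths(input_file: str) -> dict[str, Path]:
    """Derive the audio and transcript paths from the input filename.

    Hypothetical helper. Assumes only the last extension is stripped,
    so "notes.v2.md" yields the name "notes.v2".
    """
    name = Path(input_file).stem
    return {
        "audio": OUTPUT_DIR / f"{name}.mp3",
        "transcript": OUTPUT_DIR / f"{name}-transcript.md",
    }
```

For an input of `specs/auth-design.md`, the derived name would be `auth-design`, and both outputs land in the podcasts directory regardless of where the input lives.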
Length: Target ~1 minute of audio per 500 words of input. A 1000-word spec gets a ~2 minute episode. A 3000-word design doc gets ~6 minutes. Each minute is roughly 150 words of dialogue.
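The length arithmetic can be written out as a short Python sketch. The ratios (500 input words per minute of audio, 150 dialogue words per minute) come from the rule above. The function name `target_length` and the whitespace-based word count are assumptions made for illustration.

```python
INPUT_WORDS_PER_MINUTE = 500     # ~1 minute of audio per 500 words of input
DIALOGUE_WORDS_PER_MINUTE = 150  # each minute is roughly 150 words of dialogue


def target_length(input_text: str) -> tuple[float, int]:
    """Return (target minutes, target dialogue word count) for an input text.

    Hypothetical helper. Words are counted by splitting on whitespace,
    which is an approximation for markdown input.
    """
    input_words = len(input_text.split())
    minutes = input_words / INPUT_WORDS_PER_MINUTE
    dialogue_words = round(minutes * DIALOGUE_WORDS_PER_MINUTE)
    return minutes, dialogue_words
```

Under these ratios a 1000-word spec works out to 2 minutes and about 300 words of dialogue, and a 3000-word design doc to 6 minutes and about 900 words.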
Structure:
Tone rules:
Balance rule: A and B should have roughly equal airtime. If A has spoken for 3+ lines in a row, B needs to take over, not with a short reaction but with a real thought. Count the words: if A has twice as many total words as B, the dialogue is too lopsided and needs a rewrite.
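Both parts of the balance rule are mechanical enough to check in code. The Python sketch below is one possible reading, not the skill's own implementation: `balance_problems` is a hypothetical name, a "line" is taken to mean one entry in the dialogue JSON, and words are counted by whitespace splitting.

```python
def balance_problems(dialogue: list[dict]) -> list[str]:
    """Return a list of balance-rule violations (empty if none).

    Hypothetical helper. Expects entries shaped like
    {"speaker": "a" | "b", "text": "..."}.
    """
    problems = []
    words = {"a": 0, "b": 0}
    run = 0  # consecutive entries spoken by A
    for i, line in enumerate(dialogue):
        words[line["speaker"]] += len(line["text"].split())
        if line["speaker"] == "a":
            run += 1
            # Flag once per run, at the moment it reaches three entries.
            if run == 3:
                problems.append(f"A has 3+ lines in a row by entry {i}")
        else:
            run = 0
    # Lopsided if A has at least twice B's total word count.
    if words["a"] > 0 and words["a"] >= 2 * words["b"]:
        problems.append(
            f"A has {words['a']} words vs B's {words['b']}: too lopsided"
        )
    return problems
```

A check like this only measures quantity. Whether B's lines are "a real thought" rather than a short reaction still needs a human or model read of the transcript.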
Do NOT:
All files go to docs/superpowers/podcasts/ (created if it doesn't exist):
<name>.mp3: the podcast audio
<name>-transcript.md: readable dialogue with speaker labels
ELEVENLABS_API_KEY is not set: tell the user to set it and stop
ffmpeg, jq, or curl missing: tell the user to install and stop
The podcast and transcript are generated artifacts. Do not commit them; tell the user the file paths and let them decide.
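The two stop conditions (a missing ELEVENLABS_API_KEY and missing command-line tools) can be checked up front. The Python sketch below assumes a hypothetical helper named `preflight_errors`. The `env` and `which` parameters are added only so the check can be exercised without touching the real environment, and the skill's own generate.sh may do this differently.

```python
import os
import shutil

# Tools the skill says must be installed.
REQUIRED_TOOLS = ("ffmpeg", "jq", "curl")


def preflight_errors(env=None, which=shutil.which) -> list[str]:
    """Return user-facing error messages; an empty list means ready to run.

    Hypothetical helper, not part of the skill's scripts.
    """
    env = os.environ if env is None else env
    errors = []
    if not env.get("ELEVENLABS_API_KEY"):
        errors.append("ELEVENLABS_API_KEY is not set: set it and rerun")
    # shutil.which returns None when a command is not found on PATH.
    missing = [tool for tool in REQUIRED_TOOLS if which(tool) is None]
    if missing:
        errors.append(f"missing tools ({', '.join(missing)}): install and rerun")
    return errors
```

If the returned list is non-empty, the messages can be relayed to the user and the run stopped before any dialogue is written.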