Create podcasts from topics, URLs, or text. Triggers on: "做播客", "podcast", "播客", "录一期节目", "chat about", "discuss", "debate", "dialogue", "make a podcast about".
/speech)/explainer)/image-gen)/content-parser)Generate podcast episodes with 1-2 AI speakers discussing a topic. Supports quick overviews, deep analysis, and debate formats. Input can be a topic description, URL(s), or text. Output is a full audio episode with transcript.
shared/cli-authentication.mdshared/cli-patterns.md for command execution and error handlingshared/speaker-selection.md as fallback only; fetch from the speakers API when the user wants to change voiceshared/config-pattern.md before any interactionshared/speaker-selection.md for speaker selection (text table + free-text input)~/Downloads/ or .listenhub/ — save artifacts to the current working directory with friendly topic-based names (see shared/config-pattern.md § Artifact Naming)Follow shared/cli-authentication.md § Auth Check. If the CLI is not installed or the user is not logged in, auto-install and auto-login — never ask the user to run commands manually.
Follow shared/config-pattern.md Step 0 (Zero-Question Boot).
If file doesn't exist — silently create with defaults and proceed:
mkdir -p ".listenhub/podcast"
echo '{"outputMode":"inline","language":null,"defaultMode":"quick","defaultSpeakers":{}}' > ".listenhub/podcast/config.json"
CONFIG_PATH=".listenhub/podcast/config.json"
CONFIG=$(cat "$CONFIG_PATH")
Do NOT ask any setup questions. Proceed directly to the Interaction Flow.
If file exists — read config silently and proceed:
CONFIG_PATH=".listenhub/podcast/config.json"
[ ! -f "$CONFIG_PATH" ] && CONFIG_PATH="$HOME/.listenhub/podcast/config.json"
CONFIG=$(cat "$CONFIG_PATH")
Only run when the user explicitly asks to reconfigure. Display current settings:
当前配置 (podcast):
输出方式:{inline / download / both}
语言偏好:{zh / en / 未设置}
默认模式:{quick / deep / debate / 未设置}
默认主播:{speakerName(s) / 使用内置默认}
Then ask these questions in order and save:
outputMode: Follow shared/output-mode.md § Setup Flow Question.
Language (optional): "默认语言?"
nullMode (optional): "默认播客模式?"
nullAfter collecting answers, save immediately:
NEW_CONFIG=$(echo "$CONFIG" | jq --arg m "$OUTPUT_MODE" '. + {"outputMode": $m}')
# Save language if user chose one (not "每次手动选择")
if [ "$LANGUAGE" != "null" ]; then
NEW_CONFIG=$(echo "$NEW_CONFIG" | jq --arg lang "$LANGUAGE" '. + {"language": $lang}')
fi
# Save mode if user chose one
if [ "$MODE" != "null" ]; then
NEW_CONFIG=$(echo "$NEW_CONFIG" | jq --arg mode "$MODE" '. + {"defaultMode": $mode}')
fi
echo "$NEW_CONFIG" > "$CONFIG_PATH"
CONFIG=$(cat "$CONFIG_PATH")
Ask topic and optional reference materials together in a single question using AskUserQuestion with two sub-questions, or a single free-text prompt:
What topic would you like to turn into a podcast? If you have reference materials (URLs or text), include them here too.
Accept: topic description, URL(s), pasted text, or any combination.
Examples of valid input:
Default: "quick" — skip this question unless:
config.defaultMode is set to something else → use that value silentlyOnly ask this question if the user's intent is ambiguous AND no default is configured. In most cases, just use "quick".
Default: match the user's interaction language. Detect from the language the user used in Step 1:
zhenconfig.language is set → use that valueNever ask this question. Always infer silently. Show in the confirmation summary so the user can override if needed.
Default: 2 speakers (dialogue) — the most common and engaging format.
Skip this question. Debate mode requires 2 speakers. For quick/deep, default to 2 speakers as well.
Only use 1 speaker if the user explicitly requests a monologue or solo format.
Follow shared/speaker-selection.md:
config.defaultSpeakers.{language} is set → use saved speakers silentlyshared/speaker-selection.md (no question asked)For 2-speaker mode (dialogue/debate): use Primary + Secondary defaults for the language.
Summarize all choices:
Ready to generate podcast:
Topic: {topic}
Mode: {mode}
Language: {language}
Speakers: {speaker name(s)}
References: {yes/no + brief description}
Proceed?
Wait for explicit confirmation before calling any CLI command. The user can adjust any parameter here before confirming.
Submit (background): Run the CLI command with run_in_background: true and timeout: 360000:
listenhub podcast create \
--query "{topic}" \
--source-url "{url}" \
--source-text "{text}" \
--mode {quick|deep|debate} \
--lang {en|zh|ja} \
--speaker "{name}" \
--speaker "{name2}" \
--json
Flag notes:
--query — the topic or question to discuss--source-url — repeatable, one per URL reference--source-text — repeatable, one per text block reference--mode — one of quick, deep, debate--lang — language code--speaker — repeatable (max 2); use speaker display names--speaker-id — alternative to --speaker; use speaker IDs instead of names--source-url / --source-text if the user provided no referencesThe CLI handles polling internally and returns the final result when generation completes.
Tell the user the task is submitted and that they will be notified when it finishes.
When notified of completion, Present result:
Parse the CLI JSON output to extract fields: audioUrl, subtitlesUrl, audioDuration, credits.
Read OUTPUT_MODE from config. Follow shared/output-mode.md for behavior.
inline or both: Display audioUrl as a clickable link.
Present:
播客已生成!
在线收听:{audioUrl}
字幕:{subtitlesUrl}(如有)
时长:{audioDuration / 1000}s
消耗积分:{credits}
download or both: Also download the file. Generate a topic slug following shared/config-pattern.md § Artifact Naming.
SLUG="{topic-slug}" # e.g. "ai-developments"
NAME="${SLUG}-podcast.mp3"
# Dedup: if file exists, append -2, -3, etc.
BASE="${NAME%.*}"; EXT="${NAME##*.}"; i=2
while [ -e "$NAME" ]; do NAME="${BASE}-${i}.${EXT}"; i=$((i+1)); done
curl -sS -o "$NAME" "{audioUrl}"
Present:
已保存到当前目录:
{NAME}
Offer to show transcript or provide download URL on request
Update config with the choices made this session:
NEW_CONFIG=$(echo "$CONFIG" | jq \
--arg lang "{language}" \
--arg mode "{mode}" \
--argjson speakers '{"{language}": ["{speakerId}"]}' \
'. + {"language": $lang, "defaultMode": $mode, "defaultSpeakers": (.defaultSpeakers + $speakers)}')
echo "$NEW_CONFIG" > "$CONFIG_PATH"
shared/cli-speakers.mdshared/speaker-selection.mdshared/cli-patterns.mdshared/cli-authentication.mdshared/config-pattern.mdUser: "Make a podcast about the latest AI developments"
Agent workflow:
listenhub podcast create \
--query "The latest AI developments" \
--mode deep \
--lang en \
--speaker "Mars" \
--speaker "Mia" \
--json
Wait for CLI to return result, then present with title and listen link.