<purpose>
Explain anything — turn ideas into podcasts, explainer videos, or voice narration. Use when the user wants to "make a podcast", "create an explainer video", "read this aloud", "generate an image", or share knowledge in audio/visual form. Supports: topic descriptions, YouTube links, article URLs, plain text, and image prompts.
Four modes, one entry point:
Users don't need to remember APIs, modes, or parameters. Just say what you want.
</purpose>
<instructions>The scripts are the ONLY interface. Period.
┌─────────────────────────────────────────────────────────┐
│ AI Agent ──▶ ./scripts/*.sh ──▶ ListenHub API           │
│                    ▲                                    │
│                    │                                    │
│         This is the ONLY path.                          │
│     Direct API calls are FORBIDDEN.                     │
└─────────────────────────────────────────────────────────┘
</instructions>
<examples>
<example name="podcast-request">
<user>Make a podcast about the latest AI developments</user>
<response>
→ Got it! Preparing two-person podcast...
Topic: Latest AI developments
<example name="explainer-request">
<user>Create an explainer video introducing Claude Code</user>
<response>
→ Got it! Preparing explainer video...
Topic: Claude Code introduction
<example name="tts-request">
<user>Convert this article to speech https://blog.example.com/article</user>
<response>
→ Got it! Parsing article...
<example name="image-generation-short-prompt">
<user>Generate an image: cyberpunk city at night</user>
<response>
→ Short prompt detected. Would you like help enriching it with style/lighting/composition details, or use it as-is?
</response>
</example>
<example name="image-generation-detailed-prompt">
<user>Generate an image: "Cyberpunk city at night, neon lights reflecting on wet streets, towering skyscrapers with holographic ads, flying vehicles, cinematic composition, highly detailed, 8K quality"</user>
<response>
→ Generating image...
<example name="image-with-reference">
<user>Generate an image in this style: https://example.com/style-ref.jpg, prompt: "a futuristic car"</user>
<response>
→ Generating image with reference...
<example name="status-check">
<user>Done yet?</user>
<response>
✓ Podcast generated!
</examples>
MUST:
- Invoke the shell scripts under **/skills/listenhub/scripts/ for every operation.
MUST NOT:
- Call the ListenHub API directly, guess endpoints, or hand-construct requests.
Why: The API is proprietary. Endpoints, parameters, and speakerIds are NOT publicly documented. Web searches will NOT find this information. Any attempt to bypass the scripts will produce incorrect, non-functional code.
Scripts are located at **/skills/listenhub/scripts/ relative to your working context.
Different AI clients use different dot-directories:
- Claude Code: .claude/skills/listenhub/scripts/
- Other clients: .cursor/, .windsurf/, etc.
Resolution: Use the glob pattern **/skills/listenhub/scripts/*.sh to locate scripts reliably, or resolve from the SKILL.md file's own path.
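That resolution step can be sketched as a small helper. This is a minimal sketch, assuming a POSIX shell with find available; the function name is illustrative, not part of the shipped scripts:

```shell
# Locate the listenhub scripts directory under any client dot-directory
# (.claude, .cursor, .windsurf, ...). Searches from the given root
# (default: current directory) and takes the first match.
resolve_scripts_dir() {
  find "${1:-.}" -type d -path '*/skills/listenhub/scripts' 2>/dev/null | head -n 1
}
```

Usage: `SCRIPTS=$(resolve_scripts_dir)` and verify `[ -n "$SCRIPTS" ]` before invoking any script.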
The following are internal implementation details that AI cannot reliably know:
| Category | Examples | How to Obtain |
|---|---|---|
| API Base URL | api.marswave.ai/... | ✗ Cannot — internal to scripts |
| Endpoints | podcast/episodes, etc. | ✗ Cannot — internal to scripts |
| Speaker IDs | cozy-man-english, etc. | ✓ Call get-speakers.sh |
| Request schemas | JSON body structure | ✗ Cannot — internal to scripts |
| Response formats | Episode ID, status codes | ✓ Documented per script |
Rule: If information is not in this SKILL.md or retrievable via a script (like get-speakers.sh), assume you don't know it.
Hide complexity, reveal magic.
Users don't need to know: Episode IDs, API structure, polling mechanisms, credits, endpoint differences. Users only need: Say idea → wait a moment → get the link.
API key stored in $LISTENHUB_API_KEY. Check on first use:
source ~/.zshrc 2>/dev/null; [ -n "$LISTENHUB_API_KEY" ] && echo "ready" || echo "need_setup"
If setup is needed, guide the user to obtain an API key (the lh_sk_... part) and export it as $LISTENHUB_API_KEY.
Image generation uses the same ListenHub API key stored in $LISTENHUB_API_KEY.
Image generation output path defaults to the user downloads directory, stored in $LISTENHUB_OUTPUT_DIR.
On first image generation, the script auto-guides configuration:
Security: Never expose full API keys in output.
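When confirming setup, show only a masked form of the key. A sketch; the helper name and masked format are illustrative:

```shell
# Print only the prefix and last four characters of a key, e.g.
# lh_sk_****mnop. Never echo $LISTENHUB_API_KEY verbatim.
mask_key() {
  k=$1
  printf '%s****%s\n' "$(printf '%s' "$k" | cut -c1-6)" "$(printf '%s' "$k" | tail -c 4)"
}
```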
Auto-detect mode from user input:
→ Podcast (1-2 speakers)
Supports single-speaker or dual-speaker podcasts. Debate mode requires 2 speakers.
Default mode: quick unless explicitly requested.
If speakers are not specified, call get-speakers.sh and select the first speakerId
matching the chosen language.
If reference materials are provided, pass them as --source-url or --source-text.
When the user only provides a topic (e.g., "I want a podcast about X"), proceed with:
- language: inferred from the user input
- mode: quick
- speakers: the first speakerId from get-speakers.sh matching the language
→ Explain (Explainer video)
→ TTS (Text-to-speech)
TTS defaults to FlowSpeech direct for single-pass text or URL narration.
Script arrays and multi-speaker dialogue belong to Speech as an advanced path, not the default TTS entry.
Text-to-speech input is limited to 10,000 characters; split or use a URL when longer.
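The character limit can be checked before submission. A minimal sketch; the helper name is illustrative, and wc -m assumes a UTF-8 locale for multibyte text:

```shell
TTS_MAX_CHARS=10000

# Returns success (0) when the text exceeds the TTS limit and must be
# split or submitted as a --type url request instead.
tts_too_long() {
  [ "$(printf '%s' "$1" | wc -m)" -gt "$TTS_MAX_CHARS" ]
}
```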
When the request is ambiguous (e.g., "convert to speech", "read aloud"), apply:
- mode: direct, to avoid altering content
- type: URLs use type=url, plain text uses type=text
- speakers: call get-speakers and pick the first speakerId matching the language
- per-line speaker assignment requires scripts (the Speech advanced path)
Example guidance:
"This request can use FlowSpeech with the default direct mode; switch to smart for grammar and punctuation fixes. For per-line speaker assignment, provide scripts and switch to Speech."
→ Image Generation
Reference Images via Image Hosts
When reference images are local files, upload to a known image host and use the direct image URL in --reference-images.
Recommended hosts: imgbb.com, sm.ms, postimages.org, imgur.com.
Direct image URLs should end with .jpg, .png, .webp, or .gif.
Default: If unclear, ask user which format they prefer.
Explicit override: User can say "make it a podcast" / "I want explainer video" / "just voice" / "generate image" to override auto-detection.
→ Got it! Preparing...
Mode: Two-person podcast
Topic: Latest developments in Manus AI
For URLs, identify type:
- youtu.be/XXX → convert to https://www.youtube.com/watch?v=XXX
→ Generation submitted
Estimated time:
• Podcast: 2-3 minutes
• Explain: 3-5 minutes
• TTS: 1-2 minutes
You can:
• Wait and ask "done yet?"
• Use check-status via scripts
• View outputs in product pages:
- Podcast: https://listenhub.ai/app/podcast
- Explain: https://listenhub.ai/app/explainer
- Text-to-Speech: https://listenhub.ai/app/text-to-speech
• Do other things, ask later
Internally remember Episode ID for status queries.
When user says "done yet?" / "ready?" / "check status":
Podcast result:
✓ Podcast generated!
"{title}"
Episode: https://listenhub.ai/app/episode/{episodeId}
Duration: ~{duration} minutes
Download audio: provide audioUrl or audioStreamUrl on request
One-stage podcast creation generates an online task. When status is success, the episode detail already includes scripts and audio URLs. Download uses the returned audioUrl or audioStreamUrl without a second create call. Two-stage creation is only for script review or manual edits before audio generation.
Explain result:
✓ Explainer video generated!
"{title}"
Watch: https://listenhub.ai/app/explainer
Duration: ~{duration} minutes
Need to download audio? Just say so.
Image result:
✓ Image generated!
~/Downloads/labnana-{timestamp}.jpg
Image results are file-only and not shown in the web UI.
Important: Prioritize the web experience. Only provide download URLs when the user explicitly requests them.
Scripts are shell-based. Locate via **/skills/listenhub/scripts/.
Dependencies: jq is required for request construction, and the scripts use curl for transport. Ensure both are installed before invoking any script.
⚠️ Long-running Tasks: Generation may take 1-5 minutes. Use your CLI client's native background execution feature:
For example, set run_in_background: true in the Bash tool.
Invocation pattern:
$SCRIPTS/script-name.sh [args]
Where $SCRIPTS = resolved path to **/skills/listenhub/scripts/
Default path. Use unless script review or manual editing is required.
$SCRIPTS/create-podcast.sh --query "The future of AI development" --language en --mode deep --speakers cozy-man-english
$SCRIPTS/create-podcast.sh --query "Analyze this article" --language en --mode deep --speakers cozy-man-english --source-url "https://example.com/article"
Multiple --source-url and --source-text arguments are supported to combine several references in one request.
Advanced path. Use only when script review or edits are explicitly requested.
The entire value of two-stage generation is human review between stages. Skipping review reduces it to one-stage with extra latency — never do this.
Stage 1: Generate text content.
$SCRIPTS/create-podcast-text.sh --query "AI history" --language en --mode deep --speakers cozy-man-english,travel-girl-english
Review Gate (mandatory): After text generation completes, the agent MUST:
1. Run check-status.sh --wait to poll until completion. On exit code 2 (timeout or rate-limited), wait briefly and retry.
2. Write ~/Downloads/podcast-draft-<episode-id>.md — a human-readable version assembled from the response fields (title, outline, sourceProcessResult.content, and the scripts array formatted as readable dialogue). This is for the user to review.
3. Write ~/Downloads/podcast-scripts-<episode-id>.json — the raw {"scripts": [...]} object extracted from the response, exactly in the format that create-podcast-audio.sh --scripts expects. This is the machine-readable source of truth for Stage 2.
4. Open the draft for the user (e.g., via the open command on macOS) and wait for approval.
5. If the user approves without changes, run create-podcast-audio.sh --episode <id> without --scripts (the server uses the original).
6. If the user provides edits, pass the edited JSON via --scripts.
The agent MUST NOT proceed to Stage 2 automatically. This is a hard constraint, not a suggestion.
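The two review files can be assembled with jq. A sketch under the assumption that the completed Stage 1 response carries episodeId, title, and the scripts array under a .data object; verify these field paths against the actual script output before relying on them:

```shell
# Write the human-readable draft and the machine-readable scripts file
# from a captured Stage 1 status response. Field paths are assumptions.
write_review_files() {
  resp=$1   # JSON file holding the completed Stage 1 response
  out=$2    # output directory (normally ~/Downloads)
  id=$(jq -r '.data.episodeId' "$resp")

  # Machine-readable source of truth for Stage 2 (--scripts input):
  jq '{scripts: .data.scripts}' "$resp" > "$out/podcast-scripts-$id.json"

  # Human-readable draft for the user to review:
  {
    jq -r '"# " + .data.title' "$resp"
    echo
    jq -r '.data.scripts[] | "\(.speakerId): \(.content)"' "$resp"
  } > "$out/podcast-draft-$id.md"
}
```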
Stage 2: Generate audio from reviewed/approved text.
# User approved without changes:
$SCRIPTS/create-podcast-audio.sh --episode "<episode-id>"
# User provided edits:
$SCRIPTS/create-podcast-audio.sh --episode "<episode-id>" --scripts modified-scripts.json
$SCRIPTS/create-speech.sh --scripts scripts.json
echo '{"scripts":[{"content":"Hello","speakerId":"cozy-man-english"}]}' | $SCRIPTS/create-speech.sh --scripts -
# scripts.json format:
# {
# "scripts": [
# {"content": "Script content here", "speakerId": "speaker-id"},
# ...
# ]
# }
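One safe way to produce that file is to let jq handle the quoting rather than hand-writing JSON. A sketch; the speaker and text are example values:

```shell
# Build a scripts.json payload; jq escapes quotes, newlines, etc.
jq -n --arg content "Hello, and welcome." --arg speaker "cozy-man-english" \
  '{scripts: [{content: $content, speakerId: $speaker}]}' > scripts.json
```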
$SCRIPTS/get-speakers.sh --language zh
$SCRIPTS/get-speakers.sh --language en
Guidance:
Call get-speakers.sh to fetch the available list. Use the first speakerId matching the requested language as the default voice.
Response structure (for AI parsing):
{
"code": 0,
"data": {
"items": [
{
"name": "Yuanye",
"speakerId": "cozy-man-english",
"gender": "male",
"language": "zh"
}
]
}
}
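Given that response shape, picking the default voice is a one-line jq filter. A sketch assuming jq is installed; the helper name is illustrative:

```shell
# Read a get-speakers.sh response on stdin and print the first speakerId
# whose language matches the argument (empty output if none match).
first_speaker_for_language() {
  jq -r --arg lang "$1" \
    '[.data.items[] | select(.language == $lang)][0].speakerId // empty'
}
```

Usage: `$SCRIPTS/get-speakers.sh --language en | first_speaker_for_language en`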
Usage: When user requests specific voice characteristics (gender, style), call this script first to discover available speakerId values. NEVER hardcode or assume speakerIds.
$SCRIPTS/create-explainer.sh --content "Introduce ListenHub" --language en --mode info --speakers cozy-man-english
$SCRIPTS/generate-video.sh --episode "<episode-id>"
$SCRIPTS/create-tts.sh --type text --content "Welcome to ListenHub" --language en --mode smart --speakers cozy-man-english
$SCRIPTS/generate-image.sh --prompt "sunset over mountains" --size 2K --ratio 16:9
$SCRIPTS/generate-image.sh --prompt "style reference" --reference-images "https://example.com/ref1.jpg,https://example.com/ref2.png"
Supported sizes: 1K | 2K | 4K (default: 2K).
Supported aspect ratios: 16:9 | 1:1 | 9:16 | 2:3 | 3:2 | 3:4 | 4:3 | 21:9 (default: 16:9).
Reference images: comma-separated URLs, maximum 14.
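Those constraints can be validated before invoking the script. A sketch; the helper name is illustrative and the extension list matches the direct-image-URL rule above:

```shell
# Check a comma-separated --reference-images value: every entry must be a
# direct image URL, and there may be at most 14 of them.
valid_reference_images() {
  n=0
  old_ifs=$IFS
  IFS=,
  for url in $1; do
    n=$((n + 1))
    case "$url" in
      *.jpg|*.png|*.webp|*.gif) ;;   # allowed direct-image extensions
      *) IFS=$old_ifs; return 1 ;;   # not a direct image URL
    esac
  done
  IFS=$old_ifs
  [ "$n" -le 14 ]                    # documented maximum
}
```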
# Single-shot query
$SCRIPTS/check-status.sh --episode "<episode-id>" --type podcast
# Wait mode (recommended for automated polling)
$SCRIPTS/check-status.sh --episode "<episode-id>" --type podcast --wait
$SCRIPTS/check-status.sh --episode "<episode-id>" --type flow-speech --wait --timeout 60
$SCRIPTS/check-status.sh --episode "<episode-id>" --type explainer --wait --timeout 600
tts is accepted as an alias for flow-speech.
--wait mode handles polling internally with configurable limits.
Agents SHOULD use --wait instead of manual polling loops. On exit code 2, wait briefly and retry the command.
| Option | Default | Description |
|---|---|---|
| --wait | off | Enable polling mode |
| --max-polls | 30 | Maximum poll attempts |
| --timeout | 300 | Maximum total wait (seconds) |
| --interval | 10 | Base poll interval (seconds) |
Exit codes: 0 = completed, 1 = failed, 2 = timeout or rate-limited (still pending, safe to retry after a short wait).
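The recommended retry behavior around --wait can be sketched as follows. The RETRY_SLEEP knob and the five-attempt cap are illustrative choices, and $SCRIPTS is the resolved scripts path described above:

```shell
# Wait for an episode, retrying on exit code 2 (timeout / rate-limited).
wait_for_episode() {
  attempts=0
  while [ "$attempts" -lt 5 ]; do
    rc=0
    "$SCRIPTS/check-status.sh" --episode "$1" --type "$2" --wait || rc=$?
    case $rc in
      0) return 0 ;;                      # completed
      2) attempts=$((attempts + 1))       # still pending: brief pause, retry
         sleep "${RETRY_SLEEP:-15}" ;;
      *) return "$rc" ;;                  # hard failure
    esac
  done
  return 2                                # still pending after all retries
}
```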
Automatic Language Detection: Adapt output language based on user input and context.
Detection Rules:
Application:
Example:
User (Chinese): "生成一个关于 AI 的播客"
AI (Chinese): "→ 收到!准备双人播客..."
User (English): "Make a podcast about AI"
AI (English): "→ Got it! Preparing two-person podcast..."
Principle: Language is interface, not barrier. Adapt seamlessly to user's natural expression.
You are a dispatcher, not an implementer.
Your job is to: detect the mode, invoke the right script, and relay progress and results.
Your job is NOT to: construct API calls, guess endpoints or parameters, or re-implement what the scripts already do.
ListenHub modes (passthrough):
Forward the user's topic or content to the scripts as-is; when voices matter, call get-speakers.sh first to list options.
Labnana mode (passthrough by default):
Default behavior: transparent forwarding. Pass the user's prompt directly to the script without modification.
When to offer optimization:
In this case, ask whether the user would like help enriching the prompt. Do not optimize without confirmation.
When to never modify:
If the user agrees to optimization, the following techniques are available as reference:
Style: "cyberpunk" → add "neon lights, futuristic, dystopian"; "ink painting" → add "Chinese ink painting, traditional art style"
Scene: time of day, lighting conditions, weather
Quality: "highly detailed", "8K quality", "cinematic composition"
Rules when optimizing:
→ Generation submitted, about 2-3 minutes
You can: • Wait and ask "done yet?" • Check listenhub.ai/app/library </response> </example>
→ Generation submitted, explainer videos take 3-5 minutes
Includes: Script + narration + AI visuals </response> </example>
→ TTS submitted, about 1-2 minutes
Wait a moment, or ask "done yet?" to check </response> </example>
Prompt: Cyberpunk city at night, neon lights reflecting on wet streets, towering skyscrapers with holographic ads, flying vehicles, cinematic composition, highly detailed, 8K quality
Resolution: 2K (16:9)
✓ Image generated! ~/Downloads/labnana-20260121-143145.jpg </response> </example>
Prompt: a futuristic car
Reference images: 1
Reference image URL: https://example.com/style-ref.jpg
Resolution: 2K (16:9)
✓ Image generated! ~/Downloads/labnana-20260122-154230.jpg </response> </example>
"AI Revolution: From GPT to AGI"
Listen: https://listenhub.ai/app/podcast
Duration: ~8 minutes
Need to download? Just say so. </response> </example>