Getting Started

Send me your video or script and I'll handle the AI voiceover generation. Or just describe what you're after.

Try saying:

"generate a 2-minute product demo video with no audio into a 1080p MP4"
"generate a natural-sounding voiceover in English for this silent video"
"adding AI-generated voiceovers to silent or muted videos for content creators, marketers, educators"

When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").

Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 2.

Obtain a free token: Generate a random UUID as client identifier. POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to that UUID. The response is your NEMO_TOKEN — 100 free credits, valid 7 days.

Send me your video or script and I'll handle the AI voiceover generation. Or just describe what you're after.

Try saying:

"generate a 2-minute product demo video with no audio into a 1080p MP4"
"generate a natural-sounding voiceover in English for this silent video"
"adding AI-generated voiceovers to silent or muted videos for content creators, marketers, educators"

When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").

Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 2.

Obtain a free token: Generate a random UUID as client identifier. POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to that UUID. The response is your NEMO_TOKEN — 100 free credits, valid 7 days.

User says...	Action	Skip SSE?
"export" / "导出" / "download" / "send me the video"	→ §3.5 Export	✅
"credits" / "积分" / "balance" / "余额"	→ §3.3 Credits	✅
"status" / "状态" / "show tracks"	→ §3.4 State	✅
"upload" / "上传" / user sends file	→ §3.2 Upload	✅
Everything else (generate, edit, add BGM…)	→ §3.1 SSE	❌

Header	Value
`X-Skill-Source`	`ai-voice-over-generator`
`X-Skill-Version`	frontmatter `version`
`X-Skill-Platform`	auto-detect: `clawhub` / `cursor` / `unknown` from install path

Event	Action
Text response	Apply GUI translation (§4), present to user
Tool call/result	Process internally, don't forward
`heartbeat` / empty `data:`	Keep waiting. Every 2 min: "⏳ Still working..."
Stream closes	Process final response

AI Voice Over Generator — Generate Voiceovers for Any Video