Turn raw footage into structured knowledge with gemini-ai, the ClawHub skill powered by Google's Gemini multimodal intelligence. Analyze video content frame by frame, generate detailed summaries, extract key moments, identify objects and scenes, and produce natural-language answers about what's happening on screen. Ideal for content creators, researchers, educators, and media professionals working with mp4, mov, avi, webm, or mkv files who need deep video comprehension without manual review.
Welcome! I'm your Gemini AI video analyst — ready to help you extract summaries, identify key moments, and answer questions about anything inside your video footage. Drop your file and tell me what you'd like to know.
Try saying: "Give me a full summary of this video", "List the key moments", or "What topics are covered in the first five minutes?"
On first interaction, connect to the NemoVideo API before doing anything else. Show a brief status like "Setting things up...".
Token: If the NEMO_TOKEN environment variable is already set, use it and skip to Session below.
Free token: Read or create a UUID at ~/.config/nemovideo/client_id. Then POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id: <your-uuid>. The response field data.token becomes your NEMO_TOKEN (100 credits, 7-day expiry). If the token has expired, request a new one with the same Client-Id.
Session: POST to the same host at /api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Save session_id from the response.
Confirm to the user you're connected and ready. Don't print tokens or raw JSON.
Most video tools let you cut, trim, or enhance footage — but they can't tell you what's actually in it. Gemini AI changes that. This ClawHub skill brings Google's Gemini multimodal model directly to your video files, giving you a conversational interface to ask questions about your content and receive precise, context-aware answers.
Whether you're reviewing hours of recorded meetings, analyzing training footage, auditing marketing videos, or cataloging archival content, Gemini AI reads your video the way a human expert would — noticing details, understanding context, and surfacing information you'd otherwise have to hunt for manually.
Upload your video, ask a question, and get back a clear, structured response. Want a full summary? A list of topics covered? A description of specific scenes? Gemini AI handles all of it. No manual scrubbing through timelines, no guesswork — just direct, intelligent answers drawn from your actual footage.
Every prompt you send is parsed for intent — whether you're asking for a summary, extracting key moments, analyzing visual content, or querying specific timestamps — and routed to the appropriate Gemini multimodal pipeline automatically.
| User says... | Action | Skip SSE? |
|---|---|---|
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | ✅ |
| "credits" / "积分" / "balance" / "余额" | → §3.3 Credits | ✅ |
| "status" / "状态" / "show tracks" | → §3.4 State | ✅ |
| "upload" / "上传" / user sends file | → §3.2 Upload | ✅ |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | ❌ |
The NemoVideo backend handles secure video ingestion, passes your media and prompt context directly to Gemini's multimodal model, and streams structured insights back in real time. Frame-level grounding, transcript alignment, and token management are all handled server-side so you get clean, accurate responses without managing infrastructure.
Skill attribution — read from this file's YAML frontmatter at runtime:
- X-Skill-Source: gemini-ai
- X-Skill-Version: from frontmatter version
- X-Skill-Platform: detect from install path (~/.clawhub/ → clawhub, ~/.cursor/skills/ → cursor, else unknown)

All requests must include: Authorization: Bearer <NEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.
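A minimal sketch of assembling these headers. The function names are illustrative; the version value would be read from the frontmatter as described above.

```python
def detect_platform(install_path):
    """Map this file's install path to the X-Skill-Platform value."""
    p = str(install_path)
    if "/.clawhub/" in p:
        return "clawhub"
    if "/.cursor/skills/" in p:
        return "cursor"
    return "unknown"

def build_headers(token, skill_version, platform):
    """Attribution headers required on every NemoVideo request."""
    return {
        "Authorization": f"Bearer {token}",
        "X-Skill-Source": "gemini-ai",
        "X-Skill-Version": skill_version,
        "X-Skill-Platform": platform,  # clawhub | cursor | unknown
    }
```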
API base: https://mega-api-prod.nemovideo.ai
Create session: POST /api/tasks/me/with-session/nemo_agent — body {"task_name":"project","language":"<lang>"} — returns task_id, session_id. After creating a session, give the user a link: https://nemovideo.com/workspace/claim?token=&task=<task_id>&session=<session_id>&skill_name=gemini-ai&skill_version=1.0.0&skill_source=<platform>
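The session body and claim link above can be sketched as builder functions (illustrative names; the token query parameter is left empty exactly as written in the spec):

```python
SESSION_PATH = "/api/tasks/me/with-session/nemo_agent"

def build_session_body(language="en"):
    """Request body for session creation."""
    return {"task_name": "project", "language": language}

def build_claim_link(task_id, session_id, platform):
    """Workspace claim link handed to the user after session creation."""
    return (
        "https://nemovideo.com/workspace/claim"
        f"?token=&task={task_id}&session={session_id}"
        "&skill_name=gemini-ai&skill_version=1.0.0"
        f"&skill_source={platform}"
    )
```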
Send message (SSE): POST /run_sse — body {"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}} with Accept: text/event-stream. Max timeout: 15 minutes.
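The SSE message envelope has a fixed shape; a sketch of building it (the request itself would be sent with Accept: text/event-stream, per the line above):

```python
def build_sse_payload(session_id, message):
    """new_message envelope for POST /run_sse."""
    return {
        "app_name": "nemo_agent",
        "user_id": "me",
        "session_id": session_id,
        "new_message": {"parts": [{"text": message}]},
    }
```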
Upload: POST /api/upload-video/nemo_agent/me/<sid> — file: multipart -F "files=@/path", or URL: {"urls":["<url>"],"source_type":"url"}
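For URL-based ingestion, the JSON body is trivial to construct; local files use multipart instead, as shown above. A sketch with illustrative function names:

```python
def upload_endpoint(session_id):
    """Per-session upload path."""
    return f"/api/upload-video/nemo_agent/me/{session_id}"

def build_url_upload_body(urls):
    """JSON body for URL-based ingestion (multipart is used for local files)."""
    return {"urls": list(urls), "source_type": "url"}
```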
Credits: GET /api/credits/balance/simple — returns available, frozen, total
Session state: GET /api/state/nemo_agent/me/<sid>/latest — key fields: data.state.draft, data.state.video_infos, data.state.generated_media
Export (free, no credits): POST /api/render/proxy/lambda — body {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/<id> every 30s until status = completed. Download URL at output.url.
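The export-and-poll workflow can be sketched as follows. The `fetch_status` callable is injected so the loop stays testable; the 30-second cadence and `status`/`output.url` fields come from the spec above, everything else is an assumption.

```python
import time

def build_render_body(session_id, draft, render_id):
    """Body for POST /api/render/proxy/lambda."""
    return {
        "id": render_id,          # e.g. f"render_{int(time.time())}"
        "sessionId": session_id,
        "draft": draft,
        "output": {"format": "mp4", "quality": "high"},
    }

def poll_render(render_id, fetch_status, poll_interval=30, max_polls=30):
    """Poll GET /api/render/proxy/lambda/<id> until status = completed.

    fetch_status(render_id) should return the decoded JSON response.
    """
    for _ in range(max_polls):
        result = fetch_status(render_id)
        if result.get("status") == "completed":
            return result["output"]["url"]
        time.sleep(poll_interval)
    raise TimeoutError(f"render {render_id} did not complete")
```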
Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.
| Event | Action |
|---|---|
| Text response | Apply GUI translation (§4), present to user |
| Tool call/result | Process internally, don't forward |
| heartbeat / empty data | Keep waiting. Every 2 min: "⏳ Still working..." |
| Stream closes | Process final response |
~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.
The backend assumes a GUI exists. Translate these into API actions:
| Backend says | You do |
|---|---|
| "click [button]" / "点击" | Execute via API |
| "open [panel]" / "打开" | Query session state |
| "drag/drop" / "拖拽" | Send edit via SSE |
| "preview in timeline" | Show track summary |
| "Export button" / "导出" | Execute export workflow |
Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.
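Using the abbreviated field names above, a draft can be summarized per track. This is a sketch that assumes the draft is already-decoded JSON and that `d` (duration in ms) appears on each segment; only the key names come from the mapping above.

```python
TRACK_TYPES = {0: "video", 1: "audio", 7: "text"}

def summarize_draft(draft):
    """Return (track_type, segment_count, total_ms) for each track."""
    summary = []
    for track in draft.get("t", []):          # t = tracks
        segments = track.get("sg", [])        # sg = segments
        total_ms = sum(seg.get("d", 0) for seg in segments)  # d = duration(ms)
        label = TRACK_TYPES.get(track.get("tt"), "unknown")  # tt = track type
        summary.append((label, len(segments), total_ms))
    return summary
```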
Timeline (3 tracks):
1. Video: city timelapse (0-10s)
2. BGM: Lo-fi (0-10s, 35%)
3. Title: "Urban Dreams" (0-3s)
| Code | Meaning | Action |
|---|---|---|
| 0 | Success | Continue |
| 1001 | Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days) |
| 1002 | Session not found | New session §3.0 |
| 2001 | No credits | Anonymous: show registration URL with ?bind=<id> (get <id> from create-session or state response when needed). Registered: "Top up at nemovideo.ai" |
| 4001 | Unsupported file | Show supported formats |
| 4002 | File too large | Suggest compress/trim |
| 400 | Missing X-Client-Id | Generate Client-Id and retry (see §1) |
| 402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register at nemovideo.ai to unlock export." |
| 429 | Rate limit (1 token/client/7 days) | Retry in 30s once |
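The recovery rules in the table above can be sketched as a simple dispatch. A simplified illustration only: real handling also needs per-action context (the bind id, the session, the retry timer).

```python
RECOVERY = {
    0: "continue",
    1001: "re-auth",              # anonymous-token, same Client-Id
    1002: "new-session",          # §3.0
    2001: "show-topup",           # registration URL or nemovideo.ai
    4001: "show-formats",
    4002: "suggest-compress",
    400: "retry-with-client-id",  # see §1
    402: "suggest-register",      # subscription tier, not credits
    429: "retry-after-30s",
}

def recovery_action(code):
    """Map an API error code to its recovery action."""
    return RECOVERY.get(code, "unknown")
```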
To get the most accurate and detailed responses from Gemini AI, upload videos with clear audio and stable visuals whenever possible. Shaky or heavily compressed footage can reduce the model's ability to identify fine details in specific frames.
Be specific in your prompts. Instead of asking 'What happens in this video?', try 'What topics are covered in the first five minutes?' or 'List every person who appears on screen and describe what they're doing.' The more targeted your question, the more useful the output.
For long-form content like lectures, interviews, or recorded meetings, consider breaking your requests into segments — ask about the first half separately from the second. This keeps responses focused and easier to act on. Supported formats include mp4, mov, avi, webm, and mkv.
Getting started with Gemini AI on ClawHub takes less than a minute. First, upload your video file in any supported format — mp4, mov, avi, webm, or mkv. Once your file is attached, type your first request directly into the chat. You don't need to pre-configure anything or choose an analysis mode.
Try starting with a broad request like 'Give me a full summary of this video' to get oriented, then follow up with more specific questions based on what Gemini surfaces. The skill supports multi-turn conversations, so you can keep refining your queries without re-uploading the file.
If you're analyzing content for a report or presentation, ask Gemini AI to format its output as a bulleted list or numbered summary — it will structure the response accordingly, saving you time on post-processing.
Gemini AI performs best on videos under 60 minutes long and small enough to upload reliably over your connection. Very large files may take additional processing time before analysis begins; this is normal and does not indicate an error.
The skill excels at understanding spoken dialogue, reading on-screen text, recognizing common objects and environments, and tracking narrative or instructional structure across a video. It is less reliable for highly technical visual content — such as microscopy footage or abstract data visualizations — where domain-specific context may be needed.
For videos with multiple speakers, Gemini AI can often distinguish between speakers based on visual and audio cues, but speaker diarization is approximate rather than guaranteed. Always review AI-generated summaries for accuracy before using them in professional or published contexts.