Analyze videos with Google Gemini multimodal AI. Download from any URL (Loom, YouTube, TikTok, Vimeo, Twitter/X, Instagram, 1000+ sites) and get transcripts, descriptions, and answers to questions. Use when asked to watch, analyze, summarize, or transcribe a video, or answer questions about video content. Triggers on video URLs or requests involving video understanding.
Analyze videos using Google Gemini's multimodal video understanding. Supports 1000+ video sources via yt-dlp.
yt-dlp — brew install yt-dlp / pip install yt-dlpffmpeg — brew install ffmpeg (for merging video+audio streams)GEMINI_API_KEY environment variableReturns structured JSON:
[MM:SS] timestampsuv run {baseDir}/scripts/analyze_video.py "<video-url>"
uv run {baseDir}/scripts/analyze_video.py "<video-url>" -q "What product is shown?"
uv run {baseDir}/scripts/analyze_video.py "<video-url>" -p "Custom prompt" --raw
uv run {baseDir}/scripts/analyze_video.py "<video-url>" --download-only -o video.mp4
| Flag | Description | Default |
|---|---|---|
-q / --question | Question to answer (added to default fields) | none |
-p / --prompt | Override entire prompt (ignores -q) | structured JSON |
-m / --model | Gemini model | gemini-2.5-flash |
-o / --output | Save output to file | stdout |
--keep | Keep downloaded video file | false |
--download-only | Download only, skip analysis | false |
--max-size | Max file size in MB | 500 |
--raw | Raw text output instead of JSON | false |
Any URL supported by yt-dlp: Loom, YouTube, TikTok, Vimeo, Twitter/X, Instagram, Dailymotion, Twitch, and 1000+ more.
-q for targeted questions on top of the full analysisuv