A skill that extracts keyframes from video files and analyzes their content. It automatically removes duplicate frames and optimizes image quality to reduce token consumption.

Use when:
- The user provides a video file (.mp4, .mov, .avi, etc.)
- The user asks to "watch this video", "analyze this video", or "what's in this video"
- Checking screen recordings or screencasts
- Keyframe extraction is needed from a video
Workflow: extract keyframes from the video, present the token cost to the user for approval, then analyze.
Windows: ffmpeg must be available in a conda environment named `media tools`.
Linux/Mac: ffmpeg must be on PATH; Python dependencies are installed into a venv.
Clearly understand why the user wants the video analyzed; this intent becomes important context for the analysis.
Detect the platform and run the appropriate setup:
Windows — conda media tools env:
```bash
conda install -n "media tools" pillow numpy --quiet -y
```
Linux/Mac — venv:
```bash
cd {baseDir}/scripts
python3 -m venv venv
source venv/bin/activate
pip install Pillow numpy --quiet
```
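Before running extraction, it can help to verify the toolchain is actually reachable. A minimal sketch (the helper name and the checked module list are illustrative, not part of the skill's scripts):

```python
import importlib.util
import shutil

def check_dependencies(binaries=("ffmpeg",), modules=("PIL", "numpy")):
    """Return the names of any missing binaries or Python modules."""
    missing = [b for b in binaries if shutil.which(b) is None]
    missing += [m for m in modules if importlib.util.find_spec(m) is None]
    return missing

# An empty list means the environment is ready for extract_keyframes.py.
```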
Windows (conda):
```bash
conda run -n "media tools" python "{baseDir}/scripts/extract_keyframes.py" "<video_path>" --method scene --ensure-last
```
Linux/Mac (venv):
```bash
source {baseDir}/scripts/venv/bin/activate
# Fast default (scene detection) with optional last-frame inclusion
python3 {baseDir}/scripts/extract_keyframes.py "<video_path>" --method scene --ensure-last
```
Output example (JSON):
```json
{
  "keyframe_count": 52,
  "image_size": "266x576",
  "total_tokens": 10400,
  "cost_usd_opus": 0.156,
  "cost_usd_sonnet": 0.031,
  "cost_usd_haiku": 0.0104,
  "files": ["/.../key_0001.jpg", ...]
}
```
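The report can be turned into a short approval prompt for the user before any frames are analyzed. A sketch that parses only the fields shown in the example output above (field names are taken from that example):

```python
import json

def summarize_report(raw):
    """Build a one-line cost summary from the extractor's JSON report."""
    r = json.loads(raw)
    per_frame = r["total_tokens"] // r["keyframe_count"]
    return (f"{r['keyframe_count']} keyframes at {r['image_size']} "
            f"(~{per_frame} tokens/frame, {r['total_tokens']} total, "
            f"~${r['cost_usd_sonnet']} on Sonnet). Proceed?")

example = ('{"keyframe_count": 52, "image_size": "266x576", '
           '"total_tokens": 10400, "cost_usd_sonnet": 0.031}')
```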
After user approval, split the frame list into 4 sequential chunks and spawn all 4 Task subagents in a single message so they run in parallel:
```python
# Split the files array into 4 equal chunks, then call all 4 Tasks in one message:
Task(subagent_type="general-purpose", model="haiku", description="Frames 1/4",
     prompt="""
User intent: {Intent from Step 1}
Analyze these sequential video frames in order and summarize key actions,
screens, and anything relevant to the user's intent:
{chunk 1 paths, one per line}
""")
Task(subagent_type="general-purpose", model="haiku", description="Frames 2/4",
     prompt="""
User intent: {Intent from Step 1}
Analyze these sequential video frames in order and summarize key actions,
screens, and anything relevant to the user's intent:
{chunk 2 paths, one per line}
""")
Task(subagent_type="general-purpose", model="haiku", description="Frames 3/4",
     prompt="""...""")
Task(subagent_type="general-purpose", model="haiku", description="Frames 4/4",
     prompt="""...""")
```
Once all 4 subagents return, merge their summaries into a final report.
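The chunking step above can be sketched as a small helper (the name is illustrative); it keeps frames in order and spreads any remainder across the leading chunks:

```python
def split_chunks(files, n=4):
    """Split a file list into n near-equal, order-preserving chunks."""
    size, rem = divmod(len(files), n)
    chunks, start = [], 0
    for i in range(n):
        # The first `rem` chunks absorb one extra item each.
        end = start + size + (1 if i < rem else 0)
        if end > start:
            chunks.append(files[start:end])
        start = end
    return chunks
```

Each resulting chunk becomes the path list for one Task subagent prompt.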
extract_keyframes.py options:
| Option | Default | Description |
|---|---|---|
| -m, --method | scene | Extraction method: scene (fast) or similarity (slow) |
| -t, --threshold | 0.3 | Scene threshold for the scene method (lower = more frames kept) |
| -q, --quality | 30 | JPEG quality (1-100) |
| -s, --scale | 0.3 | Resize scale |
| -o, --output | <video_name>_keyframes/ | Output directory |
| -w, --workers | CPU-1 | Parallel compression workers (similarity method only) |
| --ensure-last | off | Include the last frame (scene method only) |
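To see how `-s` and the source resolution drive token cost when choosing flags, a rough estimate can help. This sketch assumes the commonly cited ~(width × height) / 750 rule of thumb for image token counts, so treat it as an approximation rather than a billing guarantee:

```python
def estimate_frame_tokens(width, height):
    """Approximate tokens for one image via the (w * h) / 750 heuristic."""
    return (width * height) // 750

def tokens_after_scale(src_w, src_h, scale):
    # -s/--scale shrinks both dimensions before JPEG encoding.
    return estimate_frame_tokens(int(src_w * scale), int(src_h * scale))
```

For example, the 266x576 frames from the earlier report work out to roughly 200 tokens each under this heuristic, matching the reported totals.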
Windows (conda):
```bash
# More aggressive reduction
conda run -n "media tools" python "{baseDir}/scripts/extract_keyframes.py" video.mp4 --method scene -t 0.2 -q 20 -s 0.2 --ensure-last
# Similarity method
conda run -n "media tools" python "{baseDir}/scripts/extract_keyframes.py" video.mp4 --method similarity -t 0.85 -q 30 -s 0.3 -w 6
```
Linux/Mac (venv):
```bash
# More aggressive reduction (lower threshold, quality, and size)
python3 {baseDir}/scripts/extract_keyframes.py video.mp4 --method scene -t 0.2 -q 20 -s 0.2 --ensure-last
# Similarity method (slower, more precise)
python3 {baseDir}/scripts/extract_keyframes.py video.mp4 --method similarity -t 0.85 -q 30 -s 0.3 -w 6
```