This skill allows you to analyze image files by providing a prompt and the file path. It uses the gemini-3-flash-preview model for fast and accurate vision analysis.

Workflow for Agents

When you encounter an image file and need to understand its content:

Identify the absolute path of the image file.
Decide what information you need from the image (e.g., general description, text extraction, UI component identification).
Execute the describe.py script.

Example Usage

python3 scripts/describe.py --prompt "Extract all text from this image." --image "screenshot.png"

Specific Use Cases

OCR: python3 scripts/describe.py --prompt "Extract all text from this image." --image "screenshot.png"
UI Analysis: python3 scripts/describe.py --prompt "Identify the main UI components and their layout." --image "mockup.jpg"

This skill allows you to analyze image files by providing a prompt and the file path. It uses the gemini-3-flash-preview model for fast and accurate vision analysis.

Workflow for Agents

When you encounter an image file and need to understand its content:

Identify the absolute path of the image file.
Decide what information you need from the image (e.g., general description, text extraction, UI component identification).
Execute the describe.py script.

Example Usage

python3 scripts/describe.py --prompt "Extract all text from this image." --image "screenshot.png"

Specific Use Cases

OCR: python3 scripts/describe.py --prompt "Extract all text from this image." --image "screenshot.png"
UI Analysis: python3 scripts/describe.py --prompt "Identify the main UI components and their layout." --image "mockup.jpg"

Image Describer

Workflow for Agents

Example Usage

Specific Use Cases

Image Describer

Workflow for Agents

Example Usage

Specific Use Cases

Resource Details

Songsee

Video Frames

Gifgrep

Qqbot Media

Camsnap

Openai Whisper Api