Describe or analyze image content using Gemini 3 Flash. Use this skill when you need to understand what is in an image or pdf file (e.g., icons, screenshots, diagrams) to proceed with a task.
This skill allows you to analyze image files by providing a prompt and the file path. It uses the gemini-3-flash-preview model for fast and accurate vision analysis.
When you encounter an image file and need to understand its content:
describe.py script.python3 scripts/describe.py --prompt "Extract all text from this image." --image "screenshot.png"
python3 scripts/describe.py --prompt "Extract all text from this image." --image "screenshot.png"python3 scripts/describe.py --prompt "Identify the main UI components and their layout." --image "mockup.jpg"python3 scripts/describe.py --prompt "What does this icon represent?" --image "icon.svg"scripts/describe.py: Vertex AI SDK (google-genai) を使用して、画像や PDF を解析する Python スクリプト。