Reasoning-driven image generation using structured creative briefs (Gemini 3 style) — generates high-fidelity images via muapi.ai with logic-based prompting
A specialized skill for AI Agents to leverage "Reasoning-Driven" image generation. Based on the advanced prompting architecture of Google's Gemini 3 (Nano Banana Pro), this skill moves beyond keyword stuffing to structured, logic-based creative briefs.
Subject + Action + Context + Composition + Lighting.| Component | Description | Example |
|---|---|---|
| Subject | Detailed entity description | "A stoic robot barista with exposed copper wiring" |
| Action | Dynamic interaction | "Pouring a latte art leaf with mechanical precision" |
| Context | Environment & Atmosphere | "Inside a neon-lit cyberpunk cafe at midnight" |
| Composition | Camera & Lens choice | "Close-up, 85mm lens, f/1.8 aperture" |
| Lighting | Mood & Direction | "Volumetric blue rim light, warm cafe glow" |
| Style | Aesthetic anchor | "Cinematic, photorealistic, 4K production value" |
The sign reads "OPEN 24/7".Before calling the script, the Agent MUST rewrite the user's prompt into a logic-driven Reasoning Brief:
featuring a sign that says "STORE NAME" in a weathered serif font.Provide the agent with a subject and a specific scenario.
The generate-nano-art.sh script translates the logic into a structured Gemini 3-style prompt.
# Generating a reasoning-driven image
bash scripts/generate-nano-art.sh \
--subject "a glass chess piece" \
--action "shattering into liquid shards" \
--context "on a obsidian table" \
--style "macro photography"
This skill applies a "Logic Wrapper" around the core/media/generate-image.sh primitive, converting fragmented inputs into a coherent, reasoning-ready narrative prompt.