Name: Auto Image Generation
Author: brudex

Auto Image Generation | Skills Pool

Context	Where to save the primary raster	Notes
Social campaign slot	`workspace/drafts/social/<campaign>/posts/<post-id>/post-image.png` (or `.jpg`)	Still write full brief + `generated-` under `workspace/drafts/images/<date>-<slug>/`, then copy or symlink* the chosen asset to `post-image.png`. Add `posts/<post-id>/image-alt.txt` (one line, ≤125 chars) for accessibility / HypeEngine.
LinkedIn long-form article	`workspace/drafts/linkedin/<date>-<slug>/article-hero.png` (or `.jpg`)	Hero for the article body + intern handoff; reference in `article.md` (e.g. `![Hero](article-hero.png)` after title) and in `README-handoff.md`.
Teaser row (same article)	Reuse `article-hero.png` for the feed bundle when the calendar row points at the LinkedIn folder—or generate `teaser-image.png` in that folder if the teaser needs a distinct crop.

Method	When to use
Post-process composite (default when PNG exists)	Generate scene with reserved empty top-left (no brand lettering—see step 4 under Logo placement), then overlay `logo-mark.png` / `logo-primary.png` with ImageMagick, sharp, etc. Log `COMPOSITED_LOGO` in `gemini-render.md`.
Reference image (single-pass)	Pass *`logo-.png` as multimodal `inline_data`*. Instruction: “Only the attached image is the mark—top-left per logo-usage.md; do not redraw; do not add typographic wordmark.”*
Text-only fallback	Only if no `logo-.png` exists and* `logo-usage.md` explicitly allows emergency typeset—otherwise stop or add PNGs first.
Never	No logo-from-memory; no wrong icon set; no “logo” that is only styled text of the company name when PNG assets exist; no `media.licdn.com` URLs in the API—save as `logo-mark.png` locally.

Asset source: Use logo-primary.png or logo-mark.png from brand-images/ (or BRAND_IMAGES_DIR). Files must be true RGBA PNGs—alpha channel outside the mark, no baked-in square panel of “lighter navy” (that is a bad export, not the hero). Do not paste remote URLs into prompts—download, re-export with transparency if you see a box in previews, then save. Preflight: Open the PNG on a checkerboard background; if a solid rectangle remains, fix the asset before any composite.
Placement spec (matches default logo-usage.md): top-left, inset ~2.5–4% of canvas from top and left; logo width ~8–14% of full image width (primary) or ~6–10% (mark). Keep clear space; main illustration (hand/phone/background figure) must not overlap the logo box.
Multimodal single-pass: Alongside prompt-master.txt, pass the logo PNG as reference. Instruction: “Attached image is the only authorized mark—place in top-left; preserve alpha / transparency; do not redraw; do not put a square panel, matte, or second background behind it; do not add typographic wordmark.”
Prompt reserve + composite (recommended default): In prompt-master.txt, include “Top-left: empty reserved band (same navy as background), no brand text, no fake logo lettering, no contrasting square behind where a logo will sit.” Overlay the mark with alpha-respecting compositing (e.g. ImageMagick compose over / sharp composite with input alpha—not flattening to JPEG first). Use this when multimodal still returns text-as-logo or paints a logo matte box.
Prompt-only typeset — last resort per table above when no PNG exists.
safe-zones.md: Record logo lock rectangle (inset, max width) for composite math and LinkedIn crops.
Post-render logo QA: If the top-left shows only styled text spelling the product name (not your file-based mark), treat as failed—run composite path and replace post-image.png / article-hero.png; log FIXED_TEXT_AS_LOGO. If a visible rectangle or wrong-tone slab sits behind the mark (model-added or opaque pixels in logo-*.png), fix the source PNG alpha and re-composite; log FIXED_LOGO_MATTE.

STYLE LOCK — NON-NEGOTIABLE: Flat 2D vector illustration only (clean corporate editorial / app-marketing vector), like Figma or Illustrator flats — NOT anime, NOT manga, NOT cinematic digital painting, NOT semi-realistic character art, NOT 3D render. NO holographic or floating sci-fi UI, NO cyberpunk, NO server room or data center, NO futuristic city skyline, NO cyan/teal neon glow as the dominant look, NO “tech command center” or tactical jumpsuit characters. NO large hero faces in painterly style. REQUIRED COMPOSITION: **hand holding smartphone** in foreground with simple quiz/lesson UI. **Mid/background:** a **second figure actively using learning tech**—**dynamic pose** (standing, seated at desk, walking with phone, subtle “yes!” / progress gesture, café perch, **or** relaxed lounge—**vary across posts; do NOT default to “always on a sofa”**); same quiz/lesson theme on **phone or tablet** in that figure’s hands. Small floating minimalist line icons (books, brain-in-lightbulb, checkmark, graduation cap) plus 1–2 topic icons from the post copy. REQUIRED PALETTE unless brand kit overrides: deep navy background, bright yellow accents, white linework. The ONLY “tech” is the phone/tablet screens — flat UI mockups, not glowing Blade Runner panels. **BRAND CORNER:** Do **not** draw the product name as typographic “logo” text in the top-left—**leave a clean navy reserve** for the **real `logo-mark.png` / `logo-primary.png`** (post-composite) or use the **attached logo image** only; **no fake wordmark lettering**; **no** lighter **square**, **plate**, or **panel** behind the mark—the logo must float on **transparent** pixels or true hero background only.

Source	Minimum extraction
Social `post-body.md`	First hook line + must-win message + any named topic, course, or outcome in the `## Publish-ready` block (or equivalent).
LinkedIn `article.md`	Title + TL;DR or lede + first H2 theme (or strongest concrete example in the opening sections).

Workspace context: USER.md, SOUL.md; brand-images/ per Brand kit section (env BRAND_IMAGES_DIR or workspace/brand-images/).

Output root:

workspace/drafts/images/<YYYY-MM-DD>-<slug>/

Brief: use case (feed post, story, Meta ad 1:1, display 1200×628, YT thumbnail), must-win message, legal (no competitor logos, no fake badges).
Tools: Gemini produces actual image files in this setup. Order: try OpenClaw image_generate (or other host image tool) first if available; else generateContent with x-goog-api-key — see workspace/INTEGRATIONS.md.

Use case → aspect ratio matrix

Map explicitly:

Placement	Ratio	Min resolution (guide)
IG feed	4:5 or 1:1	1080 wide min
Stories/Reels cover	9:16	1080×1920
Meta feed ad	1:1 / 4:5	per Ads Manager
YT thumbnail	16:9	1280×720 min
LinkedIn link post	1.91:1	1200×627 typical

Concept sheet (concept.md)
- 1–2 sentences creative idea + audience + emotion (trust, urgency, curiosity).
Master prompt (prompt-master.txt)
- Single detailed prompt: Style benchmark (this skill) + subject tied to post/article copy + composition, lighting, palette (brand kit where present). Default style = flat 2D vector per benchmark unless campaign visual-dna.md overrides.
Negative prompt (negative-prompt.txt)
- Always include: watermark, lowres, blurry, extra fingers, competitor logos, fake App Store badge, gore, photorealistic named celebrity (unless rights cleared).
- Always include the Negative prompt hints (benchmark-specific) list from Style benchmark (anime, holographic UI, server room, cyberpunk, etc.) unless visual-dna.md explicitly defines a different illustration mode.
- Add use-case negatives (e.g. “cluttered UI” for app mockups).
Text-on-image policy (text-overlay.md)
- If text required: ≤5 words for thumb/story; font style note; contrast (WCAG-style: light on dark band).
- If no text: state “clean image; caption carries copy.”
Safe zones (safe-zones.md)
- Reserve top-left for logo-primary / logo-mark per Logo placement — top-left (inset + max width); for 9:16 add top/bottom UI overlay avoidance; for YT thumb: right third often occluded by timestamp—keep face/keyword left.
A/B variants (variants.md)
- Table: Variant | What changed | Hypothesis | Prompt delta summary.
- Minimum 2 variants for ads; 3 for social tests when requested.
Brand compliance block
- In README-handoff.md: checklist against USER.md (colors, banned motifs, disclosure if sponsored creative).
Optional brief.json
- Keys: aspect_ratio, width, height, prompt, negative_prompt, variants[].
Optional pixel pass (gemini-render.md)
- If keys exist: document model id, request timestamp, output filenames, and any safety filter or refusal in gemini-render.md next to the images.
Scheduling
- Campaign assets: date-prefix folders; retain previous days for audit.

Auto Image Generation

When Gemini is available (this workspace — default)

Auto Image Generation

When Gemini is available (this workspace — default)

End-to-end flow (this workspace)

Brand kit (`brand-images/`) — QuizFactor logo & product look

Where the folder lives (resolve in this order)

How the QuizFactor / product logo gets into the image (not “magic prompt only”)

Logo placement — top-left (default for feed + article heroes)

Merging post copy + product truth + brand kit

Optional host script

Style benchmark (flat vector educational)

STYLE LOCK — mandatory first lines (anti-drift)

Canonical visual system (structure & style)

Background figure — dynamic variants (not sofa-only)

Content from copy (mandatory)

Brand kit precedence (vs. benchmark defaults)

Negative prompt hints (benchmark-specific)

Post-render visual QA (one retry)

Aligning images with copy (style & quality)

Prerequisites

Credentials & API (qf-style)

High-level Workflow

Outputs (required)

Agent Checklist

Article Writing

Article Writing

Content Engine

Brand Voice

Article Writing

Article Writing

Auto Image Generation

When Gemini is available (this workspace — default)

Auto Image Generation

When Gemini is available (this workspace — default)

Chaining from social posts & LinkedIn articles (canonical paths)

End-to-end flow (this workspace)

Brand kit (brand-images/) — QuizFactor logo & product look

Where the folder lives (resolve in this order)

How the QuizFactor / product logo gets into the image (not “magic prompt only”)

Logo placement — top-left (default for feed + article heroes)

Merging post copy + product truth + brand kit

Optional host script

Style benchmark (flat vector educational)

STYLE LOCK — mandatory first lines (anti-drift)

Canonical visual system (structure & style)

Background figure — dynamic variants (not sofa-only)

Content from copy (mandatory)

Brand kit precedence (vs. benchmark defaults)

Negative prompt hints (benchmark-specific)

Post-render visual QA (one retry)

Aligning images with copy (style & quality)

Prerequisites

Credentials & API (qf-style)

High-level Workflow

Outputs (required)

Agent Checklist

Article Writing

Article Writing

Content Engine

Brand Voice

Article Writing

Article Writing

Brand kit (`brand-images/`) — QuizFactor logo & product look