Generate image generation model prompts for blog post illustrations in a colorful cartoon infographic style with cute character metaphors and pastel color-coded zones. Use this skill whenever the user wants to create a visual or illustration for a blog post or article — system architectures, workflows, concept explanations, comparisons, or any abstract idea that needs a playful visual. Trigger on keywords like "画图", "插图", "图片提示词", "配图", "illustration", "infographic", "image prompt", or when the user shares article content and asks for an accompanying image or diagram to be generated by an image model.
Generate prompts for image generation models (Gemini, Midjourney, DALL-E, etc.) that produce colorful cartoon-style infographic illustrations for blog posts.
The output is a text prompt — not an image. The user takes the prompt to their preferred image generation model.
The prompt itself must be written in English, regardless of the conversation language. Image models perform best with English prompts. Only use Chinese in the prompt when the user explicitly requests Chinese labels in the image.
Every illustration shares these visual traits. This is the non-negotiable foundation that keeps illustrations consistent across different blog posts.
Read the text that needs illustration. Identify:
This is the most important step. Each abstract component needs a concrete visual form that hints at its function.
Principles:
Reference examples:
| Function | Weak | Strong |
|---|---|---|
| Links/connects items | Robot with wires | Spider weaving silk between cards |
| Audits/cleans/maintains | Robot with magnifying glass | Gardener pruning dead branches |
| Generates profile from behavioral data | Brain with arrows | Painter creating portrait from scattered fragments |
| Equal partnership | Two robots | Two silhouettes back-to-back, one human-shaped, one geometric |
| Free exploration with occasional output | Floating robot | Firefly drifting lazily, glowing when it finds something |
| Filters or guards | Shield icon | Cat sitting on a fence, letting some things pass |
| Schedules or orchestrates | Clock icon | Conductor with a baton, cueing different performers |
Choose a layout pattern based on the relationship structure:
If an element deliberately breaks the pattern (e.g., something autonomous that doesn't fit the main structure), position it outside the organized zones — floating, slightly translucent, with dashed connections. This visual separation communicates "this one is different" without explanation.
Before writing the prompt, list all text labels that should appear in the image. Default to English for everything — image models render English reliably.
Keep labels short — 2-4 words per label. Only use Chinese labels when the user explicitly asks for them.
Structure the prompt as:
Keep the prompt 200-400 words. Image models perform worse with extremely long prompts — be specific about what matters, brief about the rest.