AI Image Generation (Gemini)

Generate image variations using Google's Gemini image generation model with reference images for style and character consistency. The model supports up to 14 reference images per request and can maintain consistency across multiple characters.

Prerequisites

GEMINI_API_KEY environment variable must be set
- Get a key at https://aistudio.google.com/apikey
- The key needs billing enabled for image generation (~$0.067/image at 1K resolution)
Deno runtime installed (for the generation script)

Workflow

Step 1 — Understand what the user wants

Clarify the subject, pose, expression, context, and where the asset will be used (app screen, social media, website, etc.). This context helps craft the right prompt and choose the right aspect ratio.

AI Image Generation (Gemini)

Prerequisites

GEMINI_API_KEY environment variable must be set
- Get a key at https://aistudio.google.com/apikey
- The key needs billing enabled for image generation (~$0.067/image at 1K resolution)
Deno runtime installed (for the generation script)

Flag	Default	Options
`--variants`	4	1-8 (each is a separate API call)
`--aspect`	1:1	1:1, 3:4, 4:3, 9:16, 16:9, 2:3, 3:2
`--size`	1K	512, 1K, 2K, 4K

Use Case	Aspect Ratio
Full-body character poses	`3:4`
App icons, avatars, social profiles	`1:1`
Mobile screens, in-app cards	`9:16` or `3:4`
Banner/header images, OG images	`16:9` or `4:3`
Bust/upper-body portraits	`1:1` or `4:3`

Image Gen

AI Image Generation (Gemini)

Prerequisites

Workflow

Step 1 — Understand what the user wants

Image Gen

AI Image Generation (Gemini)

Prerequisites

Workflow

Step 1 — Understand what the user wants

Step 2 — Select reference images

Step 3 — Craft the prompt

Step 4 — Generate variations

Step 5 — Pick the best variant

Step 6 — Post-process

Rate Limits

Troubleshooting

Frontend Slides

Frontend Slides

Frontend Slides

Ascii Art

Popular Web Designs

Meme Generation