Generate images using the Jimeng API based on text prompts. Use this skill when users request AI-generated images from the Jimeng (即梦AI) service, artwork, illustrations, or visual content creation. Supports text-to-image and image-to-image generation with customizable ratios and resolutions.
This skill enables image generation using a locally deployed Jimeng API service (Docker). It converts text prompts into high-quality images and automatically downloads them to the project's /pic folder. The skill supports text-to-image generation, image-to-image composition, customizable aspect ratios (1:1, 16:9, etc.), and multiple resolution levels (1k, 2k, 4k).
API Endpoint: http://localhost:5100
Use this skill when users request:
IMPORTANT: The Jimeng API must be running locally via Docker before using this skill.
Region-specific prefixes:
your_session_id)us- prefix (e.g., us-your_session_id)hk- prefix (e.g., hk-your_session_id)jp- prefix (e.g., jp-your_session_id)sg- prefix (e.g., sg-your_session_id)⚠️ nanobanana Model Resolution Rules:
ratio and resolution parametersratio parameters (e.g., 16:9, 4:3)Always ask the user for their Session ID before proceeding, as the skill does not include a pre-configured credential.
Example prompt to user:
"要使用即梦API生成图片,我需要您的Session ID。您可以从即梦网站(jimeng.jianying.com)的浏览器Cookie中获取 sessionid。
如果使用国际站,请在sessionid前添加对应前缀(us-/hk-/jp-/sg-)。
请提供您的 Session ID。"
⚠️⚠️ IMPORTANT PARAMETER DISCIPLINE
Rationale: Tools may “helpfully” add options (e.g., --ratio 16:9) that the user didn’t request, overriding script defaults. This is prohibited. Pass only the parameters the user asked for; otherwise, rely on defaults.
generate_image.py script — REMINDER: only pass parameters explicitly requested by the user; do not add/guess any optional flagsRead, view_image). STOP AFTER SAVING.Generate images from text descriptions.
Minimal default usage (no optional params):
python scripts/generate_image.py text \
"a cute cat" \
--session-id "YOUR_SESSION_ID"
Only include optional parameters when the user explicitly requests them.
With user-specified parameters (only when requested):
python scripts/generate_image.py text \
"futuristic city at sunset with flying cars" \
--session-id "YOUR_SESSION_ID" \
--model "jimeng-4.0" \
--ratio "16:9" \
--resolution "2k"
Parameters:
prompt (required): Text description of the desired image--session-id: Jimeng session ID (required)--model: Model to use (default: jimeng-4.0)
jimeng-5.0, jimeng-4.6, jimeng-4.5, jimeng-4.1, jimeng-4.0, jimeng-3.1, jimeng-3.0, nanobanana (international only)--ratio: Aspect ratio (default: 1:1)
1:1, 4:3, 3:4, 16:9, 9:16, 3:2, 2:3, 21:9--resolution: Resolution level (default: 2k)
1k, 2k, 4k--intelligent-ratio: Enable smart ratio detection based on prompt keywords ⚠️ Only works for jimeng-4.0/jimeng-4.1/jimeng-4.5 models; other models will ignore this parameter--negative-prompt: Negative prompt (elements to avoid)--sample-strength: Sampling strength (0.0-1.0)--api-url: Custom API URL (default: http://localhost:5100)--output-dir: Custom output directory (defaults to project_root/pic)Transform or compose images based on text guidance.
Example user request:
"把这张照片转换成油画风格,色彩鲜艳,笔触明显"
Script usage:
# Using local file
python scripts/generate_image.py image \
"transform to oil painting style, vivid colors, visible brushstrokes" \
--session-id "YOUR_SESSION_ID" \
--images "/path/to/image.jpg"
# Using image URL
python scripts/generate_image.py image \
"anime style, cute cat" \
--session-id "YOUR_SESSION_ID" \
--images "https://example.com/cat.jpg"
# Multiple images (up to 10)
python scripts/generate_image.py image \
"merge these images into a cohesive scene" \
--session-id "YOUR_SESSION_ID" \
--images "image1.jpg" "image2.png" "image3.jpg"
Parameters:
--images: One or more image paths or URLs (1-10 images)Supported formats: JPG, PNG, WebP Size limit: Recommended <10MB per image
⚠️ IMPORTANT: This feature only works with the jimeng-4.0, jimeng-4.1, and jimeng-4.5 models. Other models (jimeng-3.0, nanobanana, etc.) will ignore the --intelligent-ratio flag.
Use --intelligent-ratio to automatically select the best aspect ratio based on prompt keywords.
Example:
python scripts/generate_image.py text \
"奔跑的狮子,竖屏" \
--session-id "YOUR_SESSION_ID" \
--model "jimeng-4.0" \
--intelligent-ratio
| Resolution | Ratio | Dimensions |
|---|---|---|
| 1k | 1:1 | 1024×1024 |
| 4:3 | 768×1024 | |
| 3:4 | 1024×768 | |
| 16:9 | 1024×576 | |
| 9:16 | 576×1024 | |
| 3:2 | 1024×682 | |
| 2:3 | 682×1024 | |
| 21:9 | 1195×512 | |
| 2k (default) | 1:1 | 2048×2048 |
| 16:9 | 2560×1440 | |
| 4:3 | 2304×1728 | |
| 4k | 1:1 | 4096×4096 |
| 16:9 | 5120×2880 | |
| 21:9 | 6048×2592 |
scripts/generate_image.py
.git, .claude, etc.)/pic folder if it doesn't existjimeng_YYYYMMDD_HHMMSS_N.png){project_root}/pic/jimeng_{timestamp}_{index}.pngThe script requires:
pip install requests Pillow
Note: Pillow is required for WebP to PNG conversion. If not installed, WebP images will be saved as-is.
User requests image generation
↓
Is Jimeng API running at localhost:5100?
├─ No → Instruct user to start Docker service
└─ Yes → Continue
↓
Do we have Session ID?
├─ No → Request Session ID from user → Store for session
└─ Yes → Continue
↓
Text-to-Image or Image-to-Image?
├─ Text-to-Image
│ └─ Run: generate_image.py text "prompt" --session-id ID (add --ratio/--resolution/--model ONLY if user explicitly requests)
└─ Image-to-Image
└─ Run: generate_image.py image "prompt" --session-id ID --images PATH1 [PATH2...]
↓
Script executes:
1. Calls Jimeng API (文生图 or 图生图)
2. Receives image URLs
3. Downloads all images to /pic folder
4. Reports file paths
↓
Inform user of results
├─ Success → Show file paths only
└─ Failure → Report error, suggest troubleshooting
↓
HARD STOP — DO NOT READ/OPEN/ANALYZE IMAGES; DO NOT CALL `Read`/`view_image`; TASK COMPLETE
"Session ID required"
"Invalid session or authentication failed"
"Error downloading image"
"Model not supported"
nanobanana only works with international sites (us-/hk-/jp-/sg- prefix)jimeng-3.1 only works with domestic sites"nanobanana resolution mismatch"
ratio and resolution parameters are ignoredjimeng-4.0 model instead"intelligent_ratio not working"
--intelligent-ratio flag only works with jimeng-4.0, jimeng-4.1, and jimeng-4.5 modelsjimeng-4.0, jimeng-4.1, or jimeng-4.5 if you need intelligent ratio detectionRead/view_image; STOP after savingExample 1: Simple text-to-image
User: "用即梦生成一张日落山景图"
Claude: "我来帮您使用即梦API生成图片。首先需要确认您的Session ID..."
[User provides sessionid]
Claude: [Executes generate_image.py text mode]
Claude: "✅ 成功生成4张图片!
- /project/pic/jimeng_20250116_153022_1.png
- /project/pic/jimeng_20250116_153022_2.png
- /project/pic/jimeng_20250116_153022_3.png
- /project/pic/jimeng_20250116_153022_4.png"
Example 2: High-resolution with custom ratio
User: "生成4K分辨率的21:9超宽屏壮丽山水风景"
Claude: [Executes with --resolution "4k" --ratio "21:9"]
Claude: "✅ 已生成4K超宽屏图片(6048×2592):
- 已保存到 /project/pic/jimeng_20250116_154530_1.png 等4张图片"
Example 3: Image-to-image transformation
User: "把这张猫咪照片转成动漫风格"
Claude: "请提供猫咪照片的路径或URL"
[User provides image path]
Claude: [Executes image mode with --images]
Claude: "✅ 图片风格转换完成,生成了4个动漫风格版本:
- /project/pic/jimeng_20250116_160015_1.png
- ..."
Example 4: Intelligent ratio
User: "生成一张竖屏手机壁纸,星空主题"
Claude: [Executes with --intelligent-ratio, auto-detects 9:16]
Claude: "✅ 已根据'竖屏'关键词自动选择9:16比例,生成星空壁纸4张"
The Jimeng API returns image variations per request:
{
"created": 1763260188,
"data": [
{"url": "https://p3-dreamina-sign.byteimg.com/...image1.png"},
{"url": "https://p26-dreamina-sign.byteimg.com/...image2.png"},
{"url": "https://p26-dreamina-sign.byteimg.com/...image3.png"},
{"url": "https://p3-dreamina-sign.byteimg.com/...image4.png"}
]
}
All images are automatically downloaded and saved with sequential numbering.