Skill-Datei

Doubao Image & Video Generation

Name: Doubao Image & Video Generation
Author: ziyu

Generate images and videos using Volcengine Doubao (豆包) AI models — Seedream for images and Seedance for videos. Use this skill whenever the user asks to generate, create, or produce images or videos with Doubao/豆包, Seedream, Seedance, or Volcengine/火山引擎. Also use when the user needs AI-generated game sprites, pixel art, concept art, character designs, scene illustrations, or any visual asset generation where Doubao is the preferred provider. Triggers on: '生成图片', '生成视频', 'generate image', 'generate video', 'create artwork', 'make a sprite', 'doubao', '豆包', 'seedream', 'seedance', or any request involving AI image/video generation that should use the Volcengine platform.

ziyu0 Sterne14.03.2026

Beruf
Kategorien: Design

Skill-Inhalt

Generate images with Seedream and videos with Seedance via the Volcengine ARK API.

Quick Reference

Capability	Model	Script
Text-to-Image	Seedream 4.0 / 4.5 / 5.0	`scripts/generate_image.py`
Image-to-Image	Seedream (with reference images)	`scripts/generate_image.py`
Transparent PNG	Seedream + rembg post-processing	`scripts/generate_image.py --remove-bg`
Text-to-Video	Seedance 1.5 Pro / 1.0 variants	`scripts/generate_video.py`
Image-to-Video	Seedance (with first frame)	`scripts/generate_video.py`
Check Video Status	—	`scripts/get_video_task_status.py`

Configuration

Verwandte Skills

Doubao Image & Video Generation | Skills Pool

uv run <skill-path>/scripts/generate_image.py \
  --prompt "a cute orange cat sitting on a wooden barrel, pixel art style" \
  --output ./output.png

Parameter	Flag	Default	Description
Prompt	`--prompt`	(required)	Image description. English or Chinese both work well.
Output	`--output`	`./generated_image.png`	Where to save the image
Model version	`--version`	`4.5`	Seedream version: `4.0`, `4.5`, or `5.0`
Size	`--size`	`2K`	Resolution: see table below
Reference images	`--image`	(none)	URL(s) for img2img / style reference. Can pass multiple times.
Watermark	`--watermark`	`false`	Add watermark
Count	`-n`	`1`	Number of images to generate (Seedream 4.x only, 1-4)
Seed	`--seed`	(none)	Random seed for reproducibility (-1 for random)
Response format	`--response-format`	`url`	`url` (download link) or `b64_json` (inline base64)
Remove background	`--remove-bg`	`false`	Remove background → outputs transparent RGBA PNG
API key	`--api-key`	`$ARK_API_KEY`	Override API key
Base URL	`--base-url`	(default)	Override API endpoint

uv run scripts/generate_image.py \
  --prompt "a post-apocalyptic vehicle driving through desert ruins, dramatic lighting" \
  --output ./vehicle_concept.png \
  --size 2K

uv run scripts/generate_image.py \
  --prompt "top-down 2D game sprite of a rusty armored truck, pixel art, 64x64, transparent background" \
  --output ./sprite_truck.png \
  --version 5.0 \
  --size 2K

uv run scripts/generate_image.py \
  --prompt "same style but in winter setting with snow" \
  --image "https://example.com/reference.png" \
  --output ./winter_version.png

uv run scripts/generate_image.py \
  --prompt "combine these styles into a new character design" \
  --image "https://example.com/style1.png" \
  --image "https://example.com/style2.png" \
  --output ./fused_design.png

uv run scripts/generate_image.py \
  --prompt "a warrior character holding a sword, game sprite, white background" \
  --remove-bg \
  --output ./warrior_sprite.png

uv run scripts/generate_image.py \
  --prompt "a sci-fi weapon icon, game UI, clean design" \
  -n 4 \
  --seed 42 \
  --version 4.5 \
  --output ./weapon_icon.png
# Outputs: weapon_icon_1.png, weapon_icon_2.png, weapon_icon_3.png, weapon_icon_4.png

uv run scripts/generate_image.py \
  --prompt "a cute robot character, game sprite, white background" \
  --remove-bg \
  --output ./robot.png

{
  "status": "success",
  "file": "./output.png",
  "size": "3136x1344",
  "model": "doubao-seedream-4-5-251128",
  "tokens_used": 4197376
}

{
  "status": "error",
  "error": "API returned 400: invalid size for model version 4.5"
}

# Step 1: Submit
uv run <skill-path>/scripts/generate_video.py \
  --prompt "a rusted vehicle driving through a sandstorm" \
  --output ./video_output.mp4

# Step 2: Check status (the task_id is printed by step 1)
uv run <skill-path>/scripts/get_video_task_status.py \
  --task-id "task_abc123" \
  --output ./video_output.mp4

Parameter	Flag	Default	Description
Prompt	`--prompt`	(required)	Video description
Output	`--output`	`./generated_video.mp4`	Where to save the video
Model	`--model`	`doubao-seedance-1-5-pro-251215`	Model name
Duration	`--duration`	`5`	Video length in seconds (3-8)
Ratio	`--ratio`	`16:9`	Aspect ratio: `16:9`, `9:16`, `1:1`, `21:9`
First frame	`--first-frame`	(none)	Image URL for image-to-video
Reference images	`--ref-image`	(none)	Reference image URLs (lite i2v models only). Can pass multiple.
Watermark	`--watermark`	`false`	Add watermark
API key	`--api-key`	`$ARK_API_KEY`	Override API key
Base URL	`--base-url`	(default)	Override API endpoint

Parameter	Flag	Default	Description
Task ID	`--task-id`	(required)	The task ID from generate_video.py
Output	`--output`	`./generated_video.mp4`	Where to save when complete
Wait	`--wait`	`false`	Poll until complete (up to 10 min)
API key	`--api-key`	`$ARK_API_KEY`	Override API key
Base URL	`--base-url`	(default)	Override API endpoint

Model	Description
`doubao-seedance-1-5-pro-251215`	Latest pro model (default, best quality)
`doubao-seedance-1-0-pro`	Pro v1.0
`doubao-seedance-1-0-pro-fast`	Faster pro generation
`doubao-seedance-1-0-lite-t2v`	Lightweight text-to-video
`doubao-seedance-1-0-lite-i2v-250428`	Lightweight image-to-video (supports multiple ref images)

uv run scripts/generate_video.py \
  --prompt "a convoy of armored vehicles crossing a desert, dust clouds, cinematic" \
  --duration 5 \
  --ratio 16:9 \
  --output ./convoy.mp4

uv run scripts/generate_video.py \
  --prompt "the vehicle starts moving forward, dust kicks up" \
  --first-frame "https://example.com/vehicle_still.png" \
  --output ./vehicle_moving.mp4

uv run scripts/get_video_task_status.py \
  --task-id "task_abc123" \
  --output ./convoy.mp4 \
  --wait

{
  "status": "submitted",
  "task_id": "task_abc123",
  "model": "doubao-seedance-1-5-pro-251215",
  "message": "Video generation started. Use get_video_task_status.py --task-id task_abc123 --wait to check."
}

{
  "status": "succeeded",
  "task_id": "task_abc123",
  "video_url": "https://...",
  "file": "./convoy.mp4"
}

{
  "status": "processing",
  "task_id": "task_abc123",
  "message": "Still generating..."
}

Version	Allowed Sizes
4.0	1K, 2K, 4K
4.5	2K, 4K
5.0	2K, 3K

Doubao Image & Video Generation

Quick Reference

Configuration

Doubao Image & Video Generation

Quick Reference

Configuration

Image Generation (Seedream)

When to Use

How to Run

Parameters

Size Options by Version

Examples

Transparent PNG (Background Removal)

Output

Video Generation (Seedance)

When to Use

How to Run

Parameters for generate_video.py

Parameters for get_video_task_status.py

Available Models

Examples

Output

Prompting Tips

Error Handling

Frontend Slides

Frontend Slides

Frontend Slides

Ascii Art

Popular Web Designs

Meme Generation