Name: Media Agents
Author: anokye-labs

Search skills.../

Media Agents | Skills Pool

User: "Create 3 social media variants of this product photo"

Fleet dispatch:
  ├─ generator-1 → 1200×628 Facebook variant
  ├─ generator-2 → 1080×1080 Instagram variant
  └─ generator-3 → 1200×675 Twitter variant

All run in parallel. Aggregate results when all complete.

Analyze Input → Plan Pipeline → Execute Steps → Validate Output

Pipeline for "Create a product hero image":
  1. generate  → fal-ai flux-pro, 1920×1080, product photography style
  2. enhance   → sharpen, color-correct, remove artifacts
  3. resize    → create 3 variants: hero (1920×1080), thumb (400×300), og (1200×630)
  4. validate  → check dimensions, file size <2MB, format is WebP

After This Step	Save What
Generation	Base image path, model used, prompt, seed
Enhancement	Enhanced image path, operations applied
Resize/Convert	Each variant path, dimensions, format
Validation	Quality scores, pass/fail per variant

output/
├── .checkpoint.json        ← workflow state
├── base_hero.png           ← generation output
├── base_hero_enhanced.png  ← enhancement output
├── hero_1920x1080.webp     ← final variant
├── thumb_400x300.webp      ← final variant
└── og_1200x630.webp        ← final variant

{
  "workflow_id": "hero-image-20260206-1842",
  "status": "in_progress",
  "current_step": 3,
  "steps": [
    {"name": "generate", "status": "done", "output": "base_hero.png"},
    {"name": "enhance", "status": "done", "output": "base_hero_enhanced.png"},
    {"name": "resize", "status": "in_progress", "completed": ["hero_1920x1080.webp"]},
    {"name": "validate", "status": "pending"}
  ]
}

Check	Method	Threshold
Dimensions	`get_metainfo` → width, height	Must match request ±1px
Format	`get_metainfo` → format	Must match requested format
File size	File system check	Must be under specified limit
Quality	`detect` for artifacts	Confidence > 0.8 for expected content
Content	`ocr` if text expected	Text must be legible and correct

User: "Resize this image to 800×600"

→ Call ImageSorcery resize with width=800, height=600
→ Return: "Resized to 800×600. Saved to output/image_resized.png (42KB)"

User: "Resize this image to 800×600"

→ "I'd be happy to help resize your image! There are several approaches
   we could take. The most common method is bilinear interpolation..."

✅ Generated hero image (1920×1080, 245KB)
✅ Created 3 variants: hero, thumbnail, og-image
✅ All variants validated — WebP format, under 2MB

Output directory: output/hero-image/

Attempt 1: immediate
Attempt 2: wait 1s  (±25% jitter)
Attempt 3: wait 2s  (±25% jitter)
Attempt 4: wait 4s  (±25% jitter)
Attempt 5: fail — report error to user

✅ Generated base image (1920×1080)
✅ Created hero variant (1920×1080, WebP)
❌ Thumbnail resize failed: out of memory
✅ OG image variant (1200×630, WebP)

2 of 3 variants created. Retry thumbnail? [yes/no]

Step 1: Analyze
  - No source image → generate from scratch
  - Need: hero banner, likely 1920×1080, high quality

Step 2: Plan
  1. Generate base image via fal-ai flux-pro
  2. Enhance: sharpen, adjust contrast
  3. Create variants: hero (1920×1080), thumbnail (400×300), og (1200×630)
  4. Validate all outputs

Step 3: Execute
  [checkpoint] generate → output/base_headphones.png
  [checkpoint] enhance → output/base_headphones_enhanced.png
  [parallel]
    ├─ resize → output/hero_1920x1080.webp
    ├─ resize → output/thumb_400x300.webp
    └─ resize → output/og_1200x630.webp
  [checkpoint] validate all variants

Step 4: Report
  ✅ 3 variants created, all validated
  Output: output/hero-headphones/

Step 1: Analyze
  - Source: user's logo file
  - Need: platform-specific variants with proper dimensions

Step 2: Plan (parallel fleet dispatch)
  generator-1 → Facebook cover (820×312)
  generator-2 → Instagram post (1080×1080)
  generator-3 → Twitter header (1500×500)
  generator-4 → LinkedIn banner (1584×396)
  validator   → check all outputs

Step 3: Execute
  [parallel] All 4 generators run simultaneously
  [sequential] Validator checks each output
  [checkpoint] Save all results

Step 4: Report
  ✅ 4 social media variants created
  All pass dimension and format validation
  Output: output/social-kit/

Step 1: Analyze
  - Source: user's photo (read path, get_metainfo)
  - Need: enhanced + web-optimized variants

Step 2: Plan (sequential chain)
  1. Analyze source dimensions and format
  2. Sharpen via unsharp mask
  3. Color-correct (auto white balance)
  4. Export WebP variants: original size + 50% + thumbnail
  5. Validate file sizes under 500KB

Step 3: Execute
  [checkpoint] get_metainfo → 4032×3024, JPEG, 3.2MB
  [checkpoint] sharpen → output/photo_sharp.png
  [checkpoint] color-correct → output/photo_corrected.png
  [parallel]
    ├─ export → output/photo_full.webp (4032×3024)
    ├─ export → output/photo_half.webp (2016×1512)
    └─ export → output/photo_thumb.webp (400×300)
  [checkpoint] validate sizes

Step 4: Report
  ✅ Enhanced and exported 3 web-ready variants
  Original: 3.2MB → Largest variant: 380KB (88% reduction)
  Output: output/photo-enhanced/

Pattern	When	How
Fleet dispatch	Independent subtasks	Parallel agents, aggregate results
Sequential chain	Dependent steps	Output → next input, checkpoint each
Checkpoint	After every mutation	Write state + output to disk
Retry	Transient errors	Exponential backoff, max 5 attempts
Regenerate	Quality failure	Adjust params, max 2 retries
Partial report	Mixed success/failure	Report each step, offer retry
Read-only	Always	Write to output/, never touch source

Role	Responsibility	Tools
generator	Create base media via fal.ai models	`fal-ai` skill
processor	Transform media (resize, crop, convert, enhance)	ImageSorcery MCP
validator	Check quality, dimensions, format compliance	ImageSorcery `detect`, `get_metainfo`, `ocr`
optimizer

Role	Responsibility	Tools
generator	Create base media via fal.ai models	`fal-ai` skill
processor	Transform media (resize, crop, convert, enhance)	ImageSorcery MCP
validator	Check quality, dimensions, format compliance	ImageSorcery `detect`, `get_metainfo`, `ocr`
optimizer

Media Agents

Media Agents — Agentic Workflow Patterns

1. Fleet Pattern

Agent Roles

Media Agents

Media Agents — Agentic Workflow Patterns

1. Fleet Pattern

Agent Roles

Parallel Dispatch

Result Aggregation

2. Multi-Step Reasoning

Step 1: Analyze Input

Step 2: Plan Pipeline

Step 3: Execute Steps

Step 4: Validate Output

3. Checkpoint Pattern

When to Checkpoint

Checkpoint Storage

Resume on Failure

Validation Checks

4. Actions-First Design

Do This

Not This

Batching Rules

Response Format

5. Read-Only by Default

Output Rules

Confirmation Required For

No Confirmation Needed For

6. Error Handling

API Failures (Transient)

Quality Failures

Partial Failures

Permanent Failures

7. Available Agent Types

media-generator

image-processor

video-processor

quality-validator

8. Example Workflows

Workflow A: Product Hero Image

Workflow B: Batch Social Media Kit

Workflow C: Image Enhancement Pipeline

Quick Reference

Openai Whisper

Voice Call

Prose

Clawhub

Sherpa Onnx Tts

Openai Whisper Api