CLI Commands

All media commands run via bash. Accept JSON params as a single argument. Use a separate bash tool call for each media command — do not chain multiple media commands in a single bash call. This allows parallel execution and correct UI rendering.

Use the correct api_credentials suffix for billing:

api_credentials=["llm-api:image"] — image generation
api_credentials=["llm-api:video"] — video generation
api_credentials=["llm-api:audio"] — text-to-speech and transcription

If the runtime has the elevenlabs skill installed, prefer it for dedicated text-to-speech, voice, and transcription workflows instead of building custom wrappers.

asi-generate-image

Generate images from text prompts. Supports img2img with reference images.

asi-generate-image '{"prompt": "A sunset over mountains", "filename": "sunset", "aspect_ratio": "16:9"}'

CLI Commands

Use the correct api_credentials suffix for billing:

api_credentials=["llm-api:image"] — image generation
api_credentials=["llm-api:video"] — video generation
api_credentials=["llm-api:audio"] — text-to-speech and transcription

If the runtime has the elevenlabs skill installed, prefer it for dedicated text-to-speech, voice, and transcription workflows instead of building custom wrappers.

asi-generate-image

Generate images from text prompts. Supports img2img with reference images.

asi-generate-image '{"prompt": "A sunset over mountains", "filename": "sunset", "aspect_ratio": "16:9"}'

Parameter	Required	Default	Description
`prompt`	yes	—	Detailed description of the image
`filename`	yes	—	Output filename without extension (adds .png)
`aspect_ratio`	no	`"1:1"`	`"1:1"`, `"3:4"`, `"4:3"`, `"9:16"`, `"16:9"`
`model`	no	`"nano_banana_2"`	`"nano_banana_2"`, `"nano_banana_pro"`, `"gpt_image_1_5"`
`images`	no	—	List of absolute image paths for img2img (max 10, PNG/JPEG/WebP)
`background`	no	—	`"transparent"`, `"opaque"`, or `"auto"` (only for `gpt_image_1_5`)

Parameter	Required	Default	Description
`prompt`	yes	—	Scene description including action, camera movement, style
`filename`	yes	—	Output filename without extension (adds .mp4)
`aspect_ratio`	no	`"16:9"`	`"16:9"` (landscape) or `"9:16"` (portrait)
`duration`	no	`8`	Sora: 4, 8, 12 seconds. Veo: 4, 6, 8 seconds
`model`	no	`"sora_2"`	`"sora_2"`, `"sora_2_pro"`, `"veo_3_1"`, `"veo_3_1_fast"`
`image_path`	no	—	Absolute path to starting frame image

Parameter	Required	Default	Description
`file_path`	yes	—	Absolute path to .txt (single speaker) or .json (multi-speaker dialogue)
`voice`	no	`"kore"`	Voice name for single-speaker .txt files. Ignored for .json dialogue
`model`	no	`"gemini_2_5_pro_tts"`	`"gemini_2_5_pro_tts"` or `"elevenlabs_tts_v3"`

Parameter	Required	Default	Description
`file_path`	yes	—	Absolute path to audio/video file
`diarize`	no	`false`	Identify speakers (up to 32)
`num_speakers`	no	—	Hint for expected number of speakers (1-32)
`timestamps`	no	`"none"`	`"none"` (plain txt), `"word"`, `"character"` (json with timing)
`language_code`	no	—	ISO 639-1 code (e.g. `"en"`, `"es"`). Auto-detected if omitted

Media

CLI Commands

asi-generate-image

Media

CLI Commands

asi-generate-image

asi-generate-video

asi-text-to-speech

asi-transcribe-audio

Prose

Coding Agent (bash-first)

Create Prompt

Strategic Compact

Strategic Compact

Strategic Compact