Name: Generating Videos
Author: yikart

Search skills.../

Generating Videos | Skills Pool

Parameter	Values	Default	Description
duration	1-15	-	Video duration in seconds
resolution	480p, 720p	720p	Video resolution
aspectRatio	1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3	9:16	Video aspect ratio
imageUrl	string	-	Reference image URL for image-to-video

Model	Speed	Quality	Video Extension	Reference Images	Use Case
veo-3.1-fast-generate-preview	Fast	Good	Yes	No	Default Veo - Most use cases
veo-3.1-generate-preview	Slow	Better	Yes	Yes (max 3)	Best quality + all features
veo-3.1-fast-generate-001	Fast	Good	No	No	Simple generation (no extension)
veo-3.1-generate-001	Slow	Better	No	No	Higher quality (no extension)

Parameter	Values	Default	Description
duration	4, 6, 8	8	Video duration in seconds
resolution	720p, 1080p, 4000	720p	Video resolution (1080p/4K takes longer)
aspectRatio	16:9, 9:16	9:16	Video aspect ratio (vertical by default for social media)
negativePrompt	string	-	What to exclude from the video
seed	number	-	Random seed for reproducibility

A man in a suit stands at a podium, speaking confidently: "Welcome to the future of technology."
一位穿西装的男子站在讲台上，自信地说："欢迎来到科技的未来。"

Mode	Parameters	Models Supported
Text-to-video	prompt only	All
Image-to-video	prompt + image	All
First-last-frame	prompt + image + lastFrame	All
Video extension	prompt + video	Preview models only
Reference images	prompt + referenceImages	veo-3.1-generate-preview only

Target Duration	Recommended Approach
≤ 15 seconds	Grok direct generation (preferred, faster and cost-effective)
16-36 seconds	Veo Video Extension (dynamic initial + N extensions)
> 36 seconds	First-Last-Frame Storyboard

For each segment URL:
  uploadAndGetVid(url) → vid://segment_N, duration_N, width, height

{
  "Canvas": { "Width": 1920, "Height": 1080 },
  "Track": [[
    { "Type": "video", "Source": "vid://segment_1", "TargetTime": [0, 8000] },
    { "Type": "video", "Source": "vid://segment_2", "TargetTime": [8000, 16000] },
    { "Type": "video", "Source": "vid://segment_3", "TargetTime": [16000, 24000] }
  ]]
}

{
  "Canvas": { "Width": 1920, "Height": 1080 },
  "Track": [[
    {
      "Type": "video",
      "Source": "vid://segment_1",
      "TargetTime": [0, 8000],
      "Extra": [{ "Type": "transition", "Source": "1182376", "Duration": 500 }]
    },
    {
      "Type": "video",
      "Source": "vid://segment_2",
      "TargetTime": [7500, 15500],
      "Extra": [{ "Type": "transition", "Source": "1182376", "Duration": 500 }]
    },
    { "Type": "video", "Source": "vid://segment_3", "TargetTime": [15000, 23000] }
  ]]
}

submitDirectEditTask(Canvas, Track) → taskId
wait 90 seconds
poll getVideoEditTaskStatus(taskId) every 30 seconds until completed

**Generated Video**:
![Video](url)

Video URL: url

All N video segments completed!

| # | Status | Preview | URL |
|---|--------|---------|-----|
| 1 | ✅ | ![Segment 1](url1) | url1 |
| 2 | ✅ | ![Segment 2](url2) | url2 |
| ... | ... | ... | ... |

**All Video URLs**:
1. url1
2. url2
...

**Final Video** (concatenated from N segments):
![Final Video](final_url)

Final Video URL: final_url

Scenario	Use Model	Reason
Video ≤ 15s (text-to-video / image-to-video)	Grok (preferred)	Grok supports 1-15s directly, faster and cost-effective
Video 16-36s (continuous)	Veo	Requires Video Extension, Grok max is 15s
First-last-frame storyboard	Veo	Grok does not support first-last-frame
Reference images (style consistency)	Veo (generate-preview)	Grok does not support reference images
User explicitly requests Veo

Scenario	Use Model	Reason
Video ≤ 15s (text-to-video / image-to-video)	Grok (preferred)	Grok supports 1-15s directly, faster and cost-effective
Video 16-36s (continuous)	Veo	Requires Video Extension, Grok max is 15s
First-last-frame storyboard	Veo	Grok does not support first-last-frame
Reference images (style consistency)	Veo (generate-preview)	Grok does not support reference images
User explicitly requests Veo

Target	Initial	Extensions	Actual
18s	4s	2	18s
22s	8s	2	22s
27s	6s	3	27s
29s	8s	3	29s
36s	8s	4	36s

Mode	Parameters	Description
Text-to-video	prompt only	Generate video from text description
Image-to-video	prompt + imageUrl	Generate video using a reference image

Generating Videos

Video Generation

Model Selection Strategy

Generating Videos

Video Generation

Model Selection Strategy

Language Rule

Grok Video Models (Preferred)

Grok Parameters

Grok Generation Modes

Grok Workflow

Veo 3.1 Models (Advanced Features)

Veo Parameters

Prompt Guide (applies to both Grok and Veo)

Prompt Structure

1. Subject & Background (主题与背景)

2. Action (动作)

3. Style (风格)

4. Camera Work (镜头)

5. Atmosphere (氛围)

Audio Generation Guide

1. Dialogue (对话)

2. Sound Effects (音效)

3. Ambient Noise (环境音)

Generation Modes

Video Extension

Limitations

Workflow

Workflow

Grok Video Generation (Preferred)

Veo Video Generation

First-Last-Frame Generation (Veo only)

Duration-Based Model Selection Strategy

When to Use Each Approach

Why Grok for ≤ 15s?

Veo Extension (for 16-36s only)

Extension Workflow

Long Videos Strategy

Approach 1: First-Last-Frame Storyboard

Approach 2: Video Extension

Video Concatenation Guide

Prerequisites

Concatenation Workflow

For First-Last-Frame Videos - NO TRANSITIONS

For Independent Segments - Optional Transitions

Output Requirements

Single Video Output

Multiple Videos Output (Batch/Storyboard)

After Concatenation

Songsee

Video Frames

Gifgrep

Qqbot Media

Camsnap

Openai Whisper Api