# Evaluate Generated Video Quality

Evaluate generated video quality using the available metrics (SSIM, loss trajectory, caption consistency). This skill assesses videos produced by a training run, combining multiple signals into a holistic quality assessment. The skill is evolving; new metrics will be added as they are developed.
| Parameter | Required | Description |
|---|---|---|
| video_paths | Yes | List of paths to generated videos |
| reference_paths | No | Paths to reference videos (for SSIM) |
| prompts | No | Prompts used to generate the videos (for the caption check) |
| loss_summary | No | Path to a W&B summary JSON (for the loss trajectory) |
| metrics | No | Which metrics to run (default: all available) |
Check .agents/memory/evaluation-registry/README.md for the current catalog.
Leverages the existing infrastructure in `fastvideo/tests/ssim/`:

```bash
pytest fastvideo/tests/ssim/ -vs --video-path <generated> --reference-path <reference>
```

Or use the SSIM utility directly:

```python
from fastvideo.tests.ssim.ssim_utils import compute_ssim

score = compute_ssim(generated_video, reference_video)
# score > 0.85 is typically "acceptable"
```
Interpretation:
| SSIM Range | Quality |
|---|---|
| > 0.90 | Excellent — very close to reference |
| 0.80–0.90 | Good — acceptable for most uses |
| 0.70–0.80 | Fair — noticeable differences |
| < 0.70 | Poor — significant quality issues |
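The bands in the table above can be encoded as a small helper for use in reports. This is a sketch; `ssim_quality_label` is a hypothetical name, not part of the existing test infrastructure:

```python
def ssim_quality_label(score: float) -> str:
    """Map an SSIM score to the quality bands in the interpretation table.

    Hypothetical helper: thresholds mirror the table
    (>0.90 excellent, 0.80-0.90 good, 0.70-0.80 fair, <0.70 poor).
    """
    if score > 0.90:
        return "excellent"
    if score >= 0.80:
        return "good"
    if score >= 0.70:
        return "fair"
    return "poor"
```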
Analyze the loss curve shape from the W&B summary:

```python
import json

with open(loss_summary_path) as f:
    summary = json.load(f)

final_loss = summary["train_loss"]
runtime = summary["_runtime"]
steps = summary["_step"]
```
Early-stage heuristics (first 500 steps):
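One such heuristic can be sketched as a trend check over sampled loss values (assumed to be exported separately, e.g. from the run's history; the function name and tolerance are illustrative, not an existing API):

```python
def loss_is_decreasing(losses: list[float], tol: float = 1e-6) -> bool:
    """Check whether sampled losses trend downward overall.

    Sketch: fits a least-squares slope over the samples; a slope more
    negative than -tol counts as "decreasing". Assumes evenly spaced samples.
    """
    n = len(losses)
    if n < 2:
        return False
    xs = range(n)
    mean_x = sum(xs) / n
    mean_y = sum(losses) / n
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, losses))
    den = sum((x - mean_x) ** 2 for x in xs)
    return (num / den) < -tol
```

A slope-based check is more robust to step-to-step noise than comparing consecutive values directly.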
Use an LLM to evaluate whether the video content matches the input prompt.
Prompt: "A golden retriever playing in the snow"
Video: <path>
Score the video on:
1. Object presence (is there a golden retriever?)
2. Action accuracy (is it playing?)
3. Environment match (is there snow?)
4. Overall coherence (does it look natural?)
Score each criterion from 1 to 5, for a total out of 20.
⚠️ This metric is in draft status. Results should not be treated as ground truth until calibrated against human judgments.
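Since the LLM's response is free text, the four sub-scores have to be parsed out before they can be summed. A minimal sketch, assuming each criterion line ends in a `Score: N` marker (this response format is an assumption, not something the metric currently enforces):

```python
import re

def parse_caption_score(llm_response: str) -> "int | None":
    """Extract the four 1-5 sub-scores and sum them to a /20 total.

    Hypothetical parser: assumes each criterion appears as 'Score: N'.
    Returns None when the response does not match the expected format,
    so callers can retry or flag the video for manual review.
    """
    scores = [int(m) for m in re.findall(r"[Ss]core:\s*([1-5])\b", llm_response)]
    if len(scores) != 4:
        return None
    return sum(scores)
```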
See `.agents/memory/evaluation-registry/README.md`.

## Video Quality Report: <experiment_name>
| Metric | Score | Threshold | Status |
|--------|-------|-----------|--------|
| SSIM (avg) | 0.87 | > 0.80 | ✅ Pass |
| Loss trajectory | decreasing | decreasing | ✅ Pass |
| Caption consistency | 16/20 | > 14/20 | ✅ Pass |
### Per-Video Scores
| Video | SSIM | Caption |
|-------|------|---------|
| video_001.mp4 | 0.89 | 17/20 |
| video_002.mp4 | 0.85 | 15/20 |
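The report format above can be assembled programmatically from per-video results. A sketch (the `render_report` name and the `(filename, ssim, caption_score)` row shape are assumptions for illustration):

```python
def render_report(experiment: str, rows: "list[tuple[str, float, int]]") -> str:
    """Render the per-video section of the quality report as markdown.

    Hypothetical helper: each row is (filename, ssim, caption_score_out_of_20).
    """
    lines = [
        f"## Video Quality Report: {experiment}",
        "",
        "### Per-Video Scores",
        "| Video | SSIM | Caption |",
        "|-------|------|---------|",
    ]
    for name, ssim, caption in rows:
        lines.append(f"| {name} | {ssim:.2f} | {caption}/20 |")
    return "\n".join(lines)
```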
## Related Files

- `fastvideo/tests/ssim/` — SSIM test infrastructure
- `fastvideo/tests/training/Vanilla/test_training_loss.py` — loss comparison
- `.agents/memory/evaluation-registry/README.md` — metric catalog

## Changelog

| Date | Change |
|---|---|
| 2026-03-02 | Initial version with SSIM, loss trajectory, caption consistency stub |