Name: Video Use
Author: browser-use

Video Use

Edit any video by conversation. Transcribe, cut, color grade, generate overlay animations, burn subtitles — for talking heads, montages, tutorials, travel, interviews. No presets, no menus. Ask questions, confirm the plan, execute, iterate, persist. Production-correctness rules are hard; everything else is artistic freedom.

browser-use1,259 estrellas15 abr 2026

Ocupación
Categorías: Medios

Principle

LLM reasons from raw transcript + on-demand visuals. The only derived artifact that earns its keep is a packed phrase-level transcript (takes_packed.md). Everything else — filler tagging, retake detection, shot classification, emphasis scoring — you derive at decision time.
Audio is primary, visuals follow. Cut candidates come from speech boundaries and silence gaps. Drill into visuals only at decision points.
Ask → confirm → execute → iterate → persist. Never touch the cut until the user has confirmed the strategy in plain English.
Generalize. Do not assume what kind of video this is. Look at the material, ask the user, then edit.
Artistic freedom is the default. Every specific value, preset, font, color, duration, pitch structure, and technique in this document is a worked example from one proven video — not a mandate. Read them to understand what's possible and why each worked. Then make your own taste calls based on what the material actually is and what the user actually wants. The only things you MUST do are in the Hard Rules section below. Everything else is yours.

Principle

LLM reasons from raw transcript + on-demand visuals. The only derived artifact that earns its keep is a packed phrase-level transcript (takes_packed.md). Everything else — filler tagging, retake detection, shot classification, emphasis scoring — you derive at decision time.
Audio is primary, visuals follow. Cut candidates come from speech boundaries and silence gaps. Drill into visuals only at decision points.
Ask → confirm → execute → iterate → persist. Never touch the cut until the user has confirmed the strategy in plain English.
Generalize. Do not assume what kind of video this is. Look at the material, ask the user, then edit.
Artistic freedom is the default. Every specific value, preset, font, color, duration, pitch structure, and technique in this document is a worked example from one proven video — not a mandate. Read them to understand what's possible and why each worked. Then make your own taste calls based on what the material actually is and what the user actually wants. The only things you MUST do are in the Hard Rules section below. Everything else is yours.

Video Use

Principle

Video Use

Principle

Hard Rules (production correctness — non-negotiable)

Directory layout

Setup

Helpers

The process

Cut craft (techniques)

The packed transcript (primary reading view)

Editor sub-agent brief (for multi-take selection)

Color grade (when requested)

Subtitles (when requested)

Animations (when requested)

Output spec

EDL format

Memory — `project.md`

Anti-patterns

Songsee

Video Frames

Gifgrep

Qqbot Media

Camsnap

Openai Whisper Api

Video Use

Principle

Video Use

Principle

Hard Rules (production correctness — non-negotiable)

Directory layout

Setup

Helpers

The process

Cut craft (techniques)

The packed transcript (primary reading view)

Editor sub-agent brief (for multi-take selection)

Color grade (when requested)

Subtitles (when requested)

Animations (when requested)

Output spec

EDL format

Memory — project.md

Anti-patterns

Songsee

Video Frames

Gifgrep

Qqbot Media

Camsnap

Openai Whisper Api

Memory — `project.md`