Name: Media Ingest
Author: garrytan

Media Ingest Skill

Ingest video, audio, PDF, book, screenshot, and GitHub repo content into the brain.

Filing rule: Read skills/_brain-filing-rules.md before creating any new page.

Contract

This skill guarantees:

Every ingested media item gets a brain page with analysis, not just a transcript dump
Video and audio transcripts are saved in both raw and human-readable formats
Entity extraction: every person and company mentioned is back-linked
Raw source files are preserved via gbrain files upload-raw
Pages are filed by primary subject, not by media format
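
The guarantees above can be read as a checklist to run before an ingest is considered done. The following is a minimal sketch of such a check, not part of the skill itself: the IngestResult record, its field names, and the contract_violations helper are all hypothetical and only illustrate the shape of the contract.

```python
from dataclasses import dataclass


@dataclass
class IngestResult:
    """Hypothetical record of one ingest run; all field names are illustrative."""
    media_kind: str                   # e.g. "video", "audio", "pdf"
    analysis: str = ""                # analysis text written to the brain page
    raw_transcript: str = ""          # raw transcript, if the media has one
    readable_transcript: str = ""     # human-readable transcript, if any
    raw_file_uploaded: bool = False   # True once the source file is preserved


def contract_violations(result: IngestResult) -> list[str]:
    """Return the guarantees this result fails to meet (empty list means OK)."""
    problems = []
    if not result.analysis.strip():
        problems.append("missing analysis")
    # Transcripts are only required for video and audio.
    if result.media_kind in {"video", "audio"}:
        if not result.raw_transcript:
            problems.append("missing raw transcript")
        if not result.readable_transcript:
            problems.append("missing human-readable transcript")
    if not result.raw_file_uploaded:
        problems.append("raw source file not preserved")
    return problems
```

Back-linking and filing by subject are left out of this sketch because they depend on the brain's page layout and on skills/_brain-filing-rules.md.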

Every mention of a person or company with a brain page MUST create a back-link.
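
As an illustration of the back-link rule, here is a rough sketch that scans ingested text for entities that already have a brain page. It assumes a hypothetical entity_pages mapping from entity name to page path. The function name, the page paths, and the plain name-matching approach are assumptions made for the example, and it records one back-link per entity per source page rather than one per mention.

```python
import re


def find_backlinks(
    text: str,
    entity_pages: dict[str, str],
    source_page: str,
) -> list[tuple[str, str]]:
    """Return (entity_page, source_page) pairs for each known entity found in text."""
    links = []
    for name, page in entity_pages.items():
        # Whole-word, case-insensitive match so "Ann" does not match "annual".
        pattern = r"(?<!\w)" + re.escape(name) + r"(?!\w)"
        if re.search(pattern, text, flags=re.IGNORECASE):
            links.append((page, source_page))
    return links
```

For example, with entity_pages containing "Ada Lovelace" and "Acme", the text "Ada Lovelace joined ACME" yields one pair per entity, each pointing at the media page that mentioned it. A real pipeline would likely also need alias handling (nicknames, former company names), which this sketch ignores.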

Format	Action
YouTube/video URL	Fetch transcript (Whisper, transcription service, or captions)
Audio file	Transcribe with available STT service
PDF	Extract text (OCR if needed)
Book PDF	Extract text, identify chapters/sections
Screenshot/image	OCR via vision model, extract text and entities
GitHub repo	Clone, read README + key files, summarize architecture
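
The table above amounts to a dispatch on input type. The following is a minimal sketch of that dispatch, assuming inputs arrive as a URL or a file path. The classify_media function, the host list, and the extension map are hypothetical choices for the example. A book PDF cannot be told apart from any other PDF by extension, so it is classified as "pdf" here and the chapter-identification step would need a separate decision.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Actions copied from the Format/Action table; the keys are illustrative labels.
ACTIONS = {
    "video": "Fetch transcript (Whisper, transcription service, or captions)",
    "audio": "Transcribe with available STT service",
    "pdf": "Extract text (OCR if needed)",
    "image": "OCR via vision model, extract text and entities",
    "github": "Clone, read README + key files, summarize architecture",
}

# Assumed host and extension lists; extend as needed.
VIDEO_HOSTS = {"youtube.com", "www.youtube.com", "youtu.be"}
GITHUB_HOSTS = {"github.com", "www.github.com"}
EXTENSIONS = {
    ".mp4": "video", ".mov": "video", ".mkv": "video",
    ".mp3": "audio", ".wav": "audio", ".m4a": "audio",
    ".pdf": "pdf",
    ".png": "image", ".jpg": "image", ".jpeg": "image",
}


def classify_media(source: str) -> str:
    """Map a URL or file path to one of the ACTIONS keys."""
    parsed = urlparse(source)
    host = (parsed.hostname or "").lower()
    if host in VIDEO_HOSTS:
        return "video"
    if host in GITHUB_HOSTS:
        return "github"
    # Fall back to the file extension for local paths and direct file URLs.
    suffix = PurePosixPath(parsed.path).suffix.lower()
    if suffix in EXTENSIONS:
        return EXTENSIONS[suffix]
    raise ValueError(f"unsupported media source: {source}")
```

Looking up ACTIONS[classify_media(source)] then gives the action from the table for that input. Unknown inputs raise an error rather than being filed under a guessed format.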