Name: Staging Golden Tier
Author: haberlah

Search skills.../

Staging Golden Tier | Skills Pool

# List staging files
gws drive files list --params '{"q": "\"1W169BsPkexomhf6Qeyz-6dFOkTn-Z2UW\" in parents and trashed=false", "includeItemsFromAllDrives": true, "supportsAllDrives": true, "fields": "files(id,name,modifiedTime,mimeType,size)", "pageSize": 100}'

# Download .docx files
gws drive files get --params '{"fileId": "<ID>", "alt": "media", "supportsAllDrives": true}' --output "/tmp/staging/<filename>"

# Convert .docx to .md
pandoc -f docx -t markdown --wrap=none "/tmp/staging/<filename>.docx" -o "/tmp/staging/<filename>.md"

python3 scripts/clean_transcript.py "/tmp/staging/<file>.md" -o "/tmp/staging/<file>_cleaned.md"

python3 scripts/generate_frontmatter.py \
  --participant "Name" \
  --role "Support Coordinator" \
  --stage 4 \
  --session-type "MVP Testing" \
  --date "2026-04-09" \
  --source "Gemini embedded transcript" \
  --source-file "original.docx" \
  --content-type transcript \
  --has-companion-notes true \
  "/tmp/staging/<file>_cleaned.md"

# Lock processed files
chmod 444 "Golden_Tier/{path}_transcript.md"
chmod 444 "Golden_Tier/{path}_notes.md"

# Update manifest
python3 scripts/update_manifest.py \
  --manifest "Golden_Tier/manifest.yaml" \
  --transcript "Golden_Tier/{path}_transcript.md" \
  --notes "Golden_Tier/{path}_notes.md"

Check	Method	Pass criteria
YAML frontmatter	Parse YAML block	All required fields present and correctly typed
Word count	Count body words, compare to frontmatter	Within +/- 5 of `word_count` field
Speaker format (Read AI)	Regex: `^\\[^]+\\* \[[^\]]+\] \(\d+:\d{2}:\d{2}\):`	Every speaker turn matches
Speaker format (Gemini)	Regex: `^[A-Z][^:]+:` after a timestamp block	Consistent speaker labels
No pandoc artifacts	Search for `{.underline}`, `[~~`, trailing `\`	Zero matches
No Gemini noise	Search for "You should review", "Suggested next steps"	Zero matches in transcript files
No unmerged turns	Check for consecutive identical speaker labels	None found (or justified)
File permissions	`stat -f %Lp` or `ls -la`	`444` (read-only)
File location	Path check	Correct subfolder and naming convention
Manifest consistency	Parse manifest, check for duplicates, verify totals	Clean

Staging Golden Tier

Prerequisites

Reference files

Workflow

Staging Golden Tier

Prerequisites

Reference files

Workflow

Phase 1 — Inventory

Phase 2 — Download & Convert

Phase 3 — Parse & Extract

Phase 4 — Clean

Phase 5 — Format & Place

Phase 6 — Lock & Update Index

Phase 7 — Final Verification

Safety

Clickhouse Io

Clickhouse Io

Claude Devfleet

Clickhouse Io

Ai First Engineering

Postgres Patterns