Document Analysis | Skills Pool

Archivo del skill

Document Analysis

Content analysis for documents — PDFs, gists, articles, markdown, arxiv papers, code snippets, meeting notes. Dual-lens (Creator View + Engineer View). Two user-invokable depths (Standard / Deep); Quick Scan is triage- only. Outputs to .research/analysis/<doc-slug>/.

jasonmichaelbell78-creator2 estrellas17 abr 2026

Ocupación
Categorías: Documentos

Contenido de la habilidad

Document Version: 2.0 Last Updated: 2026-04-15 Status: ACTIVE

Shared conventions: See .claude/skills/shared/CONVENTIONS.md

/analyze router: This skill is the document-handler arm of /analyze — direct invocation and router dispatch both supported. Handoff contract: the router passes {target, auto_detected_type: "document"} as if the skill were invoked directly.

Document Analysis

Analysis of documents as knowledge artifacts. Mirrors /repo-analysis structure for cohesion with sibling CAS handlers. Handles any text-readable source: PDFs, GitHub gists, articles, markdown files, arxiv papers, code snippets, meeting notes.

Warm-up (shown at invocation)

/document-analysis <path-or-url> [--depth=standard|deep]
  depth:          standard (default) | deep | quick (triage)
  phases:         PHASE N of M  (M = 9 Standard / Deep, 1 Quick)
  est. time:      Quick ~15s | Standard 2-4m | Deep 5-10m
  output:         .research/analysis/<slug>/
  prior feedback: {replay per CONVENTIONS §18 if prior state file exists}

Skills relacionados

You want to…	Use this
Analyze one PDF / gist / article / paper	`/document-analysis` (here)
Let router auto-pick repo vs site vs PDF	`/analyze <target>`
Cross-source synthesis across 3+	`/synthesize`
GitHub repo	`/repo-analysis`
Website / blog	`/website-analysis`
Video / audio	`/media-analysis`

/document-analysis <path-or-url>
/document-analysis <path> --depth=standard
/document-analysis <path> --depth=deep
/document-analysis <path> --depth=quick       # triage only

Artifact	Phase	Format / Notes
`analysis.json`	0+	Core record (schema v3.0)
`findings.jsonl`	3/5	One JSON object per line
`creator-view.md`	4	Conversational prose, 6 sections
`summary.md`	5	Health bands
`value-map.json`	6	Candidates ranked
`deep-read.md`	2	Internal references catalog
`content-eval.jsonl`	3.5	Evaluated references
`coverage-audit.jsonl`	6b	Unexplored items + user decisions
`extraction-journal.jsonl`	routing	Append-only cross-source record

VALIDATE       Guards         -> File exists? URL reachable? Supported type? Prior feedback replay (§18)?
PHASE 0 of 9   Quick Scan     -> Read first page/section, classify, lightweight creator lens
GATE           Interactive    -> "Run Standard/Deep?" — flag bypasses
PHASE 1 of 9   Content Load   -> Read full document
PHASE 2 of 9   Deep Read      -> Internal references, citations, linked resources
PHASE 3 of 9   Dimension Wave -> 6 dimensions: depth, methodology, actionability, novelty, clarity, source quality
PHASE 3.5 of 9 Content Eval   -> Evaluate embedded references — BEFORE Creator View
PHASE 4 of 9   Creator View   -> 6 sections, home context comparison
PHASE 5 of 9   Engineer View  -> Quality bands via shared scoring
PHASE 6 of 9   Value Map      -> Pattern/knowledge/content/anti-pattern candidates
PHASE 6b of 9  Coverage Audit -> Unread sections, unfollowed references
PHASE 6c of 9  Tag Suggestion -> Per _shared/TAG_SUGGESTION.md
SELF-AUDIT + ROUTING

Dimension	What It Measures
Content Depth	How thoroughly topics are covered
Methodology	Rigor of reasoning, evidence quality
Actionability	How directly applicable the ideas are
Novelty	Original insights vs rehashed knowledge
Clarity	Writing quality, organization, accessibility
Source Quality	Author credibility, citation quality, recency

Gate	Default
`--depth` unspecified	`standard`
Quick → Standard gate unanswered	`proceed to Standard`
Scope-explosion (>100 pages)	`first 50 pages`
Coverage Audit unanswered	`skip all` (logged)
Tag Suggestion unanswered	never auto-approve — block
Routing menu unanswered	`7. Done` (cleanup + invocation track)
Prior Feedback Replay (CONV §18)	`continue unchanged` (logged as shown)

Option	Action
1. Extract value	Present candidates from value-map
2. Send to TDMS	Transform findings to TDMS format
3. Deep-plan this	Inject analysis as research context
4. Save to memory	Persist key findings
5. Adoption verdict	Full assessment (if applicable)
6. Explore insights	Deeper conversation about findings
7. Done	Cleanup, confirm artifacts, track invocation
8. Cross-source synthesis	If 3+ sources analyzed, offer `/synthesize`

cd scripts/reviews && npx tsx write-invocation.ts --data '{
  "skill":"document-analysis","type":"skill","success":true,
  "schema_version":1,"completeness":"stub",
  "origin":{"type":"manual"},
  "context":{"target":"DOC_SLUG","mode":"document","depth":"DEPTH",
             "score":SCORE,"decisions":DECISION_COUNT,
             "candidates":CANDIDATE_COUNT}
}'

Version	Date	Description
2.0	2026-04-15	Skill-audit batch 2026-04-15-analysis-quartet Wave 2. Breaking: phase renumber — Phase 2 (Dimension Wave) → Phase 3, Phase 2b (Deep Read) → Phase 2, Phase 4b (Content Eval) → Phase 3.5 (Cat 2-E + Pattern 10 combined). Structural: /analyze router ack, Warm-up, Routing Guide, NEW Integration section, NEW Retro section, NEW invocation tracking, Delegation & Defaults, consolidated Guard Rails top-5, PDF scope-explosion soft prompt (>100 pages), Done-when gates, PHASE N of M, Tag Suggestion → _shared ref, Prior Feedback Replay per CONVENTIONS §18, output table. T28 tagline removed from user-visible description (preserved in v1.0 history below).
1.1	2026-04-09	PDF fallback for Windows, add summary.md + deep-read.md + content-eval.jsonl + coverage-audit.jsonl to output list (Session #270 E2E test)
1.0	2026-04-08	Initial creation (T28 CAS, Session #269)