Semantic evaluation of slide plans against the source paper and content analysis. Covers contribution coverage, narrative flow, redundancy detection, PMRC arc coherence, and actionable improvement directions. Produces a scored assessment that feeds back to the planner for revision.
Evaluate a slide_outline.json holistically by reading the original paper (document.md), the structured content analysis (content_analysis.md), and the proposed plan. Produce a scored assessment with per-dimension reasoning and, when the score is below threshold, actionable improvement directions that the planner can use to revise.
This is an LLM-driven semantic evaluation — it judges meaning, not syntax.
Structural integrity checks (asset ID existence, figure/table separation) are
handled separately by the lightweight verify_plan tool; you do NOT need to
repeat those here.
| File | Purpose |
|---|---|
| /docs/document.md | The full parsed paper — ground truth for claims and evidence |
| /docs/content_analysis.md | PMRC-aligned analysis from the research agent |
| /docs/slide_outline.json | The slide plan to evaluate |
| /docs/assets_manifest.json | Available figures, tables, equations (for context) |
Read ALL four files before beginning evaluation.
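The pre-flight read can be sketched mechanically. This is an illustrative sketch only: the `/docs` paths come from the table above, but the `load_inputs` helper name and the fail-fast behavior are assumptions, not part of the spec.

```python
import json
from pathlib import Path

DOCS = Path("/docs")
REQUIRED = [
    "document.md",
    "content_analysis.md",
    "slide_outline.json",
    "assets_manifest.json",
]

def load_inputs(root: Path = DOCS) -> dict:
    """Read all four evaluation inputs; fail fast if any is missing."""
    inputs = {}
    for name in REQUIRED:
        path = root / name
        if not path.exists():
            raise FileNotFoundError(f"required input missing: {path}")
        text = path.read_text(encoding="utf-8")
        # Parse the JSON inputs; keep the markdown ones as raw text.
        inputs[name] = json.loads(text) if name.endswith(".json") else text
    return inputs
```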
### 1. Contribution Coverage

Question: Does the plan cover every key contribution identified in content_analysis.md, and does it do so with appropriate depth?

Evaluation steps:

Scoring guide:
### 2. Narrative Flow

Question: Does the presentation tell a coherent story that builds understanding progressively, or does it feel like a disconnected list of facts?

Evaluation steps:

Scoring guide:
### 3. Redundancy

Question: Does every slide teach the audience something NEW, or are there slides that repeat information from other slides?

Evaluation steps:

Scoring guide:
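Redundancy is ultimately a semantic judgment, but a crude mechanical screen can surface candidate pairs to examine. The sketch below assumes each slide in slide_outline.json carries a `bullets` list of strings; that shape, the `flag_overlaps` name, and the 0.5 threshold are all illustrative assumptions.

```python
def bullet_words(slide: dict) -> set:
    """Lowercased word set of a slide's bullet text (assumed 'bullets' key)."""
    text = " ".join(slide.get("bullets", []))
    return {w.strip(".,;:").lower() for w in text.split() if w}

def flag_overlaps(slides: list, threshold: float = 0.5) -> list:
    """Return (i, j, jaccard) for slide pairs whose word overlap exceeds threshold."""
    flagged = []
    for i in range(len(slides)):
        for j in range(i + 1, len(slides)):
            a, b = bullet_words(slides[i]), bullet_words(slides[j])
            if not a or not b:
                continue
            jaccard = len(a & b) / len(a | b)
            if jaccard >= threshold:
                flagged.append((i, j, round(jaccard, 2)))
    return flagged
```

Flagged pairs are only candidates; a slide may legitimately revisit an idea at greater depth, so the final call stays with the semantic evaluation.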
### 4. PMRC Arc

Question: Does the plan follow the Problem → Method → Results → Conclusion framework with appropriate section allocation?

Evaluation steps:

Scoring guide:
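The phase-ordering part of this check can be expressed as a monotonicity test. A minimal sketch, assuming each slide carries a `section` field with one of the four PMRC values (that field name and its values are assumptions about the plan schema):

```python
PMRC = ["problem", "method", "results", "conclusion"]

def pmrc_order_ok(slides: list) -> bool:
    """True if slide sections never move backwards through the PMRC arc."""
    ranks = [PMRC.index(s["section"]) for s in slides if s.get("section") in PMRC]
    return all(a <= b for a, b in zip(ranks, ranks[1:]))
```

Allocation balance (e.g. whether Results gets enough slides) is a separate judgment this check does not cover.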
### 5. Audience Clarity

Question: Would a conference audience understand the presentation? Are slides well-designed for knowledge transfer?

Evaluation steps:

Scoring guide:
Compute the average of all 5 dimension scores (rounded to 1 decimal).
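The aggregation rule above is simple enough to state as code; a minimal sketch (the `overall_score` helper name is illustrative):

```python
def overall_score(dimension_scores: dict) -> float:
    """Average the five dimension scores, rounded to one decimal place."""
    if len(dimension_scores) != 5:
        raise ValueError("expected exactly 5 dimension scores")
    return round(sum(dimension_scores.values()) / 5, 1)
```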
Write your evaluation to /docs/plan_evaluation.md with this structure:
# Slide Plan Evaluation
## Overall Score: X.X / 10
## Dimension Scores
| Dimension | Score | Summary |
|---|---|---|
| Contribution Coverage | X/10 | [one sentence] |
| Narrative Flow | X/10 | [one sentence] |
| Redundancy | X/10 | [one sentence] |
| PMRC Arc | X/10 | [one sentence] |
| Audience Clarity | X/10 | [one sentence] |
## Detailed Reasoning
### Contribution Coverage
[2-3 sentences with specific references to which contributions are/aren't covered]
### Narrative Flow
[2-3 sentences about the story arc quality]
### Redundancy
[2-3 sentences identifying any specific duplications]
### PMRC Arc
[2-3 sentences about phase ordering and allocation]
### Audience Clarity
[2-3 sentences about slide design quality]
## Improvement Directions (only if score < 7)
[Numbered list of specific, actionable changes the planner should make.
Each direction should reference specific slide numbers and what to change.]
1. ...
2. ...
Reading order:

1. /docs/content_analysis.md first — this is your reference for what the plan SHOULD cover (key_contributions, central_message, evidence_proof).
2. /docs/slide_outline.json — this is what you're evaluating.
3. /docs/document.md — only if you need to verify a specific claim or check whether the plan misrepresents something.
4. /docs/assets_manifest.json — to understand what visuals are available and whether the plan uses them well.

When score < 7, improvement directions must be specific, actionable, and tied to concrete slide numbers, following the template above.
Return a concise summary (≤ 200 words) to the orchestrator stating the overall score, whether it cleared the score-7 threshold, and, if it did not, the gist of the improvement directions.
Do NOT return the full evaluation — the orchestrator can read the file if needed.