Source auditor: classifies corpus sources, scores their reliability, verifies metadata, and recommends inclusion or exclusion
Evaluate every source in the collected corpus for reliability, classify source types, verify metadata accuracy, flag risky materials, and recommend inclusion or exclusion. Produce an audit report for Gate 2 human approval.
This skill implements the auditing portion of PRD Stage 2 (Section 17.3) and PRD Agent 12.4.
$ARGUMENTS[0]: workspace path (e.g., workspaces/ar-2026-03-22-a1b2c3d4)
Read from the workspace:
{workspace}/sources/corpus_manifest.json: list of all source IDs
{workspace}/sources/academic/*.json: academic source records
{workspace}/sources/policy/*.json: policy source records
{workspace}/sources/media/*.json: media source records
{workspace}/analysis/scoping/scoping_report.json: scope boundaries for relevance assessment
Read every source record file listed in the corpus manifest.
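The read step above could be sketched roughly as follows. This is a minimal sketch, not the skill's actual implementation: the function name `load_source_records` is hypothetical, and it globs the three category folders instead of parsing corpus_manifest.json, whose schema is not shown here.

```python
import json
from pathlib import Path

# Category subfolders named in the input list above.
CATEGORIES = ("academic", "policy", "media")


def load_source_records(workspace):
    """Return {path: record} for every JSON file under sources/<category>/.

    Hypothetical helper: it does not consult corpus_manifest.json, so a
    real run should still cross-check the loaded IDs against the manifest.
    """
    records = {}
    for category in CATEGORIES:
        folder = Path(workspace) / "sources" / category
        if not folder.is_dir():
            continue  # a workspace may lack one of the categories
        for path in sorted(folder.glob("*.json")):
            with path.open(encoding="utf-8") as fh:
                records[path] = json.load(fh)
    return records
```

Keying the result by path makes it straightforward to overwrite each record file in place once the audit fields have been updated.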
For each source, assess and update the following fields:
Source type verification (source_type):
Reliability scoring (reliability_score, 0.0 to 1.0):
Score based on these criteria:
Verification status (verification_status):
verified: the source clearly exists and its metadata is accurate
partially_verified: the source likely exists but some metadata could not be confirmed
unverified: the source could not be confirmed to exist, or its metadata is questionable
Peer review status (peer_reviewed):
Inclusion recommendation:
Add a new field audit_recommendation with values:
include: reliability >= 0.4 and relevant to scope
exclude: reliability < 0.3 or clearly outside scope
flag: reliability from 0.3 up to (but not including) 0.4, or borderline relevance; needs human decision
Audit notes:
Add a field audit_notes with a brief explanation of the scoring rationale.
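The inclusion rule can be expressed as a small decision function. This is a sketch under two assumptions: the relevance labels ("in_scope", "borderline", "out_of_scope") are hypothetical names for the scope judgment, and a score of exactly 0.4 counts as include, so the flag band is treated as 0.3 up to but not including 0.4.

```python
def recommend(reliability, relevance):
    """Map a reliability score and a relevance label to a recommendation.

    `relevance` is a hypothetical label: "in_scope", "borderline",
    or "out_of_scope".
    """
    # Exclusion wins first: very low reliability or clearly outside scope.
    if reliability < 0.3 or relevance == "out_of_scope":
        return "exclude"
    # Anything in the 0.3 to 0.4 band, or of borderline relevance,
    # is left for a human decision.
    if reliability < 0.4 or relevance == "borderline":
        return "flag"
    return "include"
```

For example, `recommend(0.35, "in_scope")` yields "flag", while `recommend(0.9, "borderline")` also yields "flag" because high reliability does not settle a borderline scope question.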
For any source where authors is empty ([]), or contains only placeholder values like "Various", "Unknown", "et al.", or "N/A":
Run a targeted WebSearch for each of these queries:
"{title}" authors {year}
site:{domain_from_url} "{title}"
If the authors are found, update the authors field with the actual author names found.
If they are not found, set authors to ["[authors unidentified]"] and add a note to audit_notes: "Author identification failed despite targeted search"
This step is critical: proper author attribution is essential for academic citations. Do NOT leave "Various" or "Unknown" in the final source records.
For sources with verification_status of unverified, run a targeted WebSearch:
"{title}" "{first_author}" {year} to confirm the source exists
Overwrite each source record file with the updated fields (reliability_score, verification_status, peer_reviewed, source_type, audit_recommendation, audit_notes).
Write {workspace}/sources/audit/source_audit_report.json:
{
  "total_sources": 0,
  "included": 0,
  "excluded": 0,
  "flagged": 0,
  "average_reliability": 0.0,
  "by_category": {
    "academic": { "count": 0, "avg_reliability": 0.0 },
    "policy": { "count": 0, "avg_reliability": 0.0 },
    "media": { "count": 0, "avg_reliability": 0.0 }
  },
  "excluded_sources": [
    { "source_id": "string", "title": "string", "reason": "string" }
  ],
  "flagged_sources": [
    { "source_id": "string", "title": "string", "concern": "string" }
  ]
}
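One way to assemble a report of this shape from the audited records is sketched below. Several details are assumptions rather than part of the specification: each record is taken to carry a `category` field matching its folder, averages are rounded to two decimals, and `audit_notes` is reused as the `reason` and `concern` text.

```python
def _avg(scores):
    """Mean rounded to two decimals; 0.0 for an empty list."""
    return round(sum(scores) / len(scores), 2) if scores else 0.0


def build_audit_report(records):
    """Aggregate audited source records into the report structure."""
    by_category = {}
    for name in ("academic", "policy", "media"):
        scores = [r["reliability_score"] for r in records
                  if r["category"] == name]  # "category" is an assumed field
        by_category[name] = {"count": len(scores),
                             "avg_reliability": _avg(scores)}

    def count(rec):
        return sum(1 for r in records if r["audit_recommendation"] == rec)

    def listing(rec, key):
        # audit_notes doubles as the reason/concern text in this sketch.
        return [{"source_id": r["source_id"], "title": r["title"],
                 key: r.get("audit_notes", "")}
                for r in records if r["audit_recommendation"] == rec]

    return {
        "total_sources": len(records),
        "included": count("include"),
        "excluded": count("exclude"),
        "flagged": count("flag"),
        "average_reliability": _avg([r["reliability_score"] for r in records]),
        "by_category": by_category,
        "excluded_sources": listing("exclude", "reason"),
        "flagged_sources": listing("flag", "concern"),
    }
```

The returned dictionary can be written with `json.dump` to the report path given above.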
Write {workspace}/sources/audit/source_audit_summary.md:
A human-readable summary (600 words max) that includes:
This summary is presented to the user at Gate 2 for approval.