Score H2EWD manual chapters on quality, value, timeliness, standalone, and voice. Reads full files, uses calibrated anchors, writes scores to frontmatter, and regenerates the search index.
Score one or more QMD chapters on 5 dimensions and write scores to frontmatter.
/score-chapters knowledge/strategy/earth-optimization-prize.qmd
/score-chapters knowledge/problem/*.qmd
/score-chapters all # score all chapters in _quarto-manual.yml
Rigor, analytical depth, and clarity. Rigor can come from EITHER empirical analysis OR mechanism design — both count equally.
Empirical rigor (e.g. 1-pct-treaty-impact): formal methodology, Monte Carlo, peer-reviewed sources, explicit limitations, uncertainty quantification.
Mechanism design rigor (e.g. earth-optimization-prize, incentive-alignment-bonds): well-specified rules, game-theoretic reasoning, worked examples with concrete numbers, failure mode analysis, comparison to alternatives, addresses objections.
Three components, scored together:
Relevance today.
Can a new reader landing on this page — with NO prior reading of the book — understand the argument, get value from it, and potentially use it as an entry point to the broader work?
What standalone measures:
What standalone does NOT penalize:
What DOES indicate low standalone:
Scale:
How well does the writing match the H2EWD signature style?
The gold standard is index-manual.qmd: an alien anthropologist observing human absurdity with dark humor that carries rigorous arguments. Every joke makes a point. The comedy comes from the GAP between naive tone and devastating truth.
Key voice mechanics (from the wishonia-style skill):
Scale:
Note: Academic papers (1-pct-treaty-impact, dfda-spec, invisible-graveyard) SHOULD have lower voice scores. That's correct for their format. Voice measures entertainment value, not quality.
Before scoring, read these calibration files:
index-manual.qmd — Voice=10, the gold standard style reference. Read lines 19-300.knowledge/economics/1-pct-treaty-impact.qmd — Quality=10, Value=10, Timeliness=9, Standalone=10, Voice=5. Read first 300 lines.knowledge/problem/genetic-slavery.qmd — Quality=7, Value=7, Timeliness=8, Standalone=7, Voice=6. Mid-tier anchor.knowledge/solution/decentralized-census-bureau.qmd — Quality=4, Value=4, Timeliness=5, Standalone=5, Voice=4. Low-tier anchor.QMD files contain Quarto variable shortcodes like {{< var military_to_government_clinical_trials_spending_ratio >}} that render as actual values (e.g., "604"). Agents MUST read the variable-resolved version, not the raw QMD, because:
To generate a preview with resolved variables:
cd E:/code/disease-eradication-plan
python scripts/preview-qmd-with-variables.py <file.qmd>
# Or save to a temp file for agent consumption:
python scripts/preview-qmd-with-variables.py <file.qmd> -o /tmp/preview.md
For batch scoring, generate previews for all files into a temp directory:
mkdir -p /tmp/h2ewd-previews
for f in knowledge/problem/*.qmd knowledge/solution/*.qmd ...; do
python scripts/preview-qmd-with-variables.py "$f" -o "/tmp/h2ewd-previews/$(echo $f | tr '/' '-')"
done
Then tell agents to read from /tmp/h2ewd-previews/ instead of the raw QMD files.
Scores are written to the ORIGINAL .qmd frontmatter, not to the preview files.
python scripts/preview-qmd-with-variables.py <file> -o /tmp/preview.mdscores: block in the ORIGINAL .qmd frontmatter using Editcd E:/code/disease-eradication-plan && python -c "from dih_models.search_index_generator import SearchIndexGenerator; from pathlib import Path; SearchIndexGenerator(Path('.')).generate_chat_index()"Include in every agent prompt:
How critical is this piece to the H2EWD system actually working? This is NOT about writing quality — a dry spec that defines a critical subsystem scores higher than a beautifully written tangent.