Enforce the replication-protocol.md rule by cross-checking numeric claims in a manuscript against the actual R / Stata / Python outputs. Report PASS/FAIL per claim against tolerance thresholds. Use before submission and before releasing a replication package.
Compare numeric claims in a manuscript (point estimates, standard errors, p-values, counts) against the actual outputs produced by the analysis pipeline. Report PASS / FAIL per claim against the tolerance thresholds defined in .claude/rules/replication-protocol.md.
Core principle: If the paper says ATT = -1.632 (0.584) and the code produces -1.628 (0.591), we verify — numerically — that the difference is within the documented tolerance. No more "looks close enough" eyeballing.
Pair with a `/commit` pre-commit invocation on manuscript + analysis changes.

**Arguments:**

- `$0` — path to the manuscript (`.tex`, `.qmd`, `.md`, `.pdf`). Required.
- `$1` — path to the outputs directory. Defaults to `scripts/R/_outputs/`. Can be `_targets/objects/`, a Stata `.do`-file log directory, etc.

**Before auditing:**

- Read `replication-protocol.md` for the tolerance thresholds currently in effect.
- Rerun the analysis pipeline (e.g., `Rscript scripts/R/00_run_all.R`) so outputs are fresh.
- Confirm that `sessionInfo.txt` or an equivalent environment capture exists in the outputs dir.

Parse the manuscript for numeric claims. Patterns to match:
- Coefficient–SE pairs: `ATT = -1.632 (0.584)`, `$\beta = 0.342$ (0.091)`
- Starred significance: `\hat{\tau} = 1.28**`; `& -1.632$^{***}$ & 0.584 &` in LaTeX table environments
- Sample sizes and counts: "our sample of 2,847 firms", `$N = 2{,}847$`
- Summary statistics: `mean = 0.423, SD = 0.087`
- P-values: `p < 0.01`, `$p = 0.003$`

Record each claim as a tuple:
```
{
  claim_id: "Table2_col3_ATT",
  location: "Table 2, Column 3, row 'Treatment'",
  kind: "point_estimate" | "standard_error" | "p_value" | "count" | "percentage",
  reported_value: -1.632,
  uncertainty: 0.584,      # only for point estimates
  significance_stars: 3,   # 0-3 or None
  raw_context: "the ATT estimate of -1.632 (0.584) indicates..."
}
```
Write the extracted claims to .claude/session-reports/reproducibility_claims_[manuscript-name].json so the user can review the extraction before audit.
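The coefficient-with-SE pattern above can be sketched as a single regex pass. This is a minimal illustration, not the full matcher; the pattern and record fields are simplified assumptions:

```python
import re

# Matches e.g. "ATT = -1.632 (0.584)": a label, an estimate, an SE in parentheses.
COEF_SE = re.compile(
    r"(?P<label>[\w\\{}^]+)\s*=\s*"
    r"(?P<est>-?\d+\.\d+)\s*"
    r"\((?P<se>\d+\.\d+)\)"
)

def extract_claims(text: str) -> list[dict]:
    """Return one claim record per coefficient-SE match, with surrounding context."""
    claims = []
    for m in COEF_SE.finditer(text):
        claims.append({
            "kind": "point_estimate",
            "reported_value": float(m.group("est")),
            "uncertainty": float(m.group("se")),
            "raw_context": text[max(0, m.start() - 40):m.end() + 40],
        })
    return claims

claims = extract_claims("We estimate ATT = -1.632 (0.584) for the full sample.")
# one claim: reported_value -1.632, uncertainty 0.584
```

The real extractor needs more patterns (starred coefficients, LaTeX table cells, counts with `{,}` separators), but each reduces to the same "match, parse floats, keep context" shape.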
Scan $1 for corresponding values. Priority order:
1. `.rds` files — `readRDS(path)$coef[["treatment"]]`-style lookups. Can use `Rscript -e "saveRDS(summary(readRDS(...)), '/tmp/audit.rds')"` to extract.
2. `.tex` tables — parse LaTeX table cells directly; match on column headers + row labels.
3. `.csv` summary files — pandas/readr parse, key-value lookup.
4. `.out` / `.log` files (Stata, regress output) — regex extraction.
5. `.json` — direct key lookup.

Record each extracted result:
```
{
  source: "scripts/R/_outputs/results.rds",
  lookup_key: "fit_main$coefficients['treated']",
  value: -1.628,
  uncertainty: 0.591,
  p_value: 0.005
}
```
Use fuzzy heuristics when exact labels don't match:
- Synonym matching on coefficient labels (e.g., "treatment effect" ~ "ATT" ~ "treated")
- Matching on the `raw_context` field (table number, row label, description)

For every claim, produce a match candidate with a confidence score. Claims below 0.7 confidence get flagged as "UNMATCHED — manual review needed" rather than silently passing.
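One way to produce that confidence score is normalized string similarity with a synonym table layered on top. A sketch — the synonym map and scoring rule are assumptions; only the 0.7 cutoff comes from the protocol text:

```python
from difflib import SequenceMatcher

# Hypothetical alias groups; extend per project.
SYNONYMS = [{"att", "treatment effect", "treated"}]

def match_confidence(claim_label: str, result_label: str) -> float:
    a, b = claim_label.lower().strip(), result_label.lower().strip()
    # Known aliases beat raw string distance.
    for group in SYNONYMS:
        if a in group and b in group:
            return 1.0
    return SequenceMatcher(None, a, b).ratio()

match_confidence("ATT", "treated")           # 1.0 via the synonym table
match_confidence("beta_firm", "beta_firms")  # high ratio, clears the 0.7 bar
match_confidence("ATT", "n_obs")             # low, flagged as UNMATCHED
```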
For each matched claim, apply the thresholds from replication-protocol.md:
| Kind | Tolerance | Example |
|---|---|---|
| Integers (N, counts) | Exact | 2,847 must equal 2,847 |
| Point estimates | abs(reported - computed) < 0.01 | -1.632 vs -1.628 → diff = 0.004 → PASS |
| Standard errors | abs(reported - computed) < 0.05 | 0.584 vs 0.591 → diff = 0.007 → PASS |
| P-values | Same significance level | p<0.01 and p<0.01 → PASS; p<0.01 and p=0.03 → FAIL |
| Percentages | ±0.1pp | 42.3% vs 42.35% → PASS |
Respect any tolerance overrides the user has written into their replication-protocol.md fork (they may loosen for MC noise or tighten for administrative data).
Write `.claude/session-reports/reproducibility_audit_[manuscript-name].md`:

```markdown
# Reproducibility Audit: [Manuscript Title]

**Date:** [YYYY-MM-DD]
**Manuscript:** [path]
**Outputs directory:** [path]
**Tolerance source:** .claude/rules/replication-protocol.md

## Summary

| Status | Count |
|---|---|
| PASS | N |
| FAIL (diff > tolerance) | M |
| UNMATCHED (manual review) | K |
| **Overall verdict** | **PASS / FAIL** |

## PASS (all within tolerance)

| Claim | Reported | Computed | Diff | Tolerance |
|---|---|---|---|---|
| Table2_col3_ATT | -1.632 (0.584) | -1.628 (0.591) | 0.004 / 0.007 | 0.01 / 0.05 |

## FAIL (outside tolerance — BLOCKER)

| Claim | Reported | Computed | Diff | Tolerance | Location in paper |
|---|---|---|---|---|---|

## UNMATCHED (manual review)

| Claim | Raw context | Candidate sources |
|---|---|---|

## Environment

[sessionInfo excerpt]

## Next steps

1. Fix any FAIL rows — either update the manuscript or rerun analysis.
2. Review UNMATCHED rows — add explicit lookup keys or widen the search scope.
3. After zero FAILs, the paper is replication-ready.
```
**Related:**

- `/commit` pre-commit gate — see `replication-protocol.md` for the enforcement pattern.
- `.claude/rules/replication-protocol.md` — the tolerance contract.
- `.claude/skills/review-r/SKILL.md` — catches code-style issues; this skill catches NUMERICAL reproducibility.
- `.claude/skills/review-paper/SKILL.md` — content review; pair with this skill for a full pre-submission audit.

**Scope notes:**

- This skill verifies that -1.632 is reproducible. Whether -1.632 is the RIGHT estimand is a review-paper / domain-reviewer question.
- The `sessionInfo.txt` capture lets a reviewer see the env; pinning versions is on the user (via `renv.lock` or a `DESCRIPTION` file).