Skill for venue-calibrated scoring, score weighting formulas, decision rules, anti-bias mechanisms, and score interpretation across different venue tiers.
Use this skill when computing review scores, applying decision rules, or calibrating review standards to a specific venue.
Every review scores these 6 dimensions plus confidence:
| Dimension | Range | Description |
|---|---|---|
| Overall | 1-10 | Holistic assessment |
| Soundness | 1-10 | Technical correctness |
| Novelty | 1-10 | Originality of contribution |
| Clarity | 1-10 | Writing and presentation quality |
| Significance | 1-10 | Impact and importance |
| Reproducibility | 1-10 | Can results be reproduced? |
| Confidence | 1-5 | Reviewer's self-assessed expertise |
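
As a rough illustration of the table above, a single review could be held in a small record that checks each score against its range. This is only a sketch: the class name `ReviewScores`, the field names, and the choice to raise `ValueError` on out-of-range values are assumptions, not part of the skill itself.

```python
from dataclasses import dataclass, fields

# Ranges come from the dimensions table; everything else here is illustrative.
DIMENSION_RANGE = (1, 10)
CONFIDENCE_RANGE = (1, 5)


@dataclass(frozen=True)
class ReviewScores:
    """One reviewer's scores (hypothetical container, not a defined API)."""

    overall: int
    soundness: int
    novelty: int
    clarity: int
    significance: int
    reproducibility: int
    confidence: int

    def __post_init__(self):
        # Reject any score that falls outside the range given in the table.
        for f in fields(self):
            value = getattr(self, f.name)
            low, high = CONFIDENCE_RANGE if f.name == "confidence" else DIMENSION_RANGE
            if not low <= value <= high:
                raise ValueError(f"{f.name}={value} is outside {low}-{high}")
```

Validating at construction time keeps out-of-range values from reaching any later aggregation step.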
| Score | Meaning |
|---|---|
| 8-10 | Strong accept — top 10% of submissions |
| 6-7 | Weak accept — above threshold, some issues |
| 5 | Borderline — could go either way |
| 3-4 | Weak reject — below threshold, significant issues |
| 1-2 | Strong reject — fundamental flaws |
Acceptance threshold: Mean ≥ 7, no critical issues
| Score | Meaning |
|---|---|
| 7-10 | Strong accept |
| 5-6 | Accept with revisions |
| 4 | Borderline |
| 2-3 | Reject |
| 1 | Strong reject |
Acceptance threshold: Mean ≥ 6, critical issues addressed
| Score | Meaning |
|---|---|
| 6-10 | Accept |
| 4-5 | Accept with minor revisions |
| 3 | Borderline |
| 1-2 | Reject |
Acceptance threshold: Mean ≥ 5, no fatal flaws
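
The three scales above can be encoded as lookup data, which makes it easy to see how the same score reads differently under each threshold. The sketch below makes several assumptions: the tier keys (`threshold_7` and so on) are hypothetical labels, "Mean" is taken to be the mean of the Overall scores across reviewers, and the differing qualifiers (critical issues, fatal flaws) are collapsed into one `blocking_issues` flag.

```python
from statistics import mean

# Hypothetical tier keys; each maps to (mean threshold, score bands).
# Bands are (low, high, meaning), copied from the tables above.
TIERS = {
    "threshold_7": (7.0, [
        (8, 10, "Strong accept"), (6, 7, "Weak accept"), (5, 5, "Borderline"),
        (3, 4, "Weak reject"), (1, 2, "Strong reject"),
    ]),
    "threshold_6": (6.0, [
        (7, 10, "Strong accept"), (5, 6, "Accept with revisions"),
        (4, 4, "Borderline"), (2, 3, "Reject"), (1, 1, "Strong reject"),
    ]),
    "threshold_5": (5.0, [
        (6, 10, "Accept"), (4, 5, "Accept with minor revisions"),
        (3, 3, "Borderline"), (1, 2, "Reject"),
    ]),
}


def interpret(score, tier):
    """Return the meaning of an integer score under the given tier's scale."""
    _, bands = TIERS[tier]
    for low, high, meaning in bands:
        if low <= score <= high:
            return meaning
    raise ValueError(f"score {score} is outside 1-10")


def meets_threshold(overall_scores, tier, blocking_issues=False):
    """Assumed rule: mean Overall score reaches the threshold, nothing blocking."""
    threshold, _ = TIERS[tier]
    return mean(overall_scores) >= threshold and not blocking_issues


print(interpret(5, "threshold_7"))  # Borderline
print(interpret(5, "threshold_6"))  # Accept with revisions
```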
The weighted final score combines dimensions with fixed weights:
final_score = (
    0.30 × mean(soundness) +
    0.20 × mean(novelty) +
    0.20 × mean(significance) +
    0.15 × mean(clarity) +
    0.15 × mean(reproducibility)
)
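
A minimal Python sketch of the formula, assuming each review is a dict keyed by dimension name and that `mean(...)` averages a dimension across reviewers. The function name `final_score` mirrors the formula; the `weights` parameter and the check that weights sum to 1 are assumptions added for illustration.

```python
from statistics import mean

# Weights as given in the formula above.
DEFAULT_WEIGHTS = {
    "soundness": 0.30,
    "novelty": 0.20,
    "significance": 0.20,
    "clarity": 0.15,
    "reproducibility": 0.15,
}


def final_score(reviews, weights=DEFAULT_WEIGHTS):
    """Weighted sum of per-dimension means across all reviews."""
    # Assumed sanity check: the source does not say weights must sum to 1.
    if abs(sum(weights.values()) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    return sum(w * mean(r[dim] for r in reviews) for dim, w in weights.items())


reviews = [
    {"soundness": 8, "novelty": 6, "significance": 7, "clarity": 7, "reproducibility": 5},
    {"soundness": 6, "novelty": 6, "significance": 5, "clarity": 7, "reproducibility": 7},
]
# Per-dimension means: 7, 6, 6, 7, 6
# 0.30*7 + 0.20*6 + 0.20*6 + 0.15*7 + 0.15*6 = 6.45
print(round(final_score(reviews), 2))  # 6.45
```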
These weights can be overridden in .review-config.yaml: