Scoring Calibration

Use this skill when computing review scores, applying decision rules, or calibrating review standards to a specific venue.

Score Dimensions

Every review assigns a score on each of the following six dimensions, plus a separate confidence rating:

Dimension	Range	Description
Overall	1-10	Holistic assessment
Soundness	1-10	Technical correctness
Novelty	1-10	Originality of contribution
Clarity	1-10	Writing and presentation quality
Significance	1-10	Impact and importance
Reproducibility	1-10	Whether the results can be reproduced
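
The table above can be captured as a small record type that rejects out-of-range values. This is a minimal sketch, not a prescribed implementation: the class name `ReviewScores` and its field names are hypothetical, and it assumes scores are whole numbers. Confidence is left out because its range is not specified here.

```python
from dataclasses import dataclass, fields

# Range shared by all six dimensions in the table above.
SCORE_MIN, SCORE_MAX = 1, 10


@dataclass(frozen=True)
class ReviewScores:
    """Hypothetical container for the six scored dimensions."""

    overall: int
    soundness: int
    novelty: int
    clarity: int
    significance: int
    reproducibility: int

    def __post_init__(self) -> None:
        # Check every field against the 1-10 range.
        for f in fields(self):
            value = getattr(self, f.name)
            # Assumption: scores are integers (bool is excluded explicitly,
            # since bool is a subclass of int in Python).
            if isinstance(value, bool) or not isinstance(value, int):
                raise TypeError(f"{f.name} must be an integer, got {value!r}")
            if not SCORE_MIN <= value <= SCORE_MAX:
                raise ValueError(
                    f"{f.name} must be in {SCORE_MIN}-{SCORE_MAX}, got {value}"
                )


scores = ReviewScores(
    overall=7,
    soundness=8,
    novelty=6,
    clarity=7,
    significance=6,
    reproducibility=5,
)
```

Validating at construction time means a malformed review fails early, before any decision rule is applied to it.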

Score	Meaning
8-10	Strong accept — top 10% of submissions
6-7	Weak accept — above threshold, some issues
5	Borderline — could go either way
3-4	Weak reject — below threshold, significant issues
1-2	Strong reject — fundamental flaws
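
The score bands above translate directly into a lookup function. The following is a minimal sketch; the function name `score_band` is hypothetical, and it assumes integer scores, which the gap-free bands suggest.

```python
def score_band(score: int) -> str:
    """Map a 1-10 score to its band label from the table above."""
    # Assumption: scores are integers; bool is rejected explicitly
    # because bool is a subclass of int in Python.
    if isinstance(score, bool) or not isinstance(score, int):
        raise TypeError(f"score must be an integer, got {score!r}")
    if not 1 <= score <= 10:
        raise ValueError(f"score must be in 1-10, got {score}")

    if score >= 8:
        return "Strong accept"
    if score >= 6:
        return "Weak accept"
    if score == 5:
        return "Borderline"
    if score >= 3:
        return "Weak reject"
    return "Strong reject"


print(score_band(7))  # prints "Weak accept"
print(score_band(5))  # prints "Borderline"
```

Checking the bands from the top down keeps each branch to a single comparison and makes it easy to confirm that every score from 1 to 10 falls into exactly one band.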