Design and audit AI/ML experiments for comparability, reproducibility, and realistic compute use. Use when the user needs help defining baselines, ablations, metrics, data splits, seed handling, experiment tracking, statistical interpretation, compute budgets, or a reproducible experiment spec before running large jobs.
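Seed handling in particular benefits from being specified rather than improvised. One way to make runs comparable across machines is to derive each run's seed deterministically from the experiment ID; this is a minimal sketch, and the function name and ID format are illustrative assumptions, not part of any fixed scheme:

```python
import hashlib


def derive_seed(experiment_id: str, run_index: int) -> int:
    """Derive a deterministic per-run seed from the experiment ID,
    so reruns and ablations use the same seeds on any machine."""
    # Hash the ID and run index together, then take 32 bits as the seed.
    digest = hashlib.sha256(f"{experiment_id}:{run_index}".encode()).hexdigest()
    return int(digest[:8], 16)


# Three fixed seeds for a hypothetical "lr-sweep-v1" experiment.
seeds = [derive_seed("lr-sweep-v1", i) for i in range(3)]
```

Recording the derivation rule in the spec, rather than the raw seed values, keeps the spec short and makes adding runs later unambiguous.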
Treat the experiment as a specification, not a pile of runs.
Return one or more of:
- experiment_spec: ready-to-run design
- comparison_table: baselines, metrics, and fairness notes
- reproducibility_risks: likely failure points
- compute_budget: staged plan with must-have and optional runs

See references/experiment-checklists.md for the experiment-spec template, run checklist, and common failure modes.
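As a rough illustration of what an experiment_spec might contain, here is a minimal sketch using a dataclass; the field names and values are hypothetical assumptions, not a required schema:

```python
from dataclasses import dataclass, field


@dataclass
class ExperimentSpec:
    """Illustrative experiment spec; fields are assumptions, not a fixed schema."""
    name: str
    baselines: list[str]          # what every new run is compared against
    metrics: list[str]            # reported for all runs, not cherry-picked
    seeds: list[int]              # fixed up front for comparability
    data_split: dict[str, float]  # fractions for train/val/test
    must_have_runs: int           # minimum runs before drawing conclusions
    optional_runs: int = 0        # extra runs if the compute budget allows


spec = ExperimentSpec(
    name="lr-sweep-v1",
    baselines=["majority-class", "logistic-regression"],
    metrics=["accuracy", "macro-f1"],
    seeds=[0, 1, 2],
    data_split={"train": 0.8, "val": 0.1, "test": 0.1},
    must_have_runs=3,
)
```

Writing the spec down before launching jobs is what makes the comparison_table and compute_budget auditable afterwards.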