Evaluate trained machine learning models with the right metrics and comparison logic. Use for benchmark review, threshold selection, calibration, validation, and model comparison; not for feature engineering or leakage auditing.
Use this skill when the model already exists and the question is whether it is good enough. The focus is on choosing and interpreting evaluation metrics appropriate to the problem, then comparing candidate models or decision thresholds against them.
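As a concrete instance of threshold comparison, the sketch below sweeps candidate decision thresholds over predicted scores and picks the one that maximizes F1. This is an illustrative, self-contained example (the helper name `best_f1_threshold` and the toy data are assumptions, not part of this skill's interface); in practice the metric and the sweep strategy should match the problem's costs.

```python
import numpy as np

def best_f1_threshold(y_true, scores):
    """Illustrative helper: sweep each observed score as a threshold
    and return the (threshold, F1) pair with the highest F1."""
    best_t, best_f1 = 0.5, -1.0
    for t in np.unique(scores):
        pred = scores >= t
        tp = np.sum(pred & (y_true == 1))   # true positives
        fp = np.sum(pred & (y_true == 0))   # false positives
        fn = np.sum(~pred & (y_true == 1))  # false negatives
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        if f1 > best_f1:
            best_t, best_f1 = float(t), float(f1)
    return best_t, best_f1

# Toy labels and scores (hypothetical data for illustration only).
y = np.array([0, 0, 1, 1, 1])
s = np.array([0.1, 0.4, 0.35, 0.8, 0.9])
t, f1 = best_f1_threshold(y, s)
```

The same sweep works with any scalar metric; swapping F1 for a cost-weighted score changes only the inner computation, not the comparison logic.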
Related skills:
- training-machine-learning-models
- engineering-features-for-machine-learning
- ml-data-leakage-guard
- confusion-matrix-generator for class-level error breakdowns
- scientific-reporting when the evaluation must become a deliverable