Record and maintain Concept Encoder training and evaluation metadata. Use when preparing a run, creating train or eval git tags, monitoring training, syncing evaluation reports, updating docs/2_Experiments_Registry/master_experiment_log.md, writing run reports, or checking WandB and git linkage. Not for changelog updates or deciding experiment priorities.
Use this skill for experiment metadata and results.
Do not use this skill for generic refactors, architecture cleanup, or standalone CHANGELOG.md updates. Use engineering-change-tracking for code-change traceability.
Do not use this skill to decide which hypothesis to test next. Use research-methodology for experiment selection and interpretation.
Each tracked run should capture:
- `run_id`
- `git_commit`, `git_tag`, `git_branch`. Training scripts should call `get_git_info()` from `training/utils_training.py` and pass the returned values into `wandb.init(config=...)`.
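A minimal sketch of the git metadata capture described above. The real helper lives in `training/utils_training.py`; its exact signature is assumed here, and the `wandb.init` call is shown as a comment because the project name and `run_id` are illustrative.

```python
import subprocess

def get_git_info():
    """Sketch of a helper like get_git_info() in training/utils_training.py.

    Returns commit, tag, and branch, falling back to "unknown" when git is
    unavailable or the value does not exist (e.g. no tag on HEAD)."""
    def run(*args):
        try:
            out = subprocess.check_output(
                ["git", *args], stderr=subprocess.DEVNULL, text=True
            ).strip()
            return out or "unknown"
        except (subprocess.CalledProcessError, FileNotFoundError):
            return "unknown"

    return {
        "git_commit": run("rev-parse", "HEAD"),
        "git_tag": run("describe", "--tags", "--exact-match"),
        "git_branch": run("rev-parse", "--abbrev-ref", "HEAD"),
    }

# Pass the metadata into the run config so the WandB run links back to the
# exact code state (project and run_id below are placeholders):
# wandb.init(project="concept-encoder", config={"run_id": "ce_042", **get_git_info()})
```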
Training workflow:
- Create a git tag of the form `train/{run_id}_{YYYYMMDD}`.
- Record `git_commit`, `git_tag`, and `git_branch`.
- Add an entry to `docs/2_Experiments_Registry/master_experiment_log.md`.
- Run `analysis/run_concept_analysis.py` on intermediate checkpoints when useful.
- Update `docs/2_Experiments_Registry/master_experiment_log.md` after the run.
- Write a report in `docs/2_Experiments_Registry/run_reports/` when the run is non-trivial or teaches something important.

Evaluation workflow:
- Create a git tag of the form `eval/{benchmark}_{YYYYMMDD}`.
- Run `scripts/sync_evaluation_reports.ps1` when needed.
- Update `docs/2_Experiments_Registry/master_experiment_log.md` with scores, report links, and WandB reference.
- Add a note under `docs/4_Research_Notes/` when the run reveals a new failure mode or research insight.

When summarizing a run, include:
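The tag naming used in the workflows above can be sketched as a small helper. This assumes the local date is acceptable for the `{YYYYMMDD}` suffix; the function name `make_run_tag` is hypothetical, not part of the repository.

```python
from datetime import date

def make_run_tag(kind: str, name: str) -> str:
    """Build a tag like train/{run_id}_{YYYYMMDD} or eval/{benchmark}_{YYYYMMDD}.

    kind: "train" or "eval"; name: the run_id or benchmark name."""
    if kind not in ("train", "eval"):
        raise ValueError(f"unexpected tag kind: {kind!r}")
    # date.__format__ accepts strftime codes, so this yields e.g. 20250101
    return f"{kind}/{name}_{date.today():%Y%m%d}"

print(make_run_tag("train", "ce_baseline"))  # e.g. train/ce_baseline_20250101
```

The resulting string can then be passed to `git tag` (for example via `subprocess`) before training starts, so the tag and the WandB config reference the same commit.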
Hand off to other skills:
- Use `engineering-change-tracking` when the task is about code refactors, architecture edits, CHANGELOG.md, or direction shifts.
- Use `huggingface-project` when a promising checkpoint is ready for upload.