Integrate Weights & Biases (W&B) as the single system of record for research runs, configs, sweeps, code snapshots, artifacts, checkpoints, and paper-ready analysis. Use when setting up or refactoring experiment tracking, sweep execution, result export, checkpoint resume/eval, or LLM evaluation workflows around W&B.
Treat W&B as the experiment source of truth.
Record hyperparameters with wandb.config and metrics with wandb.log. Do not build a parallel logging system unless the project already requires local logs as a cache or failure fallback.
If working from inside divan/, move to the parent project before editing runtime code. Keep the skill in divan/, but put training code, scripts, configs, and tests in the parent project.
Add a small tracking block or equivalent computed fields to every run config.
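One way to attach such a tracking block is sketched below. The field names (project, group, job_type, git_sha, config_hash) and the helper names are illustrative assumptions, not a schema this skill prescribes; adapt them to whatever the parent project's configs already use. The sketch uses only the standard library, so it can be computed before any W&B call is made.

```python
import hashlib
import json
import subprocess


def git_sha(default="unknown"):
    """Return the current commit hash, or `default` outside a git checkout."""
    try:
        out = subprocess.run(
            ["git", "rev-parse", "HEAD"],
            capture_output=True, text=True, check=True,
        )
        return out.stdout.strip()
    except (OSError, subprocess.CalledProcessError):
        return default


def config_hash(config):
    """Short, stable hash of the config, ignoring the tracking block itself."""
    body = {k: v for k, v in config.items() if k != "tracking"}
    payload = json.dumps(body, sort_keys=True, default=str)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()[:12]


def with_tracking(config, project, group, job_type="train"):
    """Return a copy of `config` with a hypothetical `tracking` block added."""
    tracked = dict(config)  # shallow copy; the caller's dict is not mutated
    tracked["tracking"] = {
        "project": project,    # assumed field: W&B project name
        "group": group,        # assumed field: experiment group
        "job_type": job_type,  # assumed field: e.g. "train" or "eval"
        "git_sha": git_sha(),
        "config_hash": config_hash(config),
    }
    return tracked
```

The resulting dict could then be passed to wandb.init(project=..., group=..., job_type=..., config=tracked), so that the computed fields appear in wandb.config alongside the hyperparameters. Excluding the tracking block from the hash keeps the hash unchanged when the same config is tracked again.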