Run research/paper/code reproduction tasks with strict execution logging. Use when requests involve reproducing results, debugging training/evaluation pipelines, Slurm/HPC experiments, dataset/model setup, or iterative small-task execution where every subtask must be written to a repository-level CODEX_RECORD.md.
Keep reproduction work and record-keeping coupled:
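- Write every subtask to the repository-level CODEX_RECORD.md.
- Keep /home focused on code, configs, CODEX_RECORD.md, and standard lightweight experiment results; larger artifacts belong on /scratch, referenced under Evidence.
- Related: exact-training-resume-guard.
- Update the record with scripts/update_codex_record.py.
- Each entry carries the fields Task, Status, Summary, Evidence, Next.
- Put concrete pointers such as job IDs and log paths under Evidence.

Create or append entry: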
python ~/.codex/skills/repro-record-workflow/scripts/update_codex_record.py \
    --record /path/to/repo/CODEX_RECORD.md \
    --task "Run smoke preflight on A100 node" \
    --status failed \
    --summary "Import failed in trainer package" \
    --evidence "Job 10087183; log: /path/gui_g2_smoke-10087183.out" \
    --next "Patch missing import and resubmit smoke test"
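The helper script itself is not reproduced here. As a rough illustration of what the command above amounts to, the sketch below appends one entry with the five fields to a record file. The function names (`format_entry`, `append_entry`), the timestamp heading, and the bullet layout are all assumptions made for this example; the real update_codex_record.py may format entries differently.

```python
from datetime import datetime, timezone
from pathlib import Path

# Field names taken from the record fields listed above.
FIELDS = ("Task", "Status", "Summary", "Evidence", "Next")


def format_entry(task, status, summary, evidence, next_step, when=None):
    """Render one entry as text. The layout is hypothetical, not the script's."""
    when = when or datetime.now(timezone.utc)
    values = (task, status, summary, evidence, next_step)
    lines = [f"## {when:%Y-%m-%d %H:%M} UTC"]
    lines += [f"- {name}: {value}" for name, value in zip(FIELDS, values)]
    return "\n".join(lines) + "\n"


def append_entry(record_path, entry):
    """Append an entry, creating the file if needed and never truncating it."""
    path = Path(record_path)
    path.parent.mkdir(parents=True, exist_ok=True)
    # Separate entries with a blank line when the file already has content.
    needs_gap = path.exists() and path.stat().st_size > 0
    with path.open("a", encoding="utf-8") as fh:
        if needs_gap:
            fh.write("\n")
        fh.write(entry)
```

Opening the file in append mode keeps earlier entries intact, which matters when the record is the only trace of a failed job.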
Create record file if missing:
[ -f /path/to/repo/CODEX_RECORD.md ] || cp ~/.codex/skills/repro-record-workflow/references/record-template.md /path/to/repo/CODEX_RECORD.md