Systematically improve Mycelium instructions through measurement. Adapted from n-trax.

Commands

`baseline` -- Capture current performance

Run /eval-runner run-all
Record to .claude/optimization/baseline.json: timestamp, CLAUDE.md hash, overall and per-category metrics (pass_rate, avg_iterations, avg_time)

`test <variant>` -- Test a variant

Read variant from .claude/optimization/variants/<variant>.md
Apply the CLAUDE.md changes described
Run /eval-runner run-all
Store results in .claude/optimization/results/<variant>.json
Compare against baseline
Do NOT auto-revert -- let user decide

`report` -- Compare all variants

Generate comparison table: variant, pass rate, delta vs baseline, avg iterations, decision (keep/revert).

Systematically improve Mycelium instructions through measurement. Adapted from n-trax.

Commands

`baseline` -- Capture current performance

Run /eval-runner run-all
Record to .claude/optimization/baseline.json: timestamp, CLAUDE.md hash, overall and per-category metrics (pass_rate, avg_iterations, avg_time)

`test <variant>` -- Test a variant

Read variant from .claude/optimization/variants/<variant>.md
Apply the CLAUDE.md changes described
Run /eval-runner run-all
Store results in .claude/optimization/results/<variant>.json
Compare against baseline
Do NOT auto-revert -- let user decide

`report` -- Compare all variants

Generate comparison table: variant, pass rate, delta vs baseline, avg iterations, decision (keep/revert).

Prompt Optimizer

Commands

`baseline` -- Capture current performance

`test <variant>` -- Test a variant

`report` -- Compare all variants

Prompt Optimizer

Commands

`baseline` -- Capture current performance

`test <variant>` -- Test a variant

`report` -- Compare all variants

`exemplar <eval-name>` -- Capture winning trajectory

Workflow

Automation Audit Ops

Github Qa Labels

Jupyter Notebook

Tidb Integrationtest Recorder

Quality Nonconformance

Hugging Face Trackio

Prompt Optimizer

Commands

baseline -- Capture current performance

test <variant> -- Test a variant

report -- Compare all variants

Prompt Optimizer

Commands

baseline -- Capture current performance

test <variant> -- Test a variant

report -- Compare all variants

exemplar <eval-name> -- Capture winning trajectory

Workflow

Automation Audit Ops

Github Qa Labels

Jupyter Notebook

Tidb Integrationtest Recorder

Quality Nonconformance

Hugging Face Trackio

`baseline` -- Capture current performance

`test <variant>` -- Test a variant

`report` -- Compare all variants

`baseline` -- Capture current performance

`test <variant>` -- Test a variant

`report` -- Compare all variants

`exemplar <eval-name>` -- Capture winning trajectory