Plan and run a series of training experiments, then compare results
Plan, execute, and analyze a series of training runs based on the user's experiment description in $ARGUMENTS.
Present the planned runs as a table:

| Run | Name | Key Changes | Command |
|-----|------|-------------|---------|
CRITICAL: Run training jobs SEQUENTIALLY, one at a time. NEVER run jobs in parallel — the machine is compute-limited and parallel training will degrade performance for all runs.
For each run:

- Follow the `/train` skill conventions: `RAY_ADDRESS= uv run python run_experiment.py train --env <ENV> ...`
- Pass `--logdir /tmp/experiments/<experiment_name>/<run_name>` for organized output.
- Run the job in the foreground (do NOT use `run_in_background`). Use a generous timeout (600000 ms / 10 min).

After each run completes, extract these metrics from the training stdout:
Per-iteration metrics (from the table printed each iteration):
- `Mean Eprew` — episode reward
- `Mean Eplen` — episode length
- `Actor loss`, `Critic loss`
- `Mean KL Div` — policy divergence
- `Mean Entropy` — exploration
- `Clip Fraction` — PPO clipping rate
- `Mean noise std` — action noise

Summary metrics (from eval and timing lines):

- `fps` — frames per second

Anomaly detection — flag these issues:

- `nan` or `inf` in any metric

For each completed run, report: final reward, peak eval reward and its iteration, fps, and any anomalies flagged.
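The extraction and anomaly checks above can be sketched as follows. The exact stdout layout of `run_experiment.py` is an assumption here; adjust the pattern to the real per-iteration table and eval/timing lines.

```python
import math
import re

# Metric names taken from this document; the stdout format they appear in
# (e.g. "Mean Eprew: 12.5") is an assumption.
METRICS = [
    "Mean Eprew", "Mean Eplen", "Actor loss", "Critic loss",
    "Mean KL Div", "Mean Entropy", "Clip Fraction", "Mean noise std",
    "fps",
]

def extract_metrics(stdout: str) -> dict[str, list[float]]:
    """Collect every occurrence of each metric, in iteration order."""
    series: dict[str, list[float]] = {}
    for name in METRICS:
        # Matches e.g. "Mean Eprew: 123.45", "Mean Eprew | 123.45",
        # or "Mean KL Div: nan".
        pattern = re.compile(
            rf"{re.escape(name)}\s*[:|]?\s*(-?\d[\d.eE+-]*|nan|inf)"
        )
        series[name] = [float(m.group(1)) for m in pattern.finditer(stdout)]
    return series

def find_anomalies(series: dict[str, list[float]]) -> list[str]:
    """Flag metrics containing nan/inf so the run is reported as unstable."""
    return [
        f"{name} contains nan/inf"
        for name, values in series.items()
        if any(math.isnan(v) or math.isinf(v) for v in values)
    ]
```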
After all runs complete, produce a comparison summary:
Comparison table:
| Run | Final Reward | Peak Eval Reward | Peak Iter | Stable? | Key Hyperparam Diffs |
|-----|-------------|-----------------|-----------|---------|---------------------|
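Assembling that table programmatically can be sketched as below; the per-run field names are assumptions for illustration, not values produced by `run_experiment.py` itself.

```python
def comparison_table(runs: list[dict]) -> str:
    """Render the comparison summary as a markdown table.

    Each run dict is assumed (hypothetically) to carry: name, final_reward,
    peak_eval_reward, peak_iter, stable (bool), and diffs (key hyperparameter
    changes as a string).
    """
    header = "| Run | Final Reward | Peak Eval Reward | Peak Iter | Stable? | Key Hyperparam Diffs |"
    sep = "|-----|--------------|------------------|-----------|---------|----------------------|"
    rows = [
        f"| {r['name']} | {r['final_reward']:.2f} | {r['peak_eval_reward']:.2f} "
        f"| {r['peak_iter']} | {'yes' if r['stable'] else 'no'} | {r['diffs']} |"
        for r in runs
    ]
    return "\n".join([header, sep, *rows])
```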
Analysis: summarize which run performed best and why, note any stability issues, and suggest follow-up experiments.

Tips:

- Use `--n-itr 100-500` with `--eval-freq 50`.
- `--no-mirror`
- Keep `--num-procs` consistent across runs in the same experiment for a fair FPS comparison.
- Use short run names (e.g. `gamma095`, `lr1e3`).
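The tips above can be folded into a small command builder. This is a hypothetical sketch: the flag values chosen here are illustrative, any flags inside `extra_flags` come from the user's experiment description, and the `RAY_ADDRESS=` prefix from the conventions would be set via the subprocess environment rather than the argument list.

```python
def build_commands(env: str, experiment: str,
                   variants: dict[str, list[str]],
                   num_procs: int = 8) -> list[list[str]]:
    """Build one training command per run, following the tips above:
    a modest iteration budget, a shared --num-procs for a fair FPS
    comparison, and short run names reused as logdir suffixes.
    """
    commands = []
    for run_name, extra_flags in variants.items():
        commands.append([
            "uv", "run", "python", "run_experiment.py", "train",
            "--env", env,
            "--n-itr", "200",
            "--eval-freq", "50",
            "--num-procs", str(num_procs),
            "--logdir", f"/tmp/experiments/{experiment}/{run_name}",
            *extra_flags,  # per-run hyperparameter overrides (user-supplied)
        ])
    return commands
```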