This skill should be used when the user asks to "create a new experiment", "start an experiment", "新しい実験", "実験を作って", "run training", "train a model", "record results", "結果を記録", "plan next experiment", "次の実験を考えて", "review experiment history", "実験の履歴を見て", or wants to follow the experiment lifecycle (plan, create, implement, train, record).
Guide the full experiment lifecycle: plan, create, implement, train, and record results.
| Phase | Action |
|---|---|
| Understand | Review competition docs in backlog (backlog doc list) |
| Plan | Review backlog and past experiments, create experiment task |
| Create | task new-exp EXP=expXXX to create experiment directory and backlog task |
| Implement | Write train.py, settings.py, run code quality checks |
| Train | task train-local or task train-vertex |
| Record | Update backlog task with results |
Before planning any experiment, review the competition documentation stored in backlog:
```
backlog doc list        # List all documents
backlog doc view DOC-N  # Read a specific document
```
Competition documents (overview, data description, evaluation metric, etc.) are managed as backlog documents. Check what's available and review relevant materials before designing experiments.
Before starting a new experiment, review the backlog and past experiments:
```
backlog search --type task exp --plain  # Search experiment tasks
backlog overview                        # Project-level summary
```
Every experiment MUST have a corresponding backlog task. Create it before starting implementation:
```
backlog task create "expXXX: Short description of experiment" \
  -d "Hypothesis: ... / Changes from base: ... / Expected outcome: ..." \
  -l exp -l expXXX \
  --ac "Training completes without errors" \
  --ac "CV score recorded" \
  --priority medium
```
Required conventions:
- exp: All experiment tasks MUST have the exp label for filtering
- expXXX: All experiment tasks MUST have the experiment name (e.g., exp001) as a label
- Title format: "expXXX: Short description of experiment" (e.g., exp002: ...)
- When an experiment builds on a previous one, link it with --dep TASK-N (the parent experiment's task)

```
task new-exp EXP=exp002                # From template
task new-exp EXP=exp002 SOURCE=exp001  # Copy from existing experiment
```
This creates models/exp002/ with train.py, settings.py, and inference.py. A backlog task is automatically created with labels exp and exp002.
If models/exp002/submission/ exists after creation, this is a Kaggle code competition — the model must be submitted as a Kaggle kernel, not a CSV file. The submission/ directory is auto-included when competition_platform: kaggle and is_code_competition: true in project.yml. Override with KAGGLE_CODE_SUB=true or KAGGLE_CODE_SUB=false.
After creation, update the backlog task with experiment details:
```
backlog task edit TASK-N -d "Hypothesis: ... / Changes from base: ... / Expected outcome: ..."
backlog task edit TASK-N --plan "Implementation approach: ..."
```
Before implementing train.py, decide the cross-validation strategy. The strategy must match how the test set was constructed.
For the decision flow and code examples, read .claude/skills/experiment-workflow/references/validation-strategy.md.
For code competitions where test data is hidden, check public notebooks/discussions or ask the user to confirm assumptions about the test split.
Quick reference:
| Condition | Strategy |
|---|---|
| Time-series problem | TimeSeriesSplit |
| Train/test split by distinct groups | StratifiedGroupKFold |
| Categorical target or imbalanced classes | StratifiedKFold |
| Multi-label classification | MultilabelStratifiedKFold |
| None of the above | KFold |
Record the chosen validation strategy in the backlog task description or plan.
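To make the quick-reference table concrete, here is a minimal OOF-split sketch using StratifiedKFold; swap in whichever splitter the table selects for your data. The toy arrays and the constant "prediction" are illustrative stand-ins:

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold

# Toy stand-ins for the real feature matrix and target
X = np.random.rand(100, 4)
y = np.array([0, 1] * 50)

skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
oof = np.full(len(y), np.nan)  # one out-of-fold prediction per training row

for fold, (train_idx, val_idx) in enumerate(skf.split(X, y)):
    # Fit on train_idx, predict on val_idx; a constant stands in for a model here
    oof[val_idx] = y[train_idx].mean()

assert not np.isnan(oof).any()  # every row received exactly one OOF prediction
```

The same loop shape works for TimeSeriesSplit or StratifiedGroupKFold (the latter additionally takes a `groups` argument in `split`).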
train.py must use tyro.cli with a main() function to support CLI arguments (e.g., --debug):
```python
from settings import Config, DirectorySettings


def predict(model, df, ...):
    """Inference logic. Also called from inference.py."""
    ...


def main(debug: bool = False) -> None:
    settings = DirectorySettings(exp_name="expXXX")
    config = Config()
    if debug:
        settings.artifact_dir = settings.artifact_dir / "debug"
        settings.output_dir = settings.artifact_dir
        config.epochs = 1

    # ... data loading, training, model save ...

    # Validation inference
    val_predictions = predict(model, val_df)
    # Compute evaluation metrics and save OOF predictions to CSV


if __name__ == "__main__":
    import tyro

    tyro.cli(main)
```
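train.py should also persist run metrics to artifact_dir; a minimal sketch of writing metrics.json (the function name and payload keys are illustrative):

```python
import json
from pathlib import Path


def save_metrics(artifact_dir: Path, cv_score: float, fold_scores: list[float], config: dict) -> None:
    """Write metrics.json (CV score, per-fold scores, config) into artifact_dir."""
    artifact_dir.mkdir(parents=True, exist_ok=True)
    payload = {"cv_score": cv_score, "fold_scores": fold_scores, "config": config}
    (artifact_dir / "metrics.json").write_text(json.dumps(payload, indent=2))
```

Call it at the end of main() with the resolved artifact_dir so every run leaves a self-describing record alongside MLflow.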
Key conventions:
- Tracking setup lives in train.py only. Do not put tracking setup in settings.py or inference.py.
- The entry point is main(), invoked via tyro.cli(main)
- The if __name__ == "__main__" guard is required (enables safe import by inference.py)
- predict() is defined in train.py; inference.py imports it via from train import predict
- debug: bool = False is the standard flag; add other CLI args as needed
- Run validation inference through predict() (same pipeline as submission) and compute evaluation metrics
- Save OOF predictions to artifact_dir
- Write metrics.json to artifact_dir: include CV score, per-fold scores, and config. This remains useful alongside MLflow tracking.
- Save diagnostic plots to artifact_dir: visualize OOF predictions vs ground truth. Choose plots appropriate for the task (e.g., scatter + residuals for regression, confusion matrix + calibration for classification).
- Keep tunable parameters in Config (in settings.py). Do not use module-level constants for tunable values. This centralizes experiment configuration and makes it easy to compare settings across experiments.

inference.py is the submission pipeline that runs in an internet-off environment. It must be self-contained and produce the final submission.
Requirements:
- Import predict() from train.py: from train import predict — inference logic is defined in train.py and shared
- main() + if __name__ == "__main__" guard: wrap execution in main() with a guard to allow safe imports
- Read inputs from input_dir and trained model artifacts from artifacts_dir
- Produce the final output (write submission.csv to output_dir, or use the evaluation API if required)
- Submission format: follow the competition's specification exactly. Check the competition overview and data description documents in backlog for the expected output format, column names, and file naming conventions.
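Before submitting, a cheap sanity check against the sample submission catches format drift early (the helper name is hypothetical, not part of the template):

```python
import csv
from pathlib import Path


def check_submission(submission_path: Path, sample_path: Path) -> None:
    """Fail fast if submission.csv deviates from the sample's header or row count."""
    sample_rows = list(csv.reader(sample_path.open()))
    sub_rows = list(csv.reader(submission_path.open()))
    assert sub_rows[0] == sample_rows[0], f"header mismatch: {sub_rows[0]}"
    assert len(sub_rows) == len(sample_rows), "row count mismatch"
```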
After writing code, always run:
- /simplify to review and simplify
- task fmt (ruff check --fix + ruff format)
- task ty (ty check)

After implementation and code quality checks pass, commit the changes using the /commit-commands:commit skill.
Always run training commands in the background using run_in_background: true on the Bash tool. Training can take minutes to hours, and blocking the conversation prevents the user from doing other work. After launching, inform the user that training is running and they can check progress with TaskOutput.
Before starting training, use AskUserQuestion to ask the user how they want to run training. Present the available options:
- Vertex AI with L4 GPU: task train-vertex EXP=expXXX ACCELERATOR_TYPE=NVIDIA_L4
- Vertex AI with V100 GPU: task train-vertex EXP=expXXX ACCELERATOR_TYPE=NVIDIA_TESLA_V100
- Vertex AI, CPU only: task train-vertex EXP=expXXX
- Local: task train-local EXP=expXXX
- Debug run: task train-local EXP=expXXX EXTRA_ARGS="--debug" or task train-vertex EXP=expXXX EXTRA_ARGS="--debug"

GPU-requiring tasks should default to NVIDIA_L4. Machine type is auto-resolved from accelerator type by GpuConfig in src/kaggle_ops/vertex.py.
| Accelerator | Default Machine Type | Command |
|---|---|---|
| NVIDIA_L4 (default) | g2-standard-8 | task train-vertex EXP=expXXX ACCELERATOR_TYPE=NVIDIA_L4 |
| NVIDIA_TESLA_V100 | n1-highmem-8 | task train-vertex EXP=expXXX ACCELERATOR_TYPE=NVIDIA_TESLA_V100 |
| NVIDIA_TESLA_A100 | a2-highgpu-1g | task train-vertex EXP=expXXX ACCELERATOR_TYPE=NVIDIA_TESLA_A100 |
| CPU only | n1-highmem-8 | task train-vertex EXP=expXXX |
```
task train-local EXP=exp002                                  # Run training locally
task train-vertex EXP=exp002 ACCELERATOR_TYPE=NVIDIA_L4      # Vertex AI with L4 (auto machine type)
task train-local EXP=exp002 EXTRA_ARGS="--debug"             # Debug run locally (epochs=1, data limited)
task train-vertex EXP=exp002 EXTRA_ARGS="--debug"            # Debug run on Vertex AI
task run-local SCRIPT=models/exp002/inference.py             # Run inference locally
```
Use EXTRA_ARGS="--debug" to run in debug mode. This is useful for verifying end-to-end pipeline correctness before launching a full training run.
Debug mode applies these overrides in train.py:
- Artifacts are written to artifacts/debug/ to avoid mixing with production artifacts
- Training is shortened (e.g., config.epochs = 1) when debug=true

EXTRA_ARGS is a generic parameter that passes arbitrary arguments to train.py. On Vertex AI, arguments are forwarded via vertex.py's extra_args through the container entrypoint.
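The --debug branch in train.py amounts to overrides like these (field names mirror the skeleton above; the default epoch count is illustrative):

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass
class Config:
    epochs: int = 20  # illustrative production default


def apply_debug_overrides(config: Config, artifact_dir: Path) -> tuple[Config, Path]:
    """Mirror the if debug: branch — 1 epoch, artifacts under a debug/ subdirectory."""
    config.epochs = 1
    return config, artifact_dir / "debug"
```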
After training completes, immediately record the CV score in the backlog task. Do not wait for user instruction — this is an automatic step after every successful training run.
```
# Record CV score immediately after training completes
backlog task edit TASK-N --append-notes "CV score: 0.8765 (config summary)"
backlog task edit TASK-N --check-ac 1 --check-ac 2
```
LB score is recorded later when the user provides feedback after submission (e.g., "Public LB: 0.8750"). At that point, update the task with the full summary:
```
backlog task edit TASK-N --append-notes "Public LB: 0.8750"
backlog task edit TASK-N --final-summary "CV=0.8765, LB=0.8750. Next: try feature X (see TASK-M)"
backlog task edit TASK-N -s "Done"
```
- For DirectorySettings and path resolution across environments, read .claude/skills/experiment-workflow/references/directory-settings.md.
- For the engineer_features pattern (stateless/stateful separation, f_ prefix, encoder block, polars), read .claude/skills/experiment-workflow/references/tabular-feature-engineering.md.
- For validation strategy selection, read .claude/skills/experiment-workflow/references/validation-strategy.md.
- For backlog task operations, use the backlog skill.
- For experiment tracking, use the mlflow-primary skill.