Name: Run Experiment
Author: wanshuiyin

Search skills.../

Run Experiment | Skills Pool

rsync -avz --include='*.py' --exclude='*' <local_src>/ <server>:<remote_dst>/

# 1. Push from local
git add -A && git commit -m "sync: experiment deployment" && git push

# 2. Pull on server
ssh <server> "cd <remote_dst> && git pull"

Check if wandb is already in the script — look for import wandb or wandb.init. If present, skip to Step 4.

If not present, add W&B logging to the training script:

import wandb
wandb.init(project=WANDB_PROJECT, name=EXP_NAME, config={...hyperparams...})

# Inside training loop:
wandb.log({"train/loss": loss, "train/lr": lr, "step": step})

# After eval:
wandb.log({"eval/loss": eval_loss, "eval/ppl": ppl, "eval/accuracy": acc})

# At end:
wandb.finish()

Metrics to log (add whichever apply to the experiment):
- train/loss — training loss per step
- train/lr — learning rate
- eval/loss, eval/ppl, eval/accuracy — eval metrics per epoch
- gpu/memory_used — GPU memory (via torch.cuda.max_memory_allocated())
- speed/samples_per_sec — throughput
- Any custom metrics the experiment already computes

Verify wandb login on the target machine:

ssh <server> "wandb status"  # should show logged in
# If not logged in:
ssh <server> "wandb login <WANDB_API_KEY>"

ssh <server> "screen -dmS <exp_name> bash -c '\
  eval \"\$(<conda_path>/conda shell.bash hook)\" && \
  conda activate <env> && \
  CUDA_VISIBLE_DEVICES=<gpu_id> python <script> <args> 2>&1 | tee <log_file>'"

# Linux with CUDA
CUDA_VISIBLE_DEVICES=<gpu_id> python <script> <args> 2>&1 | tee <log_file>

# Mac with MPS (PyTorch uses MPS automatically)
python <script> <args> 2>&1 | tee <log_file>

ssh <server> "screen -ls"

## Remote Server
- SSH: `ssh my-gpu-server`
- GPU: 4x A100 (80GB each)
- Conda: `eval "$(/opt/conda/bin/conda shell.bash hook)" && conda activate research`
- Code dir: `/home/user/experiments/`
- code_sync: rsync          # default. Or set to "git" for git push/pull workflow
- wandb: false              # set to "true" to auto-add W&B logging to experiment scripts
- wandb_project: my-project # W&B project name (required if wandb: true)
- wandb_entity: my-team     # W&B team/user (optional, uses default if omitted)

## Local Environment
- Mac MPS / Linux CUDA
- Conda env: `ml` (Python 3.10 + PyTorch)

Run Experiment

Workflow

Step 1: Detect Environment

Step 2: Pre-flight Check

Run Experiment

Workflow

Step 1: Detect Environment

Step 2: Pre-flight Check

Step 3: Sync Code (Remote Only)

Option A: rsync (default)

Option B: git (when `code_sync: git` is set in AGENTS.md)

Step 3.5: W&B Integration (when `wandb: true` in AGENTS.md)

Step 4: Deploy

Remote (via SSH + screen)

Local

Step 5: Verify Launch

Step 6: Feishu Notification (if configured)

Key Rules

AGENTS.md Example

Test

Feature Flags

Unit Tests

Integration Tests

Write Frontend Tests

Golang Testing

Run Experiment

Workflow

Step 1: Detect Environment

Step 2: Pre-flight Check

Run Experiment

Workflow

Step 1: Detect Environment

Step 2: Pre-flight Check

Step 3: Sync Code (Remote Only)

Option A: rsync (default)

Option B: git (when code_sync: git is set in AGENTS.md)

Step 3.5: W&B Integration (when wandb: true in AGENTS.md)

Step 4: Deploy

Remote (via SSH + screen)

Local

Step 5: Verify Launch

Step 6: Feishu Notification (if configured)

Key Rules

AGENTS.md Example

Test

Feature Flags

Unit Tests

Integration Tests

Write Frontend Tests

Golang Testing

Option B: git (when `code_sync: git` is set in AGENTS.md)

Step 3.5: W&B Integration (when `wandb: true` in AGENTS.md)