Autonomous ML iteration on a Ray cluster. Generates train.py boilerplate from a model file and runs the autoresearch loop. User provides model.py; the skill handles Ray, DDP, dataset, and metric extraction.
Extends autoresearch with two key additions:

- Boilerplate generation: from the user's model.py, the skill generates train.py (Ray + DDP).
- Cluster execution: training runs on a Ray cluster. See references/ray-cluster.md for full cluster setup and known issues.
```
model.py                    ← autoresearch scope (user's model, nn.Module subclass named "Model")
train.py                    ← generated boilerplate, never modified by autoresearch
autoraysearch-results.tsv   ← results log (gitignored)
```
| Subcommand | Purpose |
|---|---|
| `/autoraysearch` | Run the autonomous loop (sequential) |
| `/autoraysearch --parallel N` | Run N experiments simultaneously per iteration |
| `/autoraysearch:plan` | Interactive wizard: goal → model → train.py → baseline → launch |
| `/autoraysearch:setup` | Generate train.py from a model file (no goal wizard) |
- `/autoraysearch` → run the loop (requires an existing baseline; if there is none, run plan first)
- `/autoraysearch:plan` → run the planning wizard

Read references/plan-workflow.md for the full protocol.
Quick summary:

- Verify a `Model` class exists in model.py.
- The generated train.py prints `val_acc=X.XXXX` as its last line.
- The generated file follows references/train-template.md.

Read references/setup-protocol.md for full details.
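The first requirement — that model.py defines a `Model` class — can be checked statically, without importing torch or the model's dependencies. A minimal sketch using Python's `ast` module (`has_model_class` is a hypothetical helper, not part of the skill):

```python
import ast

def has_model_class(path: str) -> bool:
    """Statically check that the file defines a top-level class named 'Model'."""
    with open(path) as f:
        tree = ast.parse(f.read())
    return any(
        isinstance(node, ast.ClassDef) and node.name == "Model"
        for node in tree.body
    )
```

Because this parses rather than imports, it works even when the model's dependencies are not installed in the environment running the check.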
If the cluster is not running:

```bash
# Head node
ray start --head --port=6379 --temp-dir=/tmp/$USER/ray --num-gpus=4

# Worker node
ray start --address=<head-ip>:6379 --num-gpus=4
```
Read references/parallel-loop-protocol.md for full details.
LOOP:
1. Review: read model.py + git log + results log
2. Ideate: pick next atomic change to model.py
3. Modify: one change to model.py
4. Commit: git commit before verify
5. Verify: python train.py
6. Extract: val_acc from last line of stdout
7. Decide: improved → keep | same/worse → git reset --hard HEAD~1
8. Log: append to autoraysearch-results.tsv
9. Repeat: NEVER STOP, NEVER ASK "should I continue?"
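Steps 5–7 of the loop above can be sketched as a small driver: run train.py, parse `val_acc` from the last stdout line, and keep the commit only on improvement (an illustrative sketch; the function names are assumptions, not part of the skill):

```python
import subprocess

def extract_val_acc(stdout: str) -> float:
    """Parse 'val_acc=X.XXXX' from the last non-empty line of stdout."""
    last = stdout.strip().splitlines()[-1]
    if not last.startswith("val_acc="):
        raise ValueError(f"unexpected last line: {last!r}")
    return float(last.split("=", 1)[1])

def verify_and_decide(best_so_far: float) -> tuple[float, bool]:
    """Run train.py, compare against the best metric so far, revert on no improvement."""
    out = subprocess.run(
        ["python", "train.py"], capture_output=True, text=True, check=True
    ).stdout
    metric = extract_val_acc(out)
    if metric > best_so_far:  # metric_direction: higher_is_better
        return metric, True   # keep the commit
    subprocess.run(["git", "reset", "--hard", "HEAD~1"], check=True)
    return best_so_far, False  # change discarded
```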
Each iteration generates N candidate changes, applies them to N git branches,
submits N Ray jobs simultaneously, keeps the best. See references/parallel-loop-protocol.md.
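The selection step of the parallel loop reduces to picking the candidate branch whose metric most improves on the baseline, or none if nothing improved (a sketch under assumed inputs — a branch-to-metric mapping and a scalar baseline; names are illustrative):

```python
from typing import Optional

def pick_best(results: dict[str, float], baseline: float,
              higher_is_better: bool = True) -> Optional[str]:
    """Return the branch that beats the baseline by the most, or None if
    no candidate improved on it."""
    sign = 1 if higher_is_better else -1
    best_branch = max(results, key=lambda branch: sign * results[branch])
    if sign * results[best_branch] > sign * baseline:
        return best_branch
    return None
```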
`Model` — train.py imports this exact name.

What the loop does NOT tune (those live in train.py config): training settings such as the epoch budget.
```
# metric_direction: higher_is_better
iteration	commit	metric	delta	status	description
0	baseline	0.8728	0.0	baseline	initial state
1	a1b2c3d	0.8810	+0.008	keep	increase conv channels 64->128 in first block
2	-	0.8690	-0.004	discard	add extra ResBlock (underfits in 20 epochs)
```
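Appending a row in step 8 of the loop can look like this — a sketch assuming the tab-separated layout shown above, with the delta computed against the baseline metric (`log_result` is an illustrative name):

```python
def log_result(path: str, iteration: int, commit: str, metric: float,
               baseline: float, status: str, description: str) -> None:
    """Append one tab-separated row; delta is signed, relative to the baseline."""
    delta = metric - baseline
    row = [str(iteration), commit, f"{metric:.4f}", f"{delta:+.3f}",
           status, description]
    with open(path, "a") as f:
        f.write("\t".join(row) + "\n")
```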