Run autonomous deep learning experiments in a loop: modify code, train with fixed time budget, evaluate against a single metric, keep or discard, repeat indefinitely. Use when setting up overnight autonomous research, running hyperparameter sweeps, architecture search, or any iterative experiment loop on a single GPU. Triggers include 'run autoresearch', 'autonomous experiments', 'experiment loop', or 'overnight training'.
Autonomous deep learning experimentation. An AI agent modifies training code, runs fixed-budget experiments, evaluates results against a single metric, and keeps or discards changes -- looping indefinitely until manually stopped.
Inspired by karpathy/autoresearch: the human writes the program (instructions), the agent writes the code.
Before invoking this skill, ensure:
A training script exists that the agent will modify. It should print the evaluation metric when a run completes.
An evaluation metric is defined -- a single scalar, lower-is-better or higher-is-better, printed by the training script. Must be comparable across experiments regardless of what the agent changes (architecture, batch size, etc.).
Data preparation is done -- any one-time setup (data download, tokenizer training) is already completed.
Dependencies are installed -- the environment is ready to uv run or python the training script.
The agent edits exactly one file. This keeps scope manageable and diffs reviewable. Everything else (data loading, evaluation, constants) is read-only.
Every experiment runs for the same wall-clock duration regardless of what the agent changes. This makes experiments directly comparable -- a larger model that trains slower is fairly compared against a smaller model that trains faster within the same budget.
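One way to enforce the fixed budget is a hard wall-clock cutoff around the run command. A minimal sketch using coreutils `timeout` (the helper name, the 300-second budget, and the `run.log` filename are assumptions):

```shell
# Run a command under a fixed wall-clock budget, capturing output in run.log.
# timeout sends SIGTERM when the budget expires; exit code 124 means the
# budget ran out, which is the normal case here, not a failure.
run_with_budget() {   # usage: run_with_budget <seconds> <cmd...>
  timeout "$@" > run.log 2>&1
  local status=$?
  [ "$status" -eq 0 ] || [ "$status" -eq 124 ]
}
# e.g. run_with_budget 300 uv run train.py
```

Treating 124 as success matters: hitting the budget is expected on every healthy run, and only other nonzero codes should be logged as crashes.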
One number decides keep or discard. No multi-objective balancing. The metric must be independent of implementation details (e.g., bits-per-byte instead of cross-entropy loss, so vocab size changes are fairly compared).
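As an illustration of a tokenizer-independent metric, mean cross-entropy in nats per token can be converted to bits per byte by charging the total bits to the byte count rather than the token count; the `ce`, `tokens`, and `bytes` values below are made-up numbers:

```shell
# bits-per-byte = (mean CE in nats/token) * tokens / (bytes * ln 2).
# Dividing by bytes (not tokens) keeps runs with different vocab sizes
# comparable. The three input values are hypothetical.
bpb=$(awk -v ce=1.38 -v tokens=1000000 -v bytes=4200000 \
  'BEGIN { printf "%.6f", ce * tokens / (bytes * log(2)) }')
echo "val_bpb: $bpb"
```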
All else being equal, simpler is better: a change that doesn't improve the metric isn't worth its added complexity.
Once the loop begins, the agent runs indefinitely until manually interrupted. No asking "should I continue?" -- the user might be asleep. If the agent runs out of ideas, it should think harder: re-read the code, try combining near-misses, try radical changes, reverse previous assumptions.
Work with the user to configure the experiment:
Identify:

- The target file the agent will modify (e.g. train.py)
- Read-only supporting files (e.g. prepare.py, evaluate.py)
- The command that runs an experiment (e.g. uv run train.py)
- The metric name (e.g. val_bpb, val_accuracy)

Then create the session branch:

```
# Propose a tag based on today's date
git checkout -b autoresearch/<tag>
```
The branch must not already exist. Each experiment session gets a fresh branch.
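The date-tagged branch setup, including the must-not-exist check, could be sketched as follows (`new_session_branch` is a hypothetical helper name):

```shell
# Create a fresh session branch named after today's date.
# Refuses to reuse an existing branch: each session starts clean.
new_session_branch() {   # usage: new_session_branch [tag]
  local tag="${1:-$(date +%Y-%m-%d)}"
  local branch="autoresearch/$tag"
  if git rev-parse --verify --quiet "refs/heads/$branch" >/dev/null; then
    echo "branch $branch already exists" >&2
    return 1
  fi
  git checkout -q -b "$branch"
}
```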
Read ALL files the agent will work with (the target file plus any read-only supporting files) for full context.
Create results.tsv with a header row:

```
commit	val_bpb	memory_gb	status	description
```

Columns (tab-separated, NOT comma-separated):

- commit -- git short hash (7 chars)
- the metric (e.g. val_bpb) -- use 0.000000 for crashes
- memory_gb -- peak VRAM in GB, rounded to .1f -- use 0.0 for crashes
- status -- keep, discard, or crash
- description -- short text of what this experiment tried

Do NOT commit results.tsv -- leave it untracked by git.
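Appending rows with printf keeps the tabs literal; a sketch, where `log_result` is a hypothetical helper and the column set mirrors the header above:

```shell
# Initialize the log once, then append one tab-separated row per experiment.
[ -f results.tsv ] || printf 'commit\tval_bpb\tmemory_gb\tstatus\tdescription\n' > results.tsv
log_result() {   # usage: log_result <commit> <metric> <memory_gb> <status> <description>
  printf '%s\t%s\t%s\t%s\t%s\n' "$1" "$2" "$3" "$4" "$5" >> results.tsv
}
# e.g. log_result abc1234 0.961234 11.2 keep "wider MLP hidden layer"
```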
The very first experiment is always the baseline: run the training script as-is, record the result. This establishes the reference point for all future comparisons.
Confirm setup with the user, then begin the experiment loop.
See references/experiment-protocol.md for the complete protocol.
Summary:
LOOP FOREVER:
1. Check git state (current branch/commit)
2. Modify the target file with an experimental idea
3. git commit the change
4. Run the experiment (redirect output to run.log)
5. Extract the metric from run.log
6. Log results to results.tsv
7. If improved: KEEP (advance the branch)
8. If equal or worse: DISCARD (git reset to previous commit)
9. Repeat
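The metric-extraction and keep/discard decision (steps 5-8) can be sketched as two small helpers; the `val_bpb: 0.9612` log-line format is an assumption, and the metric is treated as lower-is-better:

```shell
# Pull the last reported metric out of run.log (assumed line: "val_bpb: 0.9612").
extract_metric() {
  grep -oE 'val_bpb: *[0-9.]+' run.log | tail -1 | grep -oE '[0-9.]+$'
}
# Strict improvement check for a lower-is-better metric; equal is NOT better,
# so ties are discarded (simpler is better).
improved() {   # usage: improved <new> <best>
  awk -v n="$1" -v b="$2" 'BEGIN { exit !(n + 0 < b + 0) }'
}
# In the loop:
#   if improved "$metric" "$best"; then best=$metric   # KEEP: branch already advanced
#   else git reset --hard HEAD~1; fi                   # DISCARD the experimental commit
```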
Each iteration takes ~5 minutes (the time budget) plus a few seconds for startup/eval overhead. Expect ~12 experiments/hour, ~100 overnight.
After the session, use the analysis notebook template to visualize results and review the key analyses. See references/analysis-template.md.
| File | Purpose |
|---|---|
| experiment-protocol.md | Detailed experiment loop with crash handling and decision rules |
| analysis-template.md | Jupyter notebook template for post-session analysis |