Autonomous experiment loop for AI agents. Use when the user wants to run systematic experiments — optimizing hyperparameters, searching for better configurations, ablation studies, or any task where an agent should iteratively try changes, measure results, and keep or discard based on a metric. Triggers on phrases like "run experiments", "optimize", "autoresearch", "ablation", "hyperparameter search", "find the best config".
You are now operating as an autonomous researcher. Your job is to systematically explore a search space by running experiments one at a time, measuring results against a clear metric, and building on what works.
Core philosophy: Humans set direction and constraints. You perform exhaustive exploration within those boundaries. Your randomness is a feature — you'll try things humans wouldn't think of. But you must be disciplined: one variable at a time, hypothesis first, measure after.
Autoresearch enforces two things that make AI agents effective researchers:
Discipline: Change only one variable at a time. Form a hypothesis, run the experiment, confirm or refute. Without this, you'll tweak three things at once, get a result, and have no clue which made the difference.
Memory: Git history is your experiment notebook. You can see what you've already tried, what worked, what didn't. Without this, you'd endlessly repeat yourself. With it, you iteratively build on your own results.
- `/autoresearch setup` — Interactive setup: define the experiment scope, metric, target files, and constraints
- `/autoresearch run` — Start the autonomous experiment loop
- `/autoresearch analyze` — Analyze `results.tsv` and summarize findings

If no argument is given, default to `setup` if no `autoresearch.config.md` exists in the project root, otherwise default to `run`.
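The default-command rule can be sketched as a small dispatcher. This is a sketch, not part of the protocol: the `autoresearch` function name and the `echo` placeholder stand in for whatever your agent harness actually invokes.

```shell
# Dispatch: an explicit subcommand wins; otherwise pick setup or run
# based on whether the config file already exists in the project root.
autoresearch() {
  cmd="$1"
  if [ -z "$cmd" ]; then
    if [ -f autoresearch.config.md ]; then
      cmd=run      # config exists: resume experimenting
    else
      cmd=setup    # no config yet: walk through setup
    fi
  fi
  echo "dispatching: $cmd"   # placeholder for the real subcommand
}
```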
## Setup (`/autoresearch setup`)

Before running experiments, you must establish the experiment protocol with the user. Walk through each item and write the answers to `autoresearch.config.md` in the project root.
1. GOAL: What are you trying to optimize? (e.g., "minimize validation loss", "maximize throughput", "reduce latency")
2. METRIC: What is the single number that determines success?
- How is it measured? (command, script, test output)
- What direction is better? (lower/higher)
3. TARGET FILES: Which file(s) can you modify?
- List explicitly. Everything else is READ-ONLY.
4. RUN COMMAND: What command runs one experiment?
- e.g., `python train.py`, `make benchmark`, `npm test`
5. EXTRACT COMMAND: How do you extract the metric from the run output?
- e.g., `grep "^val_loss:" run.log`, parse JSON output, read a file
6. TIME BUDGET: How long should each experiment run?
- Fixed time budget makes experiments directly comparable.
- Also set a kill timeout (e.g., 2x the budget).
7. CONSTRAINTS:
- Files that must NOT be modified (evaluation, data prep, etc.)
- Packages that must NOT be added
- Resource limits (memory, disk, etc.)
- Any invariants that must hold
8. BRANCH TAG: Name for this experiment session.
- Branch will be: autoresearch/<tag>
- e.g., autoresearch/mar17-lr-sweep
9. BASELINE: Do we need to run a baseline first? (usually yes)
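Items 4–6 can be made concrete with a small wrapper. This is a sketch: `python train.py` and the `val_loss:` log-line format are illustrative assumptions, and `timeout` is the GNU coreutils utility.

```shell
# Run one experiment under a kill timeout of 2x the budget, then pull
# the last reported metric out of the log. train.py and the "val_loss:"
# line format are assumptions for illustration only.
BUDGET=600                                   # time budget per run, seconds

run_once() {   # run_once CMD...  -> writes run.log, returns exit status
  timeout "$((BUDGET * 2))" "$@" > run.log 2>&1
}

extract_metric() {   # last val_loss value reported in run.log
  grep '^val_loss:' run.log | tail -n 1 | awk '{ print $2 }'
}
```

Usage: `run_once python train.py` exits with status 124 if the kill timeout fires; on success, `extract_metric` yields the number to record in `results.tsv`.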
After resolving all questions, write autoresearch.config.md:
# Autoresearch Configuration
## Goal
<what we're optimizing>
## Metric
- **Name**: <metric name>
- **Direction**: <lower|higher> is better
- **Extract command**: <how to get the number from run output>
## Target Files
- <file1> (description of what can be changed)
- <file2> (description of what can be changed)
## Read-Only Files
- <file1> (why it's read-only)
## Run Command
<run command>
## Branch
autoresearch/<tag>
When setup is complete:
- Create the branch: run `git checkout -b autoresearch/<tag>` from the current branch
- Create `results.tsv` with header: `commit\t<metric_name>\tstatus\tdescription`

## Run (`/autoresearch run`)

Read `autoresearch.config.md` to load the experiment protocol. Then enter the loop.
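The setup's final bookkeeping can be sketched as a helper. This is a sketch; `init_session` is an illustrative name, and the tag and metric name come from the answers gathered above.

```shell
# Finish setup: create the session branch and initialize the
# experiment log with the header row the loop expects.
init_session() {  # init_session TAG METRIC_NAME
  git checkout -b "autoresearch/$1" &&
  printf 'commit\t%s\tstatus\tdescription\n' "$2" > results.tsv
}
```

For example, `init_session mar17-lr-sweep val_loss` creates `autoresearch/mar17-lr-sweep` and a `results.tsv` whose metric column is `val_loss`.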
Review `results.tsv` and the recent git log to understand what's been tried, then:

```shell
# 1. Make ONE focused change to target file(s)
#    - Change only one variable at a time
#    - Keep the change small and reviewable

# 2. Commit the change
git add <target files>
git commit -m "<concise description of the change>"

# 3. Run the experiment
<run_command> > run.log 2>&1

# 4. Extract the metric
<extract_command>

# 5. Handle crashes
# If the run crashed or timed out:
#   - Read the error from run.log
#   - Record as crash in results.tsv
#   - Revert: git reset --hard HEAD~1
#   - Diagnose and try a different approach
```
Record the result in `results.tsv` (tab-separated; do NOT commit this file):

```
<commit_hash>\t<metric_value>\t<status>\t<description>
```

Where `status` is one of:
- `keep` — metric improved, commit stays on branch
- `discard` — metric equal or worse, revert the commit
- `crash` — run failed, revert the commit

The decision rule after each run:

```
IF metric improved (strictly better than best so far):
  → KEEP the commit (branch advances)
  → Log: "KEEP: <description> (<metric>: <old> → <new>)"
ELIF metric equal or worse:
  → DISCARD: git reset --hard HEAD~1
  → Log: "DISCARD: <description> (<metric>: <value> vs best <best>)"
ELIF crashed or timed out:
  → CRASH: git reset --hard HEAD~1
  → Log: "CRASH: <description> (error: <brief error>)"
```
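The record-and-decide step can be sketched as portable shell helpers. This is a sketch: `DIRECTION` mirrors the Metric section of the config, and `improved`/`record` are illustrative names, not part of the protocol.

```shell
# Append one row to results.tsv and decide keep/discard.
# DIRECTION is "lower" or "higher", from the Metric section of the config.
DIRECTION=lower
BEST=""                         # best metric seen so far on this branch

improved() {  # improved NEW BEST -> exit 0 if NEW is strictly better
  [ -z "$2" ] && return 0       # first successful run always wins
  if [ "$DIRECTION" = lower ]; then
    awk -v n="$1" -v b="$2" 'BEGIN { exit !(n < b) }'
  else
    awk -v n="$1" -v b="$2" 'BEGIN { exit !(n > b) }'
  fi
}

record() {  # record COMMIT METRIC STATUS DESCRIPTION
  printf '%s\t%s\t%s\t%s\n' "$1" "$2" "$3" "$4" >> results.tsv
}
```

After a run: `if improved "$metric" "$BEST"; then record "$(git rev-parse --short HEAD)" "$metric" keep "<description>"; BEST=$metric; else record "$(git rev-parse --short HEAD)" "$metric" discard "<description>"; git reset --hard HEAD~1; fi`. Note the strict inequality: an equal metric is a discard, matching the rule above.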
What to try (roughly in order of expected impact):
When stuck (no improvement in 5+ consecutive experiments):
Simplicity criterion:
## Analyze (`/autoresearch analyze`)

Read `results.tsv` and the git log, then produce a summary:
Format as a clear report. If possible, suggest the user visualize with a progress chart.
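A minimal analysis pass over `results.tsv` might look like the following. This is a sketch assuming the tab-separated format defined above and a lower-is-better metric; it writes a small sample `results.tsv` purely for illustration.

```shell
# Sample results.tsv (three experiments; tab-separated as specified).
printf 'a1\t3.00\tkeep\tbaseline\na2\t2.85\tkeep\tlower lr\na3\t2.97\tdiscard\tbigger batch\n' > results.tsv

# Summarize: count per status and the best kept metric (lower is better;
# flip the comparison for higher-is-better metrics).
summary=$(awk -F'\t' '
  $3 == "keep"    { keeps++; if (best == "" || $2 + 0 < best + 0) { best = $2; desc = $4 } }
  $3 == "discard" { discards++ }
  $3 == "crash"   { crashes++ }
  END { printf "keep=%d discard=%d crash=%d best=%s (%s)", keeps, discards, crashes, best, desc }
' results.tsv)
echo "$summary"
```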
This protocol works for any optimization task, not just ML training. Examples:
| Domain | Metric | Target File | Run Command |
|---|---|---|---|
| ML training | val_loss, val_bpb | train.py | python train.py |
| Compiler optimization | benchmark time | config.toml | make bench |
| Web performance | Lighthouse score | webpack.config.js | npm run build && lighthouse |
| Algorithm tuning | ops/sec | solver.py | python benchmark.py |
| Prompt engineering | eval accuracy | prompts.yaml | python eval.py |
| Database tuning | query latency | postgresql.conf | pgbench |
| CSS/rendering | layout shift score | styles.css | npm run perf-test |
The key insight: any task with a measurable metric and a file to modify can be autoresearched.
This protocol works with any AI agent that can read/write files, run shell commands, and use git. If you're running this outside OpenClaw (e.g., Claude Code, Codex, Cursor, Aider):
- Read `autoresearch.config.md` for the experiment protocol
- Use `results.tsv` as your experiment memory

For the original autoresearch methodology and implementation details, see reference.md.