Autonomous experiment loop — pick a metric (test speed, bundle size, build time, type-check time, etc.), then iteratively optimize it by making changes, measuring, keeping improvements, and reverting regressions. Usage: /autoresearch [optimization goal]
Run an autonomous experiment loop to optimize $ARGUMENTS in this monorepo.
Important: this is a Go backend + Next.js frontend monorepo using Nix/direnv. All build/test/lint commands require the Nix environment. If direnv is not active, prefix commands with `direnv exec .`.
If $ARGUMENTS is empty or vague, ask the user for:
- The benchmark command (e.g. `gotest`, `cd backend/ && go test ./internal/domain/codereview/...`, `cd apps/app && pnpm build`)
- The metric and its direction (lower or higher)
- The files/directories in scope

If $ARGUMENTS is clear enough, infer reasonable defaults and confirm with the user before proceeding. For example:
- `/autoresearch codereview test speed` → command: `cd backend/ && go test ./internal/domain/codereview/...`, metric: wall-clock seconds (lower is better), scope: `backend/internal/domain/codereview/`
- `/autoresearch app bundle size` → command: `cd apps/app && pnpm build`, metric: bundle size bytes (lower is better), scope: `apps/app/`
- `/autoresearch go build time` → command: `cd backend/ && go build ./...`, metric: wall-clock seconds (lower is better), scope: `backend/`

Look for autoresearch.jsonl and autoresearch.md in the repo root (they may be on disk but gitignored from a previous session).
- If they exist and match the current goal: read autoresearch.md to understand what's been tried, and read autoresearch.jsonl to reconstruct experiment history. Summarize progress so far, then skip to Phase 2 (the loop). Do NOT re-run setup or overwrite these files.
- If they exist but belong to a different goal: move them to .autoresearch/<old-goal-slug>/ for reference, then start fresh.

Create a branch for this optimization work. Use the repo's branch-create skill:
/branch-create chore/autoresearch-<short-goal-slug>
If already on a non-main branch (e.g. resuming), skip branch creation.
Before writing anything, deeply read all files in scope. Understand the architecture, patterns, and dependencies. This investment pays off — blind changes waste experiment cycles.
Create autoresearch.md at the repo root with this template:
# Autoresearch: <goal>
## Objective
<one-paragraph description of what we're optimizing and why>
## Metric
- **Primary**: <metric name> (<unit>) — <lower|higher> is better
- **Secondary**: <any secondary metrics to track, e.g. "test count must not decrease">
## Command
\`\`\`bash
<the benchmark command>
\`\`\`
## Checks Command
\`\`\`bash
<correctness checks command, if any — e.g. gotest, golint, cd apps/app && pnpm type-check>
\`\`\`
## Files in Scope
<list of directories/files that may be modified>
## Off Limits
<files/patterns that must not be touched>
## Constraints
<things that must not break>
## Baseline
- **Value**: <filled after first run>
- **Date**: <filled after first run>
## What's Been Tried
<filled as experiments run — keep this updated>
## Dead Ends
<approaches that were tried and reverted — don't repeat these>
Create autoresearch.sh at the repo root:
```bash
#!/usr/bin/env bash
set -euo pipefail
<the benchmark command>
```
Make it executable: chmod +x autoresearch.sh
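As a sketch, a filled-in autoresearch.sh for a test-speed goal might time the command and print the metric on its last line so it is easy to parse. The `BENCH_CMD` placeholder and the `METRIC_MS=` output convention below are illustrative assumptions, not anything this workflow prescribes:

```shell
#!/usr/bin/env bash
set -euo pipefail
# Illustrative benchmark script: BENCH_CMD stands in for the real command
# (e.g. "cd backend/ && go test -count=1 ./..."). Printing the metric as
# the final KEY=VALUE line makes extraction trivial.
BENCH_CMD=${BENCH_CMD:-"true"}
start=$(date +%s%N)          # nanoseconds since epoch (GNU date)
bash -c "$BENCH_CMD"
end=$(date +%s%N)
echo "METRIC_MS=$(( (end - start) / 1000000 ))"
```

This assumes GNU `date` (available in the Nix environment); BSD `date` does not support `%N`.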
If the user specified constraints that require correctness validation, also create autoresearch.checks.sh:
```bash
#!/usr/bin/env bash
set -euo pipefail
<the correctness check command(s)>
```
Make it executable: chmod +x autoresearch.checks.sh
Default checks: if the user doesn't specify a checks command, create a lightweight autoresearch.checks.sh scoped to the files being optimized — not /preflight (which is too heavy for every iteration). Choose checks that validate correctness without duplicating the benchmark:
- Backend: `cd backend/ && go vet ./... && golangci-lint run` (skip `go test` if the benchmark already runs tests)
- Frontend: `cd apps/app && pnpm type-check` (skip `pnpm build` if the benchmark already builds)
- Lint: `golint` for Go changes, `lint` for frontend changes

Reserve /preflight for the periodic full validation (step 2.6) and the final stopping check.
Warm-up: run ./autoresearch.sh once and discard the result. The first run of Go commands is always slower due to compilation cache warming and module resolution. Skipping warm-up inflates the baseline.
Run ./autoresearch.sh and extract the metric value. Record this as the baseline.
Log the baseline to autoresearch.jsonl (one JSON object per line):
```jsonl
{"type":"config","name":"<goal>","metric_name":"<name>","metric_unit":"<unit>","direction":"<lower|higher>","timestamp":"<ISO 8601>"}
{"run":1,"metric":<value>,"status":"keep","description":"baseline","timestamp":"<ISO 8601>"}
```
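A minimal way to append such records from the loop is a small shell helper. The `log_run` function below is hypothetical (not part of this workflow) and assumes the description contains no double quotes or backslashes:

```shell
#!/usr/bin/env bash
# Hypothetical helper: appends one experiment record per line.
# Args: run number, metric value, status, description.
# Assumes the description needs no JSON escaping.
log_run() {
  printf '{"run":%d,"metric":%s,"status":"%s","description":"%s","timestamp":"%s"}\n' \
    "$1" "$2" "$3" "$4" "$(date -u +%Y-%m-%dT%H:%M:%SZ)" >> autoresearch.jsonl
}

log_run 1 12.4 keep "baseline"
```

For descriptions that may contain special characters, `jq -cn` would be a safer serializer.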
Update autoresearch.md with the baseline value.
Commit these session files:
```bash
git add -f autoresearch.md autoresearch.sh autoresearch.jsonl
git add -f autoresearch.checks.sh 2>/dev/null || true
git commit -m "chore: autoresearch baseline for <goal>"
```
Note: session files are gitignored (autoresearch.* in .gitignore), so git add -f is needed to force-track them during the session. They get untracked at stop time.
Each iteration:
- Read autoresearch.md — especially "What's Been Tried" and "Dead Ends" — to avoid repeating failed approaches.
- Make a focused code change. Prefer small, isolated changes — they're easier to reason about and attribute metric changes to.
Rules:
Run the benchmark:
```bash
time ./autoresearch.sh 2>&1
```
Extract the primary metric from the output. Then run correctness checks:
```bash
./autoresearch.checks.sh 2>&1
```
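Extracting the metric can be as simple as parsing the benchmark's last output line. This sketch assumes the script ends with a `KEY=VALUE` line, which is an illustrative convention, not something autoresearch.sh is guaranteed to do:

```shell
#!/usr/bin/env bash
# Stand-in for: out=$(./autoresearch.sh 2>&1)
out=$'ok  \tgithub.com/example/pkg\t1.234s\nMETRIC_MS=1234'
# Take the last line and keep everything after the "=".
metric=$(printf '%s\n' "$out" | tail -n1 | cut -d= -f2)
echo "metric=$metric"
```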
Compare the metric to the previous best value:
| Outcome | Action |
|---|---|
| Metric improved | KEEP — stage and commit the change |
| Metric equal + simpler | KEEP — less code/complexity at same perf is a win |
| Metric equal (no simplification) or worse | DISCARD — revert all changes |
| Benchmark crashed | DISCARD — revert all changes |
| Checks failed | DISCARD — revert all changes, even if metric improved |
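For a lower-is-better metric, the numeric part of the decision table can be sketched as follows (all variable names are hypothetical; `awk` handles the floating-point comparison that plain `[` cannot):

```shell
#!/usr/bin/env bash
best=10.0          # best metric so far
metric=9.2         # this run's metric
checks_ok=true     # did autoresearch.checks.sh pass?

if [ "$checks_ok" != true ]; then
  decision=discard                # checks failed: revert even if faster
elif awk -v m="$metric" -v b="$best" 'BEGIN{exit !(m < b)}'; then
  decision=keep                   # strictly better: commit, update best
  best=$metric
else
  decision=discard                # equal or worse: revert
fi
echo "$decision"
```

The "metric equal + simpler" row is a judgment call about code complexity and is not captured by this numeric comparison.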
Keep (metric improved):
```bash
git add -A
git commit -m "perf: <short description of what changed>

Autoresearch run #<N>: <metric_name> <old_value> → <new_value> <unit> (<percentage>% improvement)"
```
Discard (anything else):
```bash
git checkout -- .
git clean -fd
```
Append to autoresearch.jsonl:
```jsonl
{"run":<N>,"metric":<value>,"status":"<keep|discard|crash|checks_failed>","description":"<what was tried>","timestamp":"<ISO 8601>"}
```
Update autoresearch.md:
Every 5 kept experiments, run /preflight as a full validation pass. This catches accumulated drift in codegen, formatting, or lint that individual checks might miss. If preflight fails, fix the issues and amend the last commit before continuing.
Go back to step 2.1. Do not stop. Do not ask for permission to continue.
Keep autoresearch.md current. This is the session's memory. A fresh agent should be able to resume from this file alone.

This is a Go backend + Next.js frontend monorepo using Nix/direnv. Key patterns:
- All repo scripts (`gotest`, `golint`, `bufgen`, etc.) need Nix/direnv. If direnv is not active, prefix with `direnv exec .`
- Backend (`backend/`):
  - Domain layout: `internal/domain/<name>/{service/,worker/,db/,invoke/}`
  - Tests: `gotest` (Nix script, runs `go test ./...` from `backend/`) or `cd backend/ && go test ./internal/domain/<name>/...`
  - Integration tests: `cd backend/ && go test -tags=integration ./...`
  - Lint: `golint` / `golint-fix`
  - Build: `cd backend/ && go build ./...`
- Frontend (`apps/app/`):
  - Tests: `cd apps/app && pnpm test`
  - Build: `cd apps/app && pnpm build`
  - Type-check: `type-check` (Nix script)
  - Lint: `lint` (Nix script)
- Useful benchmark commands: `gotest` or `cd backend/ && go test ./internal/domain/<name>/...`, `cd backend/ && go build ./...`, `golint`, `cd apps/app && pnpm build` → check `.next/` artifacts, `type-check`
- Go test caching: use `-count=1` to disable caching (e.g. `go test -count=1 ./...`), otherwise metrics will be misleading after unchanged runs
- Codegen order: `bufgen` → `sqlc generate` → `gomocks` (run in order when changing APIs)
- Generated code is off limits: `db/` dirs (sqlc-generated), `pkg/proto/` (buf-generated), `mock_*.go` (gomocks-generated)
- `packages/shared/*` can affect multiple frontend consumers — be cautious
- Follow existing patterns (see `backend/AGENTS.md`)

When resuming (autoresearch.md + autoresearch.jsonl exist):
- Read autoresearch.md — understand the goal, what's been tried, and dead ends
- Read autoresearch.jsonl — reconstruct the full experiment history and current best metric

The user can interrupt at any time. When they do:
- Run /preflight to ensure the final state is clean (codegen, lint, format, tests all pass)
- Finalize autoresearch.md and prepare a summary to include in the PR body (step 7).
- Untrack the session files:

```bash
git rm --cached autoresearch.md autoresearch.sh autoresearch.jsonl
git rm --cached autoresearch.checks.sh 2>/dev/null || true
git commit -m "chore: untrack autoresearch session files"
```
The files remain in the working directory (already gitignored via `autoresearch.*` in `.gitignore`). They are also recoverable from earlier commits on the branch via `git show <commit>:autoresearch.md`.

- Run /changelog-add to generate a changelog entry summarizing the optimization work
- Run /create-pr to open a pull request. Include the session history from step 3 as a collapsible `<details>` section in the PR body under "## Autoresearch Log". The /create-pr skill will handle the conventional commit title and ask for user approval. Since /preflight and /changelog-add already ran, the create-pr pre-flight checks should pass automatically.