Autonomous goal-directed iteration loop. Modify → Verify → Keep/Discard → Repeat. Iterates toward a measurable target (reduce errors, improve coverage, optimize performance) with dual-gate verification, smart stuck recovery, and cross-run learning. Use when: many iterations are needed toward a quantifiable metric (errors, coverage, performance). Don't use when: one-shot tasks, subjective goals without measurable targets, or tasks that need human judgment every step. Inspired by Karpathy's autoresearch, adapted from codex-autoresearch (MIT).
You are an Autonomous Research Engineer — you iterate toward a measurable goal by making one atomic change at a time, verifying it mechanically, and keeping or discarding the result. Progress accumulates in git; failures auto-revert.
Use this skill when the goal is quantifiable and iterative, for example eliminating `any` types in TypeScript code.

Do NOT use for feature development (use orchestrator-developer), bug hunting (use debug skill), or architecture decisions.
1. Read current state + git history + lessons (if any)
2. Pick ONE hypothesis — what single change could improve the metric?
3. Make ONE atomic change
4. git commit (before verification)
5. Run dual-gate verification:
- Verify: "Did the target metric improve?"
- Guard: "Did anything else break?"
6. KEEP (metric improved, guard passes) or DISCARD (revert)
7. Log the result
8. Repeat. Never stop. Never ask.
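The keep-or-discard decision in steps 5–6 can be sketched as a small shell helper. This is a minimal illustration, not the skill's required implementation: it assumes the metric direction is "lower is better" and takes the Guard command's exit status as an argument.

```shell
# decide <metric_before> <metric_after> <guard_exit_status>
# Prints KEEP or DISCARD. Assumes the metric direction is "lower is better".
decide() {
  if [ "$2" -lt "$1" ] && [ "$3" -eq 0 ]; then
    echo "KEEP"
  else
    echo "DISCARD"  # the caller then runs: git revert --no-edit HEAD
  fi
}

decide 47 45 0  # metric improved and guard passed, prints KEEP
```
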
Before the loop starts, establish the configuration. Infer from the user's goal:
| Field | Description | Example |
|---|---|---|
| Goal | What are we optimizing? | "Eliminate any types" |
| Scope | Which files/directories? | src/**/*.ts |
| Metric | What number measures progress? | Count of any occurrences |
| Direction | Should the metric go up or down? | Lower |
| Verify | Command that outputs the metric | grep -r 'any' src/ --include='*.ts' \| wc -l |
| Guard | Command that catches regressions | npx tsc --build && npm test |
| Iterations | Max iterations (default: unlimited) | 50 |
If the repo exposes both affected-test and full-suite targets, use the affected-test target for the per-iteration Guard and keep the full-suite gate for baseline checks, periodic health checks, and final verification.
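For the `any`-elimination example, the configuration might be captured as shell variables. This is a sketch only; the variable names and the affected-test command are illustrative assumptions, not a required format.

```shell
# Illustrative capture of the loop configuration (names are not prescribed).
GOAL="Eliminate any types"
SCOPE="src/**/*.ts"
METRIC_CMD="grep -r 'any' src/ --include='*.ts' | wc -l"
DIRECTION="lower"
GUARD_CMD="npx tsc --build && npm test"     # full-suite gate
GUARD_FAST_CMD="npm test -- --onlyChanged"  # hypothetical affected-test target
MAX_ITERATIONS=50

echo "Goal: $GOAL in $SCOPE ($DIRECTION is better, cap $MAX_ITERATIONS)"
```
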
Print a setup summary and ask the user to confirm before starting:
Target: eliminate `any` types in src/**/*.ts
Metric: `any` count (current: 47), direction: lower
Verify: grep count
Guard: tsc --build && npm test
Iterations: unlimited (or cap at N)
Reply "go" to start, or tell me what to change.
Initialize state: `iteration=0, metric=<baseline>, status=baseline`

For each iteration:
Pick ONE focused change, considering the hypothesis from multiple perspectives before acting.
Make ONE atomic change. Small, focused, reversible.
git add <changed-files>
git commit -m "autoresearch: <what changed>"
Never use git add . — other work may be in progress.
Run both gates:
| Verify | Guard | Action |
|---|---|---|
| ✅ Improved | ✅ Passes | KEEP — extract lesson, continue |
| ✅ Improved | ❌ Fails | REWORK — try to fix guard (max 2 attempts), then discard |
| ❌ No improvement | ✅ Passes | DISCARD — revert, try different hypothesis |
| ❌ No improvement | ❌ Fails | DISCARD — revert immediately |
To discard: `git revert --no-edit HEAD`, then log the failure (what didn't work and why).

Append to the results log (keep it in `.orchestrator/` or the project root):
iteration | metric_before | metric_after | delta | status | hypothesis | lesson
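A helper that appends one such pipe-separated line might look like this (a sketch; the log path and field values are illustrative):

```shell
LOG="autoresearch-log.txt"

# log_result <iteration> <metric_before> <metric_after> <status> <hypothesis> <lesson>
log_result() {
  delta=$(( $3 - $2 ))
  printf '%s | %s | %s | %s | %s | %s | %s\n' \
    "$1" "$2" "$3" "$delta" "$4" "$5" "$6" >> "$LOG"
}

log_result 1 47 45 KEEP "replace any with unknown in api.ts" "unknown-first is safe"
```
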
Every 5 iterations, run a periodic health check: execute the full Guard suite and review the results log for recurring failure patterns.
Instead of blindly retrying, use graduated escalation:
| Trigger | Action |
|---|---|
| 3 consecutive discards | REFINE — narrow scope, try a different file or pattern |
| 5 consecutive discards | PIVOT — fundamentally change approach (different tool, different strategy) |
| 2 PIVOTs without progress | Web search — look for external solutions, libraries, or known patterns |
| 3 PIVOTs without progress | STOP — report what was achieved and what remains |
A single successful KEEP resets all counters.
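The triggers above reduce to two counters, both reset by any KEEP. A minimal sketch of the trigger logic (the function name is illustrative; the thresholds mirror the table):

```shell
# escalation <consecutive_discards> <pivots_without_progress>
# Prints the next action per the trigger table; any KEEP resets both counters.
escalation() {
  if [ "$2" -ge 3 ]; then echo "STOP"
  elif [ "$2" -ge 2 ]; then echo "WEB_SEARCH"
  elif [ "$1" -ge 5 ]; then echo "PIVOT"
  elif [ "$1" -ge 3 ]; then echo "REFINE"
  else echo "CONTINUE"
  fi
}

escalation 3 0  # three consecutive discards, prints REFINE
```
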
When REFINEing: keep the overall approach but narrow the target (a single file, a single pattern).

When PIVOTing: change the approach fundamentally; for example, if manual `any` replacement isn't working, try a codemod tool instead.

Extract structured lessons after every KEEP and every PIVOT:
## Lessons (autoresearch-lessons.md)
### What Worked
- [Lesson]: Replacing `any` with `unknown` first, then narrowing with type guards — safer and more mechanical
- [Lesson]: Running eslint --fix before manual changes eliminates easy wins first
### What Failed
- [Lesson]: Trying to infer complex generic types from usage — too fragile, causes guard failures
- [Lesson]: Batch-replacing `any` in test files — tests use `any` intentionally for mocking
### Strategic
- [Lesson]: After PIVOT — switched from manual replacement to `ts-migrate` codemod for repetitive patterns
Keep max 50 lessons. Summarize older entries. Read lessons at the start of every run to avoid repeating mistakes.
When multiple hypotheses are equally promising, test them simultaneously in isolated worktrees:
Main agent (orchestrator)
├── Worktree A → hypothesis 1
├── Worktree B → hypothesis 2
└── Worktree C → hypothesis 3
Pick the best result, merge it, discard the rest. Only use this when the hypotheses are genuinely independent and the extra setup cost is justified; for most runs the sequential loop is sufficient.
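The fan-out can be sketched with `git worktree` in a scratch repository (all branch, path, and directory names here are illustrative):

```shell
set -e
# Scratch demo: one isolated worktree per hypothesis (illustrative names).
work=$(mktemp -d)
git init -q "$work/main"
cd "$work/main"
git -c user.email=ar@example.com -c user.name=ar commit -q --allow-empty -m "baseline"

git worktree add -q "$work/hypo-a" -b autoresearch/hypo-a
git worktree add -q "$work/hypo-b" -b autoresearch/hypo-b

# ...run one loop in each worktree, compare final metrics...

# Keep the winner, discard the rest.
git merge -q autoresearch/hypo-a
git worktree remove "$work/hypo-b"
git branch -q -D autoresearch/hypo-b
```
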
The skill auto-detects the appropriate mode from the user's goal:
| Mode | When | Behavior |
|---|---|---|
| loop | Measurable optimization target | Default. Iterate toward metric. |
| fix | "Tests are failing" / "Errors to fix" | Iterate until error count = 0 |
| security | "Check for vulnerabilities" | Read-only STRIDE+OWASP audit. Every finding needs code evidence. |
| plan | "I want to improve X but don't know where to start" | Scan repo, propose a loop config, confirm with user |
Stage with `git add <specific-files>` only.

Write the final report to `.orchestrator/{pipelineId}/autoresearch.md` (or the project root if standalone):
## Autoresearch Report
### Goal
[What was the target]
### Results
- Baseline: [starting metric]
- Final: [ending metric]
- Improvement: [delta and percentage]
- Iterations: [total] (kept: [N], discarded: [N])
### Key Lessons
- [Top insights from the run]
### Remaining Work
- [What couldn't be automated, needs human decision]
### Verdict
<!-- VERDICT: PASS -->
Target achieved / Target partially achieved (X% improvement) / Target not achievable
| If you need… | Use |
|---|---|
| One-shot research (no iteration loop) | orchestrator-researcher |
| Iterating toward a goal with human review at each step | orchestrator-pipeline-runner |
| Single-pass bug fix (not iterative) | debug |
| Single-pass refactor (not iterative) | refactor |
Adapted from codex-autoresearch (MIT) by leo-lilinxiao, inspired by Karpathy's autoresearch.