Name: Self Improve
Author: Yeachan-Heo

搜索技能.../

Self Improve | Skills Pool

.omc/self-improve/
├── config/                    # User configuration
│   ├── settings.json          # agents, benchmark, thresholds, sealed_files
│   ├── goal.md                # Improvement objective + target metric
│   ├── harness.md             # Guardrail rules (H001/H002/H003)
│   └── idea.md                # User experiment ideas
├── state/                     # Runtime state
│   ├── agent-settings.json    # iterations, best_score, status, counters
│   ├── iteration_state.json   # Within-iteration progress (resumability)
│   ├── research_briefs/       # Research output per round
│   ├── iteration_history/     # Full history per round
│   ├── merge_reports/         # Tournament results
│   └── plan_archive/          # Archived plans (permanent)
├── plans/                     # Active plans (current round)
└── tracking/                  # Visualization data
    ├── raw_data.json          # All candidate scores
    ├── baseline.json          # Initial benchmark score
    ├── events.json            # Config changes
    └── progress.png           # Generated chart

Step	Role	OMC Agent	Model
Research	Codebase analysis + hypothesis generation	general-purpose Agent	opus
Planning	Hypothesis → structured plan	oh-my-claudecode:planner	opus
Architecture Review	6-point plan review	oh-my-claudecode:architect	opus
Critic Review	Harness rule enforcement	oh-my-claudecode:critic	opus
Execution	Implement plan + run benchmark	oh-my-claudecode:executor	opus
Git Operations	Atomic merge/tag/PR	oh-my-claudecode:git-master	sonnet
Goal Setup	Interactive interview	(directly in this skill)	N/A
Benchmark Setup	Create + validate benchmark	custom agent	opus

File	Purpose
`.omc/self-improve/config/settings.json`	User config: `number_of_agents`, `benchmark_command`, `benchmark_format`, `benchmark_direction`, `max_iterations`, `plateau_threshold`, `plateau_window`, `target_value`, `primary_metric`, `sealed_files`, `regression_threshold`, `circuit_breaker_threshold`, `target_branch`, `current_repo_url`, `fork_url`, `upstream_url`
`.omc/self-improve/state/agent-settings.json`	Runtime: `iterations`, `best_score`, `plateau_consecutive_count`, `circuit_breaker_count`, `status`, `goal_slug` (derived: lowercase underscore from goal objective, persisted for cross-session consistency)
`.omc/self-improve/state/iteration_state.json`	Per-iteration progress for resumability
`.omc/self-improve/config/goal.md`	Improvement objective, target metric, scope
`.omc/self-improve/config/harness.md`	Guardrail rules (H001, H002, H003)

Check if target repo path exists. If not configured, ask user for the path to the repository to improve.
Create .omc/self-improve/ directory structure by copying from templates/ in this skill directory.
Read .omc/self-improve/state/agent-settings.json. Check si_setting_goal, si_setting_benchmark, si_setting_harness.
Trust confirmation (mandatory, cannot be skipped): a. If trust_confirmed is already true in agent-settings.json, skip to step 5 (resume path). b. Display the target repo path and ask user to confirm: "Self-improve will run benchmark commands inside {repo_path}. This executes arbitrary code in that repository. Confirm? [yes/no]" c. If user declines: abort setup and exit. Do NOT proceed. d. Record consent: set trust_confirmed: true in agent-settings.json.
If goal not set → read si-goal-clarifier.md from this skill directory and run the 4-dimension Socratic interview directly in this context (Objective, Metric, Target, Scope). Write result to .omc/self-improve/config/goal.md.
If benchmark not set → read si-benchmark-builder.md from this skill directory, spawn a custom Agent(model=opus) with its content as prompt. The agent surveys the repo, creates or wraps a benchmark, validates 3x, and records baseline. After benchmark is set, confirm the benchmark command with user: "Benchmark command: {benchmark_command}. This will be run repeatedly during the loop. Confirm? [yes/no]" If user declines: abort setup and exit.
If harness not set → confirm default harness rules (H001/H002/H003) with user or customize.
Gate: All of si_setting_goal, si_setting_benchmark, si_setting_harness, trust_confirmed must be true.
Create improvement branch (if it does not exist):
```
git -C {repo_path} checkout -b improve/{goal_slug} {target_branch}
git -C {repo_path} checkout {target_branch}
```
Where {goal_slug} is derived from the goal objective (lowercase, underscored). If the branch already exists, skip creation. Persist goal_slug in agent-settings.json.
Mode exclusivity: Call state_list_active. If autopilot, ralph, or ultrawork is active, refuse to start.
Write initial state: state_write(mode='self-improve', active=true, iteration=0, started_at=<now>)

Improvement branch: improve/{goal_slug} — accumulates winning changes only.
Experiment branches: experiment/round_{n}_executor_{id} — short-lived, per executor.
Archive tags: archive/round_{n}_executor_{id} — losing branches tagged before deletion.

Worktree setup (SKILL.md creates before each executor):

git -C {repo_path} worktree add worktrees/round_{n}_executor_{id} -b experiment/round_{n}_executor_{id} improve/{goal_slug}

Winner merges via oh-my-claudecode:git-master:

Merge experiment/round_{n}_executor_{winner_id} into improve/{goal_slug} with --no-ff
Message: "Iteration {n}: {hypothesis} (score: {before} → {after})"

Push after merge: git -C {repo_path} push origin improve/{goal_slug} (backup, non-blocking)
Losers archived: Tag + delete via git-master.

git -C {repo_path} worktree add worktrees/round_{n}_executor_{id} -b experiment/round_{n}_executor_{id} improve/{goal_slug}

git -C {repo_path} worktree remove worktrees/round_{n}_executor_{id} --force
git -C {repo_path} worktree prune

Condition	Check
User stop	`status == "user_stopped"` in agent-settings or state cleared
Target reached	`best_score` meets/exceeds `target_value` (respecting direction)
Plateau	`plateau_consecutive_count >= plateau_window`
Max iterations	`iterations >= max_iterations`
Circuit breaker	`circuit_breaker_count >= circuit_breaker_threshold`

Update agent-settings.json with final status
If target_reached AND auto_pr is true in settings: spawn git-master to create PR from improve/{goal_slug} to upstream. If auto_pr is false (default): skip PR creation. Log: "PR creation skipped (auto_pr: false). Run manually: gh pr create --head improve/{goal_slug} --base {target_branch}"
Run plot_progress.py one final time

Print summary report:

=== Self-Improvement Loop Complete ===
Status: {status}
Iterations: {iterations}
Best Score: {best_score} (baseline: {baseline})
Improvement: {delta} ({delta_pct}%)

Run /oh-my-claudecode:cancel for clean state cleanup

Situation	Action
Agent fails to produce output	Retry once. If still no output, log and continue.
Researcher produces empty brief	Proceed — planners work from history alone.
All plans rejected by critic	Skip execution. Log. Continue to next iteration.
All executors fail	Skip tournament. Record failures. Continue.
Merge conflict	Reject candidate, try next.
Re-benchmark regression	Reject candidate, revert merge, try next.
Push failure	Log warning. Continue — push is backup.
Worktree already exists	Remove and recreate.
Settings corrupted	Report and stop.

Tag	Description
`architecture`	Model/component structure changes
`training_config`	Optimizer, LR, scheduler, batch size
`data`	Data loading, augmentation, preprocessing
`infrastructure`	Mixed precision, distributed training, compiled kernels
`optimization`	Algorithmic/numerical optimizations
`testing`	Evaluation methodology changes
`documentation`	Documentation-only changes
`other`	Does not fit above — explain in evidence

Self Improve

Self-Improvement Orchestrator

Autonomous Execution Policy

Self Improve

Self-Improvement Orchestrator

Autonomous Execution Policy

State Tracking

Agent Mapping

Inputs

Setup Phase

Git Strategy

Improvement Loop

Step 0 — Stale Worktree Cleanup (mandatory, runs every iteration)

Step 1 — Refresh State

Step 2 — Check Stop Request

Step 3 — Check User Ideas

Step 4 — Research

Step 5 — Plan

Step 6 — Review

Step 7 — Execute

Step 8 — Tournament Selection

Step 9 — Record & Visualize

Step 10 — Cleanup

Step 11 — Stop Condition Check

Resumability

Completion

Error Handling

Approach Family Taxonomy

Continuous Learning V2

Continuous Learning V2

Continuous Learning V2

Continuous Learning

Continuous Learning

Pytorch Patterns