Name: Evolution Decide
Author: gbPagano

搜索技能.../

Evolution Decide | Skills Pool

Read All Artifacts — Open implementation.md and benchmark.md to review what was implemented and how it performed. Open iteration.json for the full context including hypothesis, state, and metrics.
Evaluate Against Acceptance Policy — Determine the outcome based on:
- Accepted: The implementation succeeded, the benchmark evidence satisfies the documented policy, benchmark.sufficientForPromotion is true with a statistically meaningful improvement over the baseline, and the candidate can be promoted under the versioning policy.
- Rejected: The implementation succeeded and benchmark evidence is strong enough to show the candidate is weaker than or equivalent to the baseline. The candidate is discarded.
- Inconclusive: The implementation succeeded but the benchmark evidence is insufficient for a clear accept/reject decision (for example: only screening evidence, weak signal, high variance, or confirmation still required). The candidate may be refined in a future iteration.
- Failed: The implementation, correctness gate, or benchmark infrastructure failed (build error, test failure, benchmark crash). No evaluation of the candidate's merit is possible.
Write decision.md — Record the decision with:
- The final outcome (accepted, rejected, inconclusive, failed)
- The reasoning for the decision
- Key evidence cited from the benchmark results, including policyStage, completed games, and whether benchmark.sufficientForPromotion was satisfied
- For an accepted outcome, the promotion metadata required by the versioning policy (previousVersion, promotedVersion, baseline refs, and version-artifact path)
- Any recommendations for future iterations
Update iteration.json — Follow the canonical state-machine contract in tasks/prd-wiggum-evolution-loop.md:
- Enter the decision phase only from benchmarked by transitioning to deciding before choosing an outcome
- Set the final state to exactly one of "accepted", "rejected", "inconclusive", or "failed"
- Keep stateMachine.currentPhase aligned with the final state
- Add decision object with:
  - outcome — one of the allowed final states
  - reasoning — explanation of the decision
  - evidence — key metrics that supported the decision, including benchmark policy fields used for the call

# Iteration N Decision

## Outcome

One of: accepted, rejected, inconclusive, failed.

## Reasoning

Explanation of why this outcome was selected based on the evidence.

## Evidence

- Implementation: summary of what was changed
- Benchmark: key results (games completed, win rate, ELO estimate)
- Policy: which acceptance criteria were met or not met

## Recommendations

Suggestions for future iterations (e.g., "retry with more games", "explore related idea from iteration X", "abandon this direction").

State	Meaning	Baseline
`accepted`	Candidate validated and promoted	Updated to new version
`rejected`	Candidate evaluated and discarded	Unchanged
`inconclusive`	Evidence insufficient for clear decision	Unchanged
`failed`	Implementation or benchmark infrastructure failure	Unchanged

Evolution Decide

Evolution Decide Skill

The Job

Inputs

Evolution Decide

Evolution Decide Skill

The Job

Inputs

What to Do

Decision Format

Allowed Final States

Scope Constraints

Output Contract

References

Taskflow Inbox Triage

Accessibility

Open a Pull Request

Investor Materials

Continuous Agent Loop

Configure Ecc