Name: Subagent Driven Development
Author: spencer-life

Subagent Driven Development

This skill should be used when executing implementation plans with independent tasks in the current session. It dispatches fresh subagents per task with multi-stage review. Self-contained with all subagent prompts embedded. Includes pre-flight checks, data verification, and automatic Codex cross-model review at phase boundaries.

spencer-life0 星标2026年4月16日

职业
分类: 效率与集成

Execute a plan by dispatching fresh subagents per task, with multi-stage review after each. All prompts embedded — no external files needed.

Core principle: Fresh subagent per task + staged review + cross-model verification = high quality, fast iteration.

NEVER pushes to remote. Commits only.

When to Use

You have an implementation plan (from /plan or a written plan doc)
Tasks are mostly independent (not tightly coupled)
Staying in this session (not handing off to parallel sessions)

The Full Pipeline

For each task:

1. PRE-FLIGHT (if task touches DB or has dependencies)
   → pre-flight agent checks scope, context, data safety
   → Address issues before proceeding

2. APPROACH CHECK (all tasks)
   → Implementer outputs BEFORE writing any code:
     a. Understanding of the task (2-3 sentences)
     b. Planned approach (what files to change, in what order)
     c. Assumptions being made (what's assumed true but not verified)
     d. Tradeoffs (or "none" for simple tasks)
   → Orchestrator reviews the approach
   → If wrong direction or bad assumptions: correct BEFORE coding
   → If sound: proceed to IMPLEMENT
   → Trivial tasks (< 10 lines): 1-2 sentences suffice

3. IMPLEMENT
   → implementer subagent with full task text + context
   → TDD enforced: write test → watch it fail → implement → pass
   → Mocks ONLY for truly external services (APIs you can't call locally)
   → Commits when done

4. UNIT TEST GATE
   → Run the FULL test suite (not just the new tests)
   → ALL tests must pass before proceeding
   → If any fail: return to implementer with failures, fix, re-run
   → Repeat until green. This gate is non-negotiable.

5. SPEC + CODE QUALITY REVIEW (parallel)
   → Dispatch BOTH reviewers as parallel agents (model: sonnet)
   → Spec reviewer checks implementation matches spec
   → Code quality reviewer checks bugs, security, simplicity, surgical scope
   → Collect both verdicts
   → If either rejects: send COMBINED feedback to implementer
   → Implementer fixes all issues in one pass (not two separate rounds)
   → Re-run both reviewers in parallel again
   → Must both pass before proceeding

6. DATA VERIFICATION (if task modified database)
   → data-verifier agent runs VERIFICATION.md ground-truth checks
   → Fix failures → re-verify → must pass

7. SMOKE TEST GATE
   → If smoke_test.sh exists in project root: run it
   → ALL real-connection checks must pass
   → If fail: return to implementer to fix real integration issues
   → If no smoke_test.sh: warn "No smoke test defined" but don't block

8. CODEX BACKGROUND AUDIT + LOOKAHEAD PRE-FLIGHT (async, non-blocking)
   → Fire headless Codex in background (read-only, --sandbox read-only)
   → If there's a NEXT task in the queue that needs pre-flight:
     dispatch pre-flight for it in background too (model: haiku, run_in_background)
   → Do NOT wait for either — mark task complete and move on
   → Codex report reviewed at next natural pause
   → Pre-flight result ready by time next task starts

9. Mark task complete → move to next task immediately

Subagent Driven Development

spencer-life0 星标2026年4月16日

职业
分类: 效率与集成

The Full Pipeline

For each task: 1. PRE-FLIGHT (if task touches DB or has dependencies) → pre-flight agent checks scope, context, data safety → Address issues before proceeding 2. APPROACH CHECK (all tasks) → Implementer outputs BEFORE writing any code: a. Understanding of the task (2-3 sentences) b. Planned approach (what files to change, in what order) c. Assumptions being made (what's assumed true but not verified) d. Tradeoffs (or "none" for simple tasks) → Orchestrator reviews the approach → If wrong direction or bad assumptions: correct BEFORE coding → If sound: proceed to IMPLEMENT → Trivial tasks (< 10 lines): 1-2 sentences suffice 3. IMPLEMENT → implementer subagent with full task text + context → TDD enforced: write test → watch it fail → implement → pass → Mocks ONLY for truly external services (APIs you can't call locally) → Commits when done 4. UNIT TEST GATE → Run the FULL test suite (not just the new tests) → ALL tests must pass before proceeding → If any fail: return to implementer with failures, fix, re-run → Repeat until green. This gate is non-negotiable. 5. SPEC + CODE QUALITY REVIEW (parallel) → Dispatch BOTH reviewers as parallel agents (model: sonnet) → Spec reviewer checks implementation matches spec → Code quality reviewer checks bugs, security, simplicity, surgical scope → Collect both verdicts → If either rejects: send COMBINED feedback to implementer → Implementer fixes all issues in one pass (not two separate rounds) → Re-run both reviewers in parallel again → Must both pass before proceeding 6. DATA VERIFICATION (if task modified database) → data-verifier agent runs VERIFICATION.md ground-truth checks → Fix failures → re-verify → must pass 7. SMOKE TEST GATE → If smoke_test.sh exists in project root: run it → ALL real-connection checks must pass → If fail: return to implementer to fix real integration issues → If no smoke_test.sh: warn "No smoke test defined" but don't block 8. CODEX BACKGROUND AUDIT + LOOKAHEAD PRE-FLIGHT (async, non-blocking) → Fire headless Codex in background (read-only, --sandbox read-only) → If there's a NEXT task in the queue that needs pre-flight: dispatch pre-flight for it in background too (model: haiku, run_in_background) → Do NOT wait for either — mark task complete and move on → Codex report reviewed at next natural pause → Pre-flight result ready by time next task starts 9. Mark task complete → move to next task immediately

Role	Model	Why
Implementer	`opus`	Writes code, TDD, complex reasoning — the hard one
Code quality reviewer	`sonnet`	Structured review with clear checklist; orchestrator checks verdict
Spec reviewer	`sonnet`	Diff-vs-spec comparison — structured and fast
Data verifier	`sonnet`	May construct queries, compare values — moderate complexity
Pre-flight	`haiku`	Run commands, check 4 categories, report — simple and structured
Integration reviewer	`sonnet`	Diff analysis and merge safety
Explore agents	`sonnet`	Research, file reading, summarization

Subagent Driven Development

When to Use

The Full Pipeline

Subagent Driven Development

When to Use

The Full Pipeline

Subagent Model Routing

Setup (Before First Task)

Subagent Prompts

Pre-Flight (Agent: `pre-flight`)

Implementer

Spec Reviewer

Code Quality Reviewer

Data Verifier (Agent: `data-verifier`)

Background Codex Audit (async, non-blocking)

Workflow Rules

Execution order

Stage order (strict)

When a reviewer rejects

Git rules

When a subagent asks questions

When a subagent fails

Parallel Handoff Mode

Example Flow

Red Flags — STOP

Feishu Perm

Discord

Coding Agent (bash-first)

Apple Notes

Feishu Wiki

Bear Notes

Subagent Driven Development

When to Use

The Full Pipeline

Subagent Driven Development

When to Use

The Full Pipeline

Subagent Model Routing

Setup (Before First Task)

Subagent Prompts

Pre-Flight (Agent: pre-flight)

Implementer

Spec Reviewer

Code Quality Reviewer

Data Verifier (Agent: data-verifier)

Background Codex Audit (async, non-blocking)

Workflow Rules

Execution order

Stage order (strict)

When a reviewer rejects

Git rules

When a subagent asks questions

When a subagent fails

Parallel Handoff Mode

Example Flow

Red Flags — STOP

Feishu Perm

Discord

Coding Agent (bash-first)

Apple Notes

Feishu Wiki

Bear Notes

Pre-Flight (Agent: `pre-flight`)

Data Verifier (Agent: `data-verifier`)