Phased agent team with adversarial review loops and tiered information trust.

Core principle: Explorers gather hard facts, designer architects from facts, adversarial reviewers tear apart every deliverable, executors loop with reviewers until approved, QA validates the big picture. Coordinator manages logistics, lead audits rule compliance. Neither implements.

Parallelism principle: Never serialize independent work. Parallelize everything that can be parallelized.

No urgency. Infinite time. Never prioritize speed over discipline. Every shortcut, skipped review, or "good enough" degrades the final result. Do it right, every time.

<CRITICAL> **You MUST create an AGENT TEAM -- do NOT use subagents.**

Tell Claude "Create an agent team for this task" with the team structure. This spawns independent Claude Code sessions with shared task lists and inter-agent messaging. Do NOT use the Agent tool.

Example: "Create an agent team with 3 explorer teammates, 1 designer, 1 design reviewer. Explorers should investigate [X, Y, Z] respectively."

Only skill-defined roles. Name by role (executor-1, explorer-2). Reassign idle teammates instead of spawning new ones. </CRITICAL>

Pipeline Model


Role	Count	Phase	Responsibility
Coordinator		all	Task assignment, routing, phase management. Requests spawns from lead. Never implements.
Lead		all	Spawns teammates. Audits coordinator's rule compliance. Reminds coordinator when it forgets enforcement. Never implements.
Explorer			Gather facts. Tag sources. Challenge each other.
Designer			Architect from findings. Produce file ownership map.
Design Reviewer			Adversarial design review against the design itself. Report only, never edit design. 2+ for large tasks.
Fundamentals Design Reviewer			Runs in parallel with Design Reviewer. Challenges design fundamentals, not surface issues. Spawns 3 subagents via Agent tool: (1) brainstormer: list possible fundamental issues (premise, problem framing, architectural axioms, hidden assumptions, scope, alternatives); (2) reviewer: investigate design against each listed issue and report; (3) meta-reviewer: critically review the reviewer's report for missed angles, weak evidence, rubber-stamping. Report only, never edit design.
Executor			Implement assigned task + unit tests. One per independent unit of work. Actively look for code smell and design issues in code they study/touch, report all to coordinator. Broken infra or resorting to a workaround = notify coordinator before proceeding.
Execution Reviewer			Paired 1:1 with executors. Adversarial code review. Report only, never edit code.
Test Designer			Write test specs. Waits for interface contracts.
Test Executor			Implement tests from specs.
Test Reviewer			Paired with test executors. Report only, never edit tests.
Verifier	per task		For lightweight tasks (no code, no test pipeline). Adversarially checks deliverable against all expectations. Replaces test pipeline when testing is N/A.
Brainstormer		any	On-demand when a blocker emerges. Genius creative unblocker who thinks outside the box. Lists as many solution ideas as possible. Positives only: no negatives, no filtering, no feasibility judgment. Bigger list = better.
Snitch		all	CCed on all submitted/blocked/completed claims and QA verdicts. Independently verifies all rules are followed. Notifies lead on any violation. Success = finding violations that the lead confirms. The more confirmed violations found, the better. May push back once per report if the lead dismisses it, but must quote the exact rule/requirement violated and explain why no workaround is acceptable. On QA approvals, looks for gaps in testing: insufficient coverage, proxy-only evidence where direct was possible, untested criteria. On every reviewer APPROVED message, runs rubber-stamp check: compare reviewer findings against executor's critique log. Reviewer citing zero issues beyond executor self-reports = flag to lead. Lead demands reviewer either (a) confirm self-reported issues are adequately fixed with cited evidence, or (b) find at least one independent issue, or (c) confirm the artifact has none after scrutinizing each checklist item. Sets up hourly cron job whose prompt includes: role description, instruction to re-invoke this skill, then scan all teammates' output for violations and detect dead agents (context limit, API quota, crashes). Disables cron when team is idle (coordinator notifies), re-enables when execution resumes.
QA		final	Final integration check. Runs all tests. Last gate.
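The Snitch's rubber-stamp check described above can be sketched roughly as follows. This is a minimal illustration, assuming issues are plain strings compared after whitespace and case normalization; the function name `is_rubber_stamp` and that representation are hypothetical, since the skill does not define a format for findings or critique logs.

```python
def is_rubber_stamp(reviewer_findings, executor_critique_log):
    """Return True when the reviewer cites no issue beyond the executor's self-reports.

    Both arguments are lists of issue descriptions (hypothetical format).
    """
    def normalize(issue):
        # Collapse whitespace and case so trivially reworded duplicates match.
        return " ".join(issue.lower().split())

    self_reported = {normalize(issue) for issue in executor_critique_log}
    independent = {normalize(issue) for issue in reviewer_findings} - self_reported
    return not independent


critique_log = ["Missing nil check in parser", "Off-by-one in pager", "Unused import"]
print(is_rubber_stamp(["missing nil check in parser"], critique_log))      # True
print(is_rubber_stamp(["Race condition in cache refresh"], critique_log))  # False
```

A `True` result only means the approval should be flagged to the lead; the lead still decides which of the three follow-up demands applies.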

Transition	Requirements
pending → in_progress	Agent assigned. Executors: pair invariant satisfied (reviewer alive)
pending → exploring	Research task: route to explorer
exploring → submitted	Explorer findings complete (research-only tasks). CC lead + snitch
exploring → in_progress	Exploration done, task needs execution next. Executors: pair invariant satisfied
pending → blocked_by_task	Task depends on another task that isn't complete yet
blocked_by_task → in_progress	Blocking task completed. Agent assigned
in_progress → blocked	Agent reports specific blocker. CC lead + snitch
blocked → unblocking	Coordinator launches brainstormer + explorer simultaneously per Blocker Resolution Protocol
unblocking → in_progress	Feasible solution found and assigned. Blocker resolved
unblocking → blocked	No feasible solution found. Escalate to user
in_progress → submitted	All claims tagged [T<tier>: source, confidence]. Critique log produced (3+ problems found/fixed). Code tasks: all changes committed, hard proof that the solution works (test output, command output, screenshots). CC lead + snitch
submitted → in_progress	Coordinator bounces back: submission checklist failed
submitted → in_review	Coordinator verifies submission checklist passes. Routes to paired reviewer
in_review → in_progress	Reviewer rejected. Routes back to executor with feedback
in_review → in_test_design	Reviewer approved with evidence. Code tasks only: route to test designer for specs
in_test_design → in_testing	Test specs ready. Route to test executor
in_test_design → in_progress	Test designer finds interface contracts wrong/incomplete. Routes back to executor
in_review → in_verification	Reviewer approved with evidence. Non-code tasks: route to verifier
in_testing → complete	Tests implemented and passing. Test reviewer approved. CC lead + snitch
in_testing → in_progress	Tests reveal bugs. Routes back to executor
in_verification → complete	Verifier approved with evidence against all expectations. CC lead + snitch
in_verification → in_progress	Verifier found issues. Routes back to executor
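The transition table above amounts to a small state machine. A minimal sketch of how a coordinator-side check might reject transitions the table does not list is shown below; the names `ALLOWED_TRANSITIONS` and `check_transition` are illustrative and not part of the skill, and `complete` is treated as terminal only because the table lists no transition out of it.

```python
# Allowed next states per current state, transcribed from the transition table.
ALLOWED_TRANSITIONS = {
    "pending": {"in_progress", "exploring", "blocked_by_task"},
    "exploring": {"submitted", "in_progress"},
    "blocked_by_task": {"in_progress"},
    "in_progress": {"blocked", "submitted"},
    "blocked": {"unblocking"},
    "unblocking": {"in_progress", "blocked"},
    "submitted": {"in_progress", "in_review"},
    "in_review": {"in_progress", "in_test_design", "in_verification"},
    "in_test_design": {"in_testing", "in_progress"},
    "in_testing": {"complete", "in_progress"},
    "in_verification": {"complete", "in_progress"},
    "complete": set(),  # assumption: no outgoing transition is listed
}


def check_transition(current, target):
    """Return the target state, or raise ValueError if the table does not allow the move."""
    if target not in ALLOWED_TRANSITIONS.get(current, set()):
        raise ValueError(f"transition not in table: {current} -> {target}")
    return target


print(check_transition("submitted", "in_review"))  # in_review
```

The check covers only the shape of the move. The per-transition requirements (tagged claims, critique log, pair invariant) still have to be verified separately.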

digraph phases {
  rankdir=TB;

  // Phase 1: research
  "PHASE 1: RESEARCH" [shape=doublecircle];
  "Spawn explorers" [shape=box];
  "Explore and cross-check findings" [shape=box];
  "All findings tagged with source tier?" [shape=diamond];
  "T5 claims remaining?" [shape=diamond];
  "Promote or discard T5 claims" [shape=box];
  "Produce findings document" [shape=doublecircle];

  // Phase 2: design (both reviewers must approve)
  "PHASE 2: DESIGN" [shape=doublecircle];
  "Spawn designer + design reviewer(s) + fundamentals reviewer" [shape=box];
  "Design architecture + file ownership map" [shape=box];
  "Design reviewer approves?" [shape=diamond];
  "Fundamentals reviewer approves?" [shape=diamond];
  "Both reviewers approved?" [shape=diamond];
  "Revise design based on feedback" [shape=box];
  "Designer needs more info?" [shape=diamond];
  "Re-spawn explorers for targeted research" [shape=box];
  "Finalize approved design + file ownership" [shape=doublecircle];
  "STOP: 10 rejections - escalate to user" [shape=octagon, style=filled, fillcolor=red, fontcolor=white];

  // Phase 3: execution
  "PHASE 3: EXECUTION" [shape=doublecircle];
  "Spawn executor/reviewer pairs + test designer" [shape=box];
  "Implement module (owns assigned files only)" [shape=box];
  "Execution reviewer approves?" [shape=diamond];
  "Revise code based on feedback" [shape=box];
  "Executor blocked by design issue?" [shape=diamond];
  "Report to lead - re-enter Phase 1" [shape=box];
  "Write test specs (wait for interface contracts)" [shape=box];
  "Finalize test specs" [shape=doublecircle];
  "Task code approved" [shape=doublecircle];
  "STOP: 10 exec rejections - escalate" [shape=octagon, style=filled, fillcolor=red, fontcolor=white];

  // Phase 4: testing + integration
  "PHASE 4: TESTING + INTEGRATION" [shape=doublecircle];
  "Spawn test executor/reviewer pairs" [shape=box];
  "Implement unit + integration tests from specs" [shape=box];
  "Test reviewer approves?" [shape=diamond];
  "Revise tests based on feedback" [shape=box];
  "Confirm all tests pass" [shape=doublecircle];
  "STOP: 10 test rejections - escalate" [shape=octagon, style=filled, fillcolor=red, fontcolor=white];

  // Final QA and user followups
  "FINAL QA" [shape=doublecircle];
  "Spawn QA" [shape=box];
  "Verify against full checklist" [shape=box];
  "QA approves?" [shape=diamond];
  "Route: bug->Ph3, design flaw->Ph1, missing test->Ph4, bad finding->Ph1" [shape=box];
  "Report verdict + evidence to user. Wait." [shape=box];
  "User confirms?" [shape=diamond];
  "User followup?" [shape=diamond];
  "Route per User Followups table" [shape=box];
  "Wait for user shutdown request" [shape=doublecircle];
  "STOP: 2 QA re-entries exhausted - escalate to user" [shape=octagon, style=filled, fillcolor=red, fontcolor=white];

  // Phase 1 edges
  "PHASE 1: RESEARCH" -> "Spawn explorers";
  "Spawn explorers" -> "Explore and cross-check findings";
  "Explore and cross-check findings" -> "All findings tagged with source tier?";
  "All findings tagged with source tier?" -> "Explore and cross-check findings" [label="no"];
  "All findings tagged with source tier?" -> "T5 claims remaining?" [label="yes"];
  "T5 claims remaining?" -> "Promote or discard T5 claims" [label="yes"];
  "Promote or discard T5 claims" -> "T5 claims remaining?";
  "T5 claims remaining?" -> "Produce findings document" [label="no"];
  "Produce findings document" -> "PHASE 2: DESIGN";

  // Phase 2 edges
  "PHASE 2: DESIGN" -> "Spawn designer + design reviewer(s) + fundamentals reviewer";
  "Spawn designer + design reviewer(s) + fundamentals reviewer" -> "Design architecture + file ownership map";
  "Design architecture + file ownership map" -> "Design reviewer approves?";
  "Design architecture + file ownership map" -> "Fundamentals reviewer approves?";
  "Fundamentals reviewer approves?" -> "Revise design based on feedback" [label="no (round 1-10)"];
  "Fundamentals reviewer approves?" -> "STOP: 10 rejections - escalate to user" [label="no (round 11+)"];
  "Fundamentals reviewer approves?" -> "Both reviewers approved?" [label="yes - with evidence"];
  "Design reviewer approves?" -> "Revise design based on feedback" [label="no (round 1-10)"];
  "Design reviewer approves?" -> "STOP: 10 rejections - escalate to user" [label="no (round 11+)"];
  "Design reviewer approves?" -> "Both reviewers approved?" [label="yes - with evidence"];
  "Both reviewers approved?" -> "Finalize approved design + file ownership" [label="yes"];
  "Both reviewers approved?" -> "Design architecture + file ownership map" [label="no - wait for other"];
  "Revise design based on feedback" -> "Designer needs more info?";
  "Designer needs more info?" -> "Re-spawn explorers for targeted research" [label="yes"];
  "Re-spawn explorers for targeted research" -> "Design architecture + file ownership map";
  "Designer needs more info?" -> "Design architecture + file ownership map" [label="no"];
  "Finalize approved design + file ownership" -> "PHASE 3: EXECUTION";

  // Phase 3 edges
  "PHASE 3: EXECUTION" -> "Spawn executor/reviewer pairs + test designer";
  "Spawn executor/reviewer pairs + test designer" -> "Implement module (owns assigned files only)";
  "Spawn executor/reviewer pairs + test designer" -> "Write test specs (wait for interface contracts)";
  "Implement module (owns assigned files only)" -> "Execution reviewer approves?";
  "Execution reviewer approves?" -> "Revise code based on feedback" [label="no (round 1-10)"];
  "Execution reviewer approves?" -> "STOP: 10 exec rejections - escalate" [label="no (round 11+)"];
  "Revise code based on feedback" -> "Executor blocked by design issue?";
  "Executor blocked by design issue?" -> "Report to lead - re-enter Phase 1" [label="yes"];
  "Report to lead - re-enter Phase 1" -> "Spawn explorers";
  "Executor blocked by design issue?" -> "Execution reviewer approves?" [label="no - retry"];
  "Execution reviewer approves?" -> "Task code approved" [label="yes - with evidence"];
  "Write test specs (wait for interface contracts)" -> "Finalize test specs";
  "Task code approved" -> "PHASE 4: TESTING + INTEGRATION" [label="immediately, per task"];
  "Finalize test specs" -> "PHASE 4: TESTING + INTEGRATION";

  // Phase 4 edges
  "PHASE 4: TESTING + INTEGRATION" -> "Spawn test executor/reviewer pairs";
  "Spawn test executor/reviewer pairs" -> "Implement unit + integration tests from specs";
  "Implement unit + integration tests from specs" -> "Test reviewer approves?";
  "Test reviewer approves?" -> "Revise tests based on feedback" [label="no (round 1-10)"];
  "Test reviewer approves?" -> "STOP: 10 test rejections - escalate" [label="no (round 11+)"];
  "Revise tests based on feedback" -> "Test reviewer approves?";
  "Test reviewer approves?" -> "Confirm all tests pass" [label="yes - with evidence"];
  "Confirm all tests pass" -> "FINAL QA" [label="after all per-task pipelines complete"];

  // Final QA and followup edges
  "FINAL QA" -> "Spawn QA";
  "Spawn QA" -> "Verify against full checklist";
  "Verify against full checklist" -> "QA approves?";
  "QA approves?" -> "Route: bug->Ph3, design flaw->Ph1, missing test->Ph4, bad finding->Ph1" [label="no (re-entry 1-2)"];
  "QA approves?" -> "STOP: 2 QA re-entries exhausted - escalate to user" [label="no (re-entry 3+)"];
  "Route: bug->Ph3, design flaw->Ph1, missing test->Ph4, bad finding->Ph1" -> "Verify against full checklist";
  "QA approves?" -> "Report verdict + evidence to user. Wait." [label="yes - all evidence cited"];
  "Report verdict + evidence to user. Wait." -> "User followup?";
  "User followup?" -> "Route per User Followups table" [label="yes"];
  "Route per User Followups table" -> "PHASE 1: RESEARCH" [label="new feature"];
  "Route per User Followups table" -> "PHASE 2: DESIGN" [label="behavior change"];
  "Route per User Followups table" -> "PHASE 3: EXECUTION" [label="bug fix / tweak"];
  "User followup?" -> "User confirms?" [label="no"];
  "User confirms?" -> "Wait for user shutdown request" [label="yes"];
  "User confirms?" -> "Report verdict + evidence to user. Wait." [label="no - keep waiting"];
}
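The loop limits encoded in the graph's edge labels (rounds 1-10 go back for revision, round 11+ escalates; QA re-entries 1-2 are routed, 3+ escalates) can be expressed as a small decision helper. This is a sketch only; the function names and return strings are hypothetical labels chosen for illustration.

```python
MAX_REJECTIONS = 10    # per reviewer pair: "no (round 1-10)" loops back
MAX_QA_REENTRIES = 2   # "no (re-entry 1-2)" is routed by type


def next_step_after_rejection(rejection_count):
    """rejection_count is the 1-based number of rejections within one pair."""
    if rejection_count <= MAX_REJECTIONS:
        return "revise"
    return "escalate_to_user"


def next_step_after_qa_rejection(reentry_count):
    """reentry_count is the 1-based number of QA rejections so far."""
    if reentry_count <= MAX_QA_REENTRIES:
        return "route_by_type"
    return "escalate_to_user"


print(next_step_after_rejection(10))    # revise
print(next_step_after_rejection(11))    # escalate_to_user
print(next_step_after_qa_rejection(3))  # escalate_to_user
```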

Event	Lead action
Coordinator requests reviewer/verifier/QA spawn	Verify spawn checklist. Additionally verify the prompt drives maximum scrutiny: includes original objective, all scrutiny rules, and adversarial framing. Reject weak prompts
Coordinator requests other spawn	Verify spawn checklist, create agent team / spawn teammate
Coordinator requests re-spawn (crash recovery)	Verify hang proof, then spawn
Coordinator reports phase transition	Verify rules were followed: pair invariant, reviews completed, reported issues addressed
Coordinator assigns new task to executor	Verify reviewer exists and previous work reviewed
Teammate reports coordinator doing work directly	Remind coordinator to delegate
Teammate reports unaddressed issue	Remind coordinator to create task and assign analysis
CCed "submitted" claim received	Verify the claim has sufficient proof. If not, remind coordinator not to accept it; demand evidence before marking complete
CCed blocker claim received	Verify the blocker claim is substantiated. If evidence is thin, remind coordinator to launch verification (explorer) before accepting
Reviewer/verifier/QA approves	Scrutinize the approval: does it cite specific evidence? Does it address all scrutiny rules? A shallow "LGTM" is not an approval; send it back with specific areas to examine
Any agent ignores reminder (3+ on same rule)	Misbehavior Recovery: force /compact, re-read skill, continue. If still misbehaving, escalate to user
Coordinator not responding	Check tmux panes to see what's happening. Still thinking/processing = acceptable (up to 1 hour). Stuck > 1 hour = re-spawn. Max 2 re-spawns, then escalate to user
Coordinator declares mission accomplished without explicit user confirmation	Reject. Force coordinator to report verdict + evidence to user and wait
Coordinator initiates shutdown without explicit user request	Reject. Team stays alive for followups
Coordinator skips pipeline stages on user followup	Verify against User Followups table. Demand justification or reject
Hourly audit (every 60 minutes)	Spot-check agent output for violations coordinator should have caught. Only intervene if coordinator missed them
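The "Coordinator not responding" rule above combines a time threshold with a re-spawn cap. A minimal sketch of that decision follows, assuming the lead tracks how long the coordinator has been stuck and how many re-spawns it has already performed; `coordinator_action` and its return labels are hypothetical.

```python
from datetime import timedelta

STUCK_THRESHOLD = timedelta(hours=1)  # processing for up to 1 hour is acceptable
MAX_RESPAWNS = 2                      # after 2 re-spawns, escalate to the user


def coordinator_action(stuck_for, respawns_so_far):
    """Decide what the lead does about an unresponsive coordinator."""
    if stuck_for <= STUCK_THRESHOLD:
        return "wait"
    if respawns_so_far < MAX_RESPAWNS:
        return "respawn"
    return "escalate_to_user"


print(coordinator_action(timedelta(minutes=30), 0))  # wait
print(coordinator_action(timedelta(hours=2), 0))     # respawn
print(coordinator_action(timedelta(hours=2), 2))     # escalate_to_user
```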

You are the [ROLE] for this agent team.
Your task: [SPECIFIC TASK]
Stop when: [OBSERVABLE COMPLETION CRITERION: concrete state, not "when you think it's done"]
Do NOT stop on: [COMMON FALSE-STOPS, e.g. "first draft ready", "happy path works", "build compiles"]

Context:
- Explorer findings: [summary or "see task list"]
- Design doc: [location or "not yet created"]
- File ownership: [YOUR FILES ONLY. Do not edit other files.]

Trust Hierarchy (tag ALL claims):
T1: Specs/RFCs/docs/source -> trusted | T2: Academic -> high trust
T3: Codebase analysis -> local facts | T4: Community -> verify first
T5: Training recall -> MUST promote or discard
Format: [T<tier>: <source>, <confidence: high/medium/low>]

Compliance:
- Critically analyze ALL inputs. You own bugs from unverified inputs.
- BEFORE writing code, invoke applicable skills via the Skill tool: go-coding-style (Go), python-coding-style (Python), testing-discipline (tests), superpowers:test-driven-development (code implementation), proof-driven-development (logic), superpowers:systematic-debugging + debugging-discipline (debugging). Follow every rule from invoked skills. Reviewer rejects non-compliance.
- Tag ALL factual claims: [T<tier>: <source>, <confidence>]. Untagged claims = reviewer rejection.
- Produce critique log (3+ issues found/fixed) before marking done
- git diff for secrets, static checks before commits, never push

[After both spawned:] Paired with [CONFIRMED NAME]. Message directly.
- [ROLE-SPECIFIC RULES]
- Set env CLAUDE_ROLE=[role name] (e.g. executor, reviewer, coordinator, explorer, designer, verifier, qa, brainstormer, snitch)
- [FOR EXECUTORS:] While implementing, actively look for code smell and design issues in all code you study or touch. Report ALL findings to coordinator; do not silently work around them.
- Mark task as "submitted" (not "complete") + notify coordinator when done. **CC the lead and snitch on all submitted, blocked, and completed claims.**
- If blocked, message coordinator with specifics. **CC the lead and snitch.**

Symptom	Fix
Using Agent tool instead of agent team	STOP. "Create an agent team", not Agent tool
Work without corresponding task	Create task immediately
Task waiting for other tasks before testing	Pipelines are per-task. Code approved + test specs ready → start testing immediately
Spawning custom-named teammates outside defined roles	Unbounded growth. Use role names: executor-N, explorer-N. Reassign idle teammates.
Executor assigned new task with unreviewed previous work	STOP. Assign to a different executor/reviewer pair instead. This executor waits for its reviewer
Executor spawned without paired reviewer already alive	STOP. Batch-spawn all reviewers, confirm all alive, then batch-spawn executors
Executor using workaround without notifying coordinator	STOP. Executor reports broken infra to coordinator first
Executor-reported issue silently ignored	Create task, assign executor to analyze. Validated -> full pipeline. Dismissed -> documented rationale
Coordinator or lead doing work (code, research, exploration, analysis)	Delegate to appropriate role
Coordinator using Agent tool (subagents)	STOP. Use teammates via tasks and messages, not subagents
Reviewer editing code/design/tests	STOP. Reviewers report only. Executor implements fixes
Agent praising peer output ("Great work!", "Excellent finding!") instead of critically analyzing it	No input trusted by default. Find what's wrong
Reviewer approving without evidence	Re-spawn with stricter prompt
T5 in explorer findings	Send back to verify or discard
Two teammates editing same file	Check file ownership map; reassign
No file ownership map in design	Reject design
Reviewer feedback ignored	Coordinator enforces: fix then re-review. Lead reminds if coordinator misses it
Mandatory skill not invoked	Reviewer rejects
Untagged factual claims in deliverable	Reviewer rejects
Spawn prompt uses [LIST APPLICABLE SKILLS] placeholder	Replace with exact skill names from Mandatory Skills table
11th rejection in same pair	Escalate: replace or re-scope
Teammate seems slow or won't respond	Not unresponsive. Coordinator checks for active process and file/git activity; a running build means they're working
Non-executor confirmed unresponsive	Re-spawn immediately
Executor confirmed unresponsive	Review its changes first (paired reviewer), then re-spawn for remaining work
No critique log	Reviewer rejects
Duplicated logic across modules	Check shared concerns register. Extract to designated shared location
Execution reviewer not loading coding style skill	STOP. Must load <language>-coding-style via Skill tool per Execution Reviewer Checklist
Test specs don't match interfaces	Test designer waits for contracts
Agent claim accepted without verification	Reviewers validate completion; explorers verify blockers and external blame
Capping executor count	One pair per independent unit of work. No limits
Skipping phases	All phases mandatory when this skill triggers
Early teammate shutdown	Keep alive until downstream consumers finish (see Lifecycle table)
Coordinator declares mission accomplished after QA approval	Report to user, wait for explicit confirmation. Mission complete only on user confirmation
Coordinator shuts team down without user request	STOP. Team alive until user requests shutdown
Pipeline stage skipped on user followup ("just a small fix")	Route per User Followups table. Default: more pipeline, not less
Only one design reviewer spawned in Phase 2	Spawn both: standard Design Reviewer + Fundamentals Design Reviewer in parallel
Trusting reviewer approval blindly	QA exists to catch reviewer mistakes
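One of the red flags above, "Two teammates editing same file", can be caught mechanically from the file ownership map. The sketch below assumes the map is a dictionary from teammate name to owned file paths; that shape, the function name, and the example paths are all hypothetical, since the skill does not prescribe a format.

```python
from collections import defaultdict


def find_ownership_conflicts(ownership):
    """Return {path: [teammates]} for every file claimed by more than one teammate."""
    owners = defaultdict(set)
    for teammate, files in ownership.items():
        for path in files:
            owners[path].add(teammate)
    return {path: sorted(names) for path, names in owners.items() if len(names) > 1}


ownership_map = {
    "executor-1": ["api/handler.go", "api/types.go"],
    "executor-2": ["api/types.go", "store/db.go"],
}
print(find_ownership_conflicts(ownership_map))
# {'api/types.go': ['executor-1', 'executor-2']}
```

An empty result means no overlap in the map itself; it does not prove that teammates stayed within their assigned files.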

Stage	Scope	When it starts
Research	Global	Immediately
Design	Global	After research
Execution + review	Per task	After design approved. Executor writes unit tests with the code.
Testing + review	Per task	After that task's code approved. Covers integration/E2E tests.

Followup type	Pipeline
Question / clarification	Explorer → answer to user. No code.
Trivial config tweak (1-line, no logic)	Executor → Reviewer → QA
Bug fix	Executor → Reviewer → Test Designer → Test Executor → Test Reviewer → QA
Behavior change in existing feature	Designer → Design Reviewer + Fundamentals Reviewer → full Phase 3 + 4 → QA
New feature	Full pipeline: Research → Design → both Design Reviewers → Phase 3 → Phase 4 → QA
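The followup table above can be read as a lookup from followup type to an ordered list of stages. A minimal sketch, with hypothetical dictionary keys standing in for the table's followup types, might look like this. Unknown types fall back to the full pipeline, following the document's stated default of "more pipeline, not less".

```python
# Keys are illustrative identifiers; the stage lists are transcribed from the table.
FOLLOWUP_PIPELINES = {
    "question": ["Explorer"],
    "trivial_config_tweak": ["Executor", "Reviewer", "QA"],
    "bug_fix": [
        "Executor", "Reviewer", "Test Designer",
        "Test Executor", "Test Reviewer", "QA",
    ],
    "behavior_change": [
        "Designer", "Design Reviewer + Fundamentals Reviewer",
        "Phase 3", "Phase 4", "QA",
    ],
    "new_feature": [
        "Research", "Design", "Design Reviewer + Fundamentals Reviewer",
        "Phase 3", "Phase 4", "QA",
    ],
}


def route_followup(kind):
    """Return the stage list for a followup; unknown kinds get the full pipeline."""
    return FOLLOWUP_PIPELINES.get(kind, FOLLOWUP_PIPELINES["new_feature"])


print(route_followup("trivial_config_tweak"))  # ['Executor', 'Reviewer', 'QA']
```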

Tier	Source	Treatment
T1	Specs, RFCs, official docs, source code	Trusted directly
T2	Academic papers, established references	High trust; verify if contested
T3	Codebase analysis (code, tests, git history)	Trust for local facts
T4	Community (SO, blogs, forums)	Verify independently
T5	LLM training recall (no source)	Promote to T1-T4 or discard
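Claim tags in the `[T<tier>: <source>, <confidence>]` format lend themselves to a quick mechanical check, for example to find T5 claims that still need promotion or discarding. The following is a rough sketch using Python's `re` module; the helper names and the sample sentence are made up for illustration.

```python
import re

# Matches tags such as "[T1: RFC 9110, high]"; the source may itself contain commas.
CLAIM_TAG = re.compile(r"\[T([1-5]):\s*([^\]]+?),\s*(high|medium|low)\]")


def parse_claim_tags(text):
    """Extract every claim tag in text as a dict with tier, source, and confidence."""
    return [
        {"tier": int(tier), "source": source.strip(), "confidence": confidence}
        for tier, source, confidence in CLAIM_TAG.findall(text)
    ]


def unresolved_t5(text):
    """Return the T5 tags, which must be promoted to T1-T4 or discarded."""
    return [tag for tag in parse_claim_tags(text) if tag["tier"] == 5]


sample = (
    "404 means Not Found [T1: RFC 9110, section 15.5.5, high]. "
    "The client retries 3 times [T5: training recall, low]."
)
print(len(parse_claim_tags(sample)))  # 2
print(unresolved_t5(sample))
# [{'tier': 5, 'source': 'training recall', 'confidence': 'low'}]
```

A regular expression can confirm that tags are present and well-formed, but it cannot tell whether an untagged sentence contains a factual claim; that judgment stays with the reviewer.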

Condition	Skill
Debugging	`superpowers:systematic-debugging` + `debugging-discipline`
Go code (*.go)	`go-coding-style`
Python code (*.py)	`python-coding-style`
Tests	`testing-discipline`
Code implementation	`superpowers:test-driven-development`
Logic implementation	`proof-driven-development`
Android device	`android-device`
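As a sketch of how the skill table above might be applied when filling in a spawn prompt, the helper below picks skills from file suffixes and from a list of activities. The activity keys (`"debugging"`, `"tests"`, and so on) and the function name are hypothetical; only the skill names come from the table.

```python
from pathlib import PurePath

SKILLS_BY_SUFFIX = {
    ".go": "go-coding-style",
    ".py": "python-coding-style",
}
SKILLS_BY_ACTIVITY = {
    "debugging": ["superpowers:systematic-debugging", "debugging-discipline"],
    "tests": ["testing-discipline"],
    "code_implementation": ["superpowers:test-driven-development"],
    "logic_implementation": ["proof-driven-development"],
    "android_device": ["android-device"],
}


def required_skills(files, activities):
    """Return skill names in first-seen order, without duplicates."""
    skills = []
    for path in files:
        skill = SKILLS_BY_SUFFIX.get(PurePath(path).suffix)
        if skill and skill not in skills:
            skills.append(skill)
    for activity in activities:
        for skill in SKILLS_BY_ACTIVITY.get(activity, []):
            if skill not in skills:
                skills.append(skill)
    return skills


print(required_skills(["cmd/main.go", "tools/gen.py"], ["tests"]))
# ['go-coding-style', 'python-coding-style', 'testing-discipline']
```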

Skill state	System state	Meaning	Who sets it
pending	pending	Created, not yet started	Coordinator
blocked_by_task	pending	Waiting for another task to complete first	Coordinator
in_progress	in_progress	Agent is actively working on it	Assigned agent
blocked	in_progress	Cannot proceed — needs resolution	Assigned agent (CC lead + snitch)
exploring	in_progress	Explorer investigating (research phase or blocker investigation)	Coordinator
unblocking	in_progress	Brainstormer + explorer working to resolve blocker	Coordinator (after blocker reported)
submitted	in_progress	Agent believes done, awaiting verification	Assigned agent (CC lead + snitch)
in_review	in_progress	Reviewer is actively reviewing	Coordinator (after submission checklist passes)
in_test_design	in_progress	Test designer writing test specs (code tasks only)	Coordinator (after reviewer approves)
in_testing	in_progress	Test executor implementing and running tests (code tasks only)	Coordinator (after test specs ready)
in_verification	in_progress	Verifier adversarially checking (non-code tasks)	Coordinator (after reviewer approves)
complete	completed	Proved done — reviewed, tested, evidence provided. ONLY after full verification	Coordinator
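The table above maps the skill's twelve fine-grained states onto three system states. A minimal lookup transcribed from the table is sketched below; `SYSTEM_STATE` and `to_system_state` are illustrative names, not part of any task-list API.

```python
# Skill state -> system state, transcribed from the table.
SYSTEM_STATE = {
    "pending": "pending",
    "blocked_by_task": "pending",
    "in_progress": "in_progress",
    "blocked": "in_progress",
    "exploring": "in_progress",
    "unblocking": "in_progress",
    "submitted": "in_progress",
    "in_review": "in_progress",
    "in_test_design": "in_progress",
    "in_testing": "in_progress",
    "in_verification": "in_progress",
    "complete": "completed",
}


def to_system_state(skill_state):
    """Translate a skill state; raises KeyError for a state the table does not define."""
    return SYSTEM_STATE[skill_state]


print(to_system_state("submitted"))  # in_progress
print(to_system_state("complete"))   # completed
```

Note that `submitted` maps to `in_progress`, not `completed`: only the coordinator marks a task complete, and only after full verification.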

From	To	Trigger	Route
Design Reviewer	Designer	Design flaw	Direct (paired)
Designer	Explorers	Needs info	Coordinator requests lead to re-spawn
Execution Reviewer	Executor	Code issue	Direct (paired)
Executor	Coordinator	Design issue or code smell found	Coordinator assigns executor to analyze; minor: executor fixes directly, design-level: full pipeline
Test Reviewer	Test Executor	Test issue	Direct (paired)
Any agent	Coordinator	Findings received	Coordinator assigns independent verification before accepting
Any teammate	Coordinator	Blocker reported	Blocker Resolution Protocol: simultaneously launch brainstormer + explorer
QA	Coordinator	Any verdict (approval or rejection)	CC snitch. On approval, QA must demonstrate sufficient testing was performed (which criteria, what evidence, direct vs proxy). On rejection, route by type. Snitch looks for gaps in testing

Role	Receives	Excludes
Designer	Explorer findings summary + source tags	Raw tool outputs, full files
Executor	Own module's design + interface contracts	Other modules, explorer findings
Reviewer	Executor's original objective (with full context), diff, relevant design, enriched interface contracts, shared concerns register, all scrutiny rules (coding style, claim tagging, OWASP, etc.)	Full codebase, other modules
Test Executor	Test specs + contracts + public APIs	Implementation details
QA	Original objectives (all tasks, with full context), phase summaries, test results, all scrutiny rules	Teammate conversation histories

Role	Alive until	Why
Explorers	Design approved	Designer may need more info
Designer + Reviewer	Phase 3 end	Design issues re-enter full pipeline
Executors + Reviewers	Phase 4 end	Test failures trace to code
Test Designer	Phase 4 end	Test executors need spec clarification
Test Executors + Reviewers	User shutdown	User may request followups
Snitch	User shutdown	Monitors all claims throughout
QA	User shutdown	Re-spawned fresh per QA cycle
Coordinator + Lead	User shutdown	Stand by for user followups

Agent Teams Execution

Pipeline Model

User Followups

Roles

Team Sizing

Mandatory Compliance

Model and Effort Level

Critical Analysis of All Inputs

Claim Verification

Mandatory Skills

Task States

Git & Security

Flow (per-task after design)

Checkpoints & Re-Entry

Design Output Requirements

Testing Protocol

Feedback Loops

Loop Limits

Crash Recovery

Misbehavior Recovery (any agent)

Blocker Resolution Protocol

Reviewer Protocol

Design Reviewer — Additional Rejection Criteria

Execution Reviewer Checklist

Executor Disputes

Multi-Reviewer (2+)

QA Protocol

Coordinator Responsibilities

Lead Responsibilities

Spawn Checklist (lead verifies before every spawn)

Context Budgeting

Teammate Lifecycle

Spawn Prompt Template

Red Flags
