Use when multiple aspects of a problem need parallel investigation with findings shared and challenged between investigators
Multiple teammates investigate different aspects simultaneously, then share findings and challenge each other's conclusions. The key value is adversarial review — not just parallel work, but structured debate.
Core principle: Investigate in parallel, challenge in rounds, synthesize with evidence.
Management theory: Tuckman's Storming phase is intentional here. Psychological Safety (Edmondson) ensures agents flag problems without hesitation.
Don't use when:
coordinator (lead)
├── researcher × 2-3 (each investigates a different aspect)
└── devil-advocate × 1 (challenges all findings)
Belbin coverage:
Sizing: 3-4 total (coordinator + 2-3 investigators). Devil-advocate can double as one of the researchers if team is small.
The coordinator:
Spawn prompt template:
You are investigating [ASPECT] of the problem: [PROBLEM].
Your scope: [SPECIFIC AREA]
Do NOT investigate: [OTHER AREAS — those are covered by teammates]
When done, report:
- What you found (with evidence: file paths, line numbers, logs)
- Your interpretation
- Confidence level (high/medium/low)
- What could disprove your finding
Each researcher works independently:
Devil-advocate waits for findings, then:
Coordinator facilitates:
Debate protocol:
For each finding:
1. Researcher states conclusion + evidence
2. Devil-advocate states objection or alternative
3. Researcher rebuts or concedes
4. Coordinator records: CONFIRMED / DISPUTED / NEEDS MORE EVIDENCE
For CONFIRMED findings: include in final synthesis For DISPUTED findings: assign follow-up investigation For NEEDS MORE EVIDENCE: researcher gathers more data
Coordinator produces:
Coordinator creates team:
Researcher A → "Investigate database query changes in last 2 weeks"
Researcher B → "Investigate frontend bundle size and rendering"
Researcher C → "Investigate infrastructure/deployment changes"
Devil-advocate → "Challenge all findings, propose alternative causes"
Results:
A: "Found N+1 query introduced in commit abc123" (high confidence)
B: "Bundle size unchanged, no rendering issues" (high confidence)
C: "No infra changes in timeframe" (high confidence)
Devil-advocate: "A's finding explains DB load but not the 2s delay
on pages that don't hit that query. Check CDN cache invalidation."
→ Follow-up investigation reveals CDN config change was the second cause.
→ Without devil-advocate, team would have stopped at the N+1 query fix.
| Mistake | Fix |
|---|---|
| Researchers investigate same area | Give explicit non-overlapping scopes |
| Devil-advocate too agreeable | Prompt must say "finding no issues is a failure" |
| Skipping debate phase | Debate is the VALUE — don't skip it |
| Coordinator anchors on first finding | Wait for ALL researchers before synthesizing |
| No evidence, only opinions | Require file:line references or test output |
Pre-requisite: team-orchestrator:orchestrating-work routes here Post-requisite: team-orchestrator:session-reflection records learnings