技能檔案

Exhaustive Research: $ARGUMENTS

Name: Exhaustive Research: $ARGUMENTS
Author: shafir360

Conducts massive-scale, multi-layered research using a 3-tier model strategy (Haiku pre-screening, Sonnet data collection, Opus synthesis) with hierarchical tree-reduction to process hundreds of sources without overloading any single agent. Accepts optional /deep-research output as bootstrap context. Configurable depth levels: standard (~1hr, ~40 sources), deep (~2hr, ~100 sources), or exhaustive (~4hr, ~300 sources). Produces comprehensive cited reports with per-finding confidence levels, cross-source contradiction analysis, circular sourcing detection, and source quality scoring. Use when the user says "exhaustive research", "massive research", "scale up research", "go deeper", "research everything about", "exhaustive analysis", or needs research far beyond what /deep-research provides.

shafir3600 星標2026年3月17日

職業
分類: 數據工程

技能內容

You are conducting massive-scale research that processes hundreds of sources through a hierarchical tree of agents. This skill uses a 3-tier model strategy: Haiku for cheap pre-screening, Sonnet for data collection and tree merging, Opus for final synthesis and adversarial review.

Rate source credibility using references/source-evaluation.md. Use depth parameters from references/depth-config.md. Use tree-reduction algorithm from references/tree-reduction.md. Use screening rubric from references/screening-rubric.md. Use report template from references/report-template.md. Use checkpointing and progress reporting from .

相關技能

Exhaustive Research: $ARGUMENTS | Skills Pool

技能檔案

Exhaustive Research: $ARGUMENTS

shafir3600 星標2026年3月17日

職業
分類: 數據工程

技能內容

相關技能

For each wave of up to 10 agents:
  1. Spawn all agents in the wave in a single parallel message
  2. Collect results as they return
  3. Once all other agents in the wave have returned, the wave is effectively complete.
     Any agent that has not returned by then is considered TIMED-OUT — do not wait further.
     (The non-returning agent had the entire duration of all other agents' processing to respond.)
  4. Mark timed-out agents as TIMED-OUT, note coverage gap, do NOT retry now
  5. Write mini-reports to disk for all completed agents in this wave
  6. **Context release**: After writing mini-reports to disk, release full report text from active context.
     Keep only a manifest line per report: `batch_{N}.md — [1-line summary: top claim + source count]`
     Do NOT reference the full mini-report text again until Phase 5 reads it from disk.
  7. Emit a wave-level progress note: "Wave [N]/[total] complete: [X]/[10] agents returned"
  8. Start next wave immediately — do not wait for timed-out agents

NEVER generate citation URLs from memory — only use URLs that research agents explicitly returned. Citation failure rates exceed 60% in AI systems.
All agents MUST be deep-researcher subagent type — this structurally prevents sub-agent spawning and skill invocation. No exceptions.
Every agent prompt MUST include the anti-recursion instruction verbatim from references/tree-reduction.md. This is Layer 2 defense-in-depth.
No agent receives information about other agents or the tree structure — agents are isolated leaf nodes that think they're doing standalone tasks (Layer 3: information isolation).
Tree depth hard cap: 3 levels — even if group math suggests more, stop at 3. Pass remaining reports directly to Opus synthesis.
Skip-and-note over retry storms — max 1 retry per agent, only on timeout (never on bad output). If an agent returns bad output, skip it and note the coverage gap. Retrying bad output risks cost explosions (documented: $10+/day from naive retries on large contexts). Never retry entire batches.
Phase timeouts are hard — if an agent hasn't returned by phase timeout, skip it and proceed. No agent is waited on indefinitely. Overall hard caps: Standard=75min, Deep=150min, Exhaustive=270min.
Use tiered models per depth level: Haiku for screening and validation gates (Sonnet validation for Exhaustive only). Sonnet for data collection, merging, query generation, and gap analysis (all depths). Opus for skeptic and final synthesis only. See references/depth-config.md for the full per-phase model table. Never use Haiku for synthesis or Opus for search.
Validation gates are structural-only — check: citation URLs present? metadata block complete? word count in range? sections parseable? Do NOT auto-reject based on semantic quality judgments (only ~53% accurate). Flag suspicious outputs for downstream awareness, don't kill them.
Confidence is source-agreement-based, not self-reported — LLMs have <30% calibration accuracy. Count independent sources per claim: 3+ = HIGH, 2 = MEDIUM, 1 = LOW.
Deduplication is soft-merge — flag duplicates and merge unique claims, don't hard-delete. Over-deduplication reduces source diversity.
Always save the report — even if the pipeline fails partway, save a partial report with whatever data was collected. Never lose completed work.
Checkpoint after every phase — save a JSON checkpoint file per references/checkpointing.md. Each agent's output file is an implicit sub-phase checkpoint. On failure, only re-run agents whose output files are missing.
Emit Phase Recap after every phase — display phase name, agent counts, source counts, elapsed time, issues, and next phase per the template in references/checkpointing.md. Users need visibility during 1-4 hour runs.
Pause on high failure rates — if >30% of agents in any phase fail, or <20% of sources pass screening, pause and ask the user before continuing. Do not silently degrade on catastrophic failures.
Never wait indefinitely for a single non-returning agent — when spawning agents in parallel waves, once all other agents in the wave have returned, the wave is complete. Any agent that has not returned by then is TIMED-OUT (it had the entire duration of all other agents' processing to respond). Write outputs for completed agents, note skipped agents as coverage gaps, and proceed immediately. Apply to ALL parallel-spawn phases (2, 3, 4, 5, 7). Never hold an entire wave hostage to one non-responding agent.
Every agent prompt MUST explicitly forbid WebSearch and WebFetch UNLESS those tools are part of the agent's defined task. Only reader agents (Phase 4) and collector/skeptic agents (Phase 7) are permitted to use WebFetch (max 2 calls each). All other agents — query-gen (Phase 2), screeners (Phase 3), merge agents (Phase 5), validators (Phase 4b/5b), gap analysis (Phase 6), synthesis (Phase 8) — MUST include "Do NOT use WebSearch or WebFetch" in their prompts. If an agent cannot verify a claim, it notes uncertainty in its output — it does not fetch.
Phase 0 (Bootstrap) MUST NOT invoke any tools — it only parses text the user pasted into their initial prompt, or reads a file path the user specified. Do NOT invoke Agent, Skill, WebSearch, or WebFetch during bootstrap ingestion. If the user wants to pass /deep-research output, they must paste it or provide a file path.
Phase 7 is a single pass — collector and skeptic results do NOT trigger additional rounds of data collection. Any new gaps or contradictions identified by skeptics are documented in the report as limitations, not fed back into a new collector round. There is no Phase 7b.
Context-aware streaming (Deep/Exhaustive) — the orchestrator MUST NOT hold all agent responses in context simultaneously. After each wave of agents completes and responses are saved to disk, release full report text from active context and keep only a compact manifest (file path + 1-line summary). When downstream phases need prior reports, read them from disk using the Read tool. Phase 8 synthesis receives only the highest-level tree outputs (not all mini-reports), plus the manifest, gap analysis, and Round 2 results. This prevents context overflow (~1,001K without mitigation) and "lost in the middle" quality degradation. See the Context Management section for full details.

Exhaustive Research: $ARGUMENTS

Exhaustive Research: $ARGUMENTS

Context Management (Critical for Deep/Exhaustive Depth)

Pre-Phase: Resume Check

Phase 0: Bootstrap Ingestion (~1 minute)

Phase 1: Objective Clarification (~2-5 minutes)

1a: Parse depth from arguments

1b: Confirm save location

1c: Clarify research objective

1d: Research plan

Phase 2: Query Generation & Search (~3-10 minutes)

2a: Generate diverse search queries

2b: Execute searches

Phase 3: Source Pre-Screening (~5-20 minutes)

Phase 4: Deep Reading — Round 1 (~10-40 minutes)

4a: Assign sources to reader agents

4b: Validation gate

Phase 5: Tree Reduction (~5-20 minutes)

5a: Level 1 merge

5b: Validation gate (Deep/Exhaustive only)

5c: Level 2 merge (Exhaustive only)

5d: Timeout handling

Phase 6: Gap Analysis & Round 2 Planning (~3-10 minutes)

Phase 7: Targeted Deep Dives — Round 2 (~10-40 minutes)

Collector agents (Sonnet)

Skeptic agent(s) (Opus) — spawn concurrently with collectors

Phase 8: Final Synthesis (~5-15 minutes)

Phase 9: Report Generation & Self-Check (~5-15 minutes)

9a: Generate report

9b: Self-check audit

9c: Save report

9d: Present summary

Rules

Clickhouse Io

Clickhouse Io

Claude Devfleet

Clickhouse Io

Ai First Engineering

Postgres Patterns