Use when conducting research before planning or implementation. Enforces structured investigation (dynamic area coverage from catalog), source triangulation, evidence-over-assumption discipline, and synthesis quality gates that prevent shallow or skipped research.
Research Methodology prevents Claude from skipping research, doing shallow research, or treating research as a checkbox activity. It applies across contexts — from deep multi-agent phase research to lightweight 1-agent focused research. The methodology scales by depth, but the discipline is constant.
Two responsibilities:
RESEARCH BEFORE DECISIONS. EVIDENCE BEFORE ASSUMPTIONS.
Your training data is stale. Your "knowledge" is a guess.
The codebase has patterns you haven't read. The ecosystem has changed.
If you cannot cite a SOURCE for a claim, it is an assumption, not a finding.
Never recommend a technology, pattern, or approach without investigating alternatives.
Never skip research because the answer "seems obvious."
This is non-negotiable. No time pressure, no "I already know this," no user request overrides this.
USER = Decision Maker. Approves scope, reviews findings, makes final choices.
CLAUDE = Researcher. Investigates, synthesizes, presents options with evidence.
NEVER present a single option as "the answer."
NEVER assume user wants what you'd recommend. Present trade-offs.
"I recommend X" is fine. "Here's X" without alternatives is not.
This skill is used by commands at different depth levels. Match depth to context.
| Context | Depth | Areas | Agents | Output |
|---|---|---|---|---|
| /st:phase-research | Deep | Dynamic areas from catalog | Parallel per wave + 1 synthesizer | Area files + SUMMARY.md |
| /st:init | Deep | Dynamic areas from catalog | Parallel per wave + 1 synthesizer | research/ directory |
| /st:brainstorm | Medium | 2 rounds (broad → focused) | AI direct (no subagents) | Inline findings |
| /st:plan (optional) | Light | 1 focused area | 1 researcher | 1-2 page inline |
The methodology applies at ALL depths. Even light research must triangulate sources and present alternatives. Deep research just does it more thoroughly.
Research areas are selected dynamically from the catalog (references/research-catalog.md), not hardcoded. Each area has a specific scope to prevent agents from overlapping.
Area selection: Load references/research-catalog.md → evaluate trigger and brownfield conditions → select relevant areas → group into waves by dependency.
Core areas (evaluate for every project): STACK, LANDSCAPE, ARCHITECTURE, PITFALLS. Domain-specific areas (include when relevant): SECURITY, PERFORMANCE, ACCESSIBILITY, DATA, INTEGRATION. Custom areas: AI may propose with justification — always requires user confirmation.
For detailed per-area guidance (search strategies, comparison templates, scope boundaries), see references/research-areas.md.
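The wave-grouping rule can be made concrete with a small sketch. This is illustrative only: the area names and dependency map below are hypothetical, and real dependencies come from the catalog's `needs` fields.

```python
def assign_waves(deps: dict[str, list[str]]) -> dict[str, int]:
    """Assign each research area to a wave: wave = max(wave[deps]) + 1.

    Areas with no dependencies land in Wave 1. Assumes deps form a DAG.
    """
    waves: dict[str, int] = {}

    def wave_of(area: str) -> int:
        if area not in waves:
            parents = deps.get(area, [])
            # No parents → Wave 1; otherwise one wave after the latest dependency.
            waves[area] = 1 if not parents else max(wave_of(p) for p in parents) + 1
        return waves[area]

    for area in deps:
        wave_of(area)
    return waves

# Hypothetical dependency map (not from the catalog):
deps = {
    "STACK": [],
    "LANDSCAPE": [],
    "ARCHITECTURE": ["STACK"],
    "PITFALLS": ["STACK", "ARCHITECTURE"],
}
# assign_waves(deps) → STACK and LANDSCAPE in Wave 1, ARCHITECTURE in Wave 2, PITFALLS in Wave 3
```

All areas sharing a wave number are spawned in parallel in one message; the next wave starts only after the previous one completes.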
Phase 1: SCOPE DEFINITION
↓ (gate: research question is specific and bounded)
Phase 2: MULTI-SOURCE INVESTIGATION
↓ (gate: 3+ independent sources consulted per key claim)
Phase 3: EVIDENCE EVALUATION
↓ (gate: findings ranked by evidence quality)
Phase 4: SYNTHESIS & PRESENTATION
Gate: Cannot proceed until the research question is specific enough to be answered. "Research authentication" is too broad. "Compare JWT vs session-based auth for this Express API with these constraints" is specific.
Three source categories. Minimum 2 of 3 required for key claims.
| Category | Sources | Strength |
|---|---|---|
| Web | Official docs, tutorials, issue trackers, benchmarks, blog posts | Current, external |
| Codebase | Existing patterns, conventions, dependencies already in use | Proven in this project |
| Ecosystem | npm/pip/cargo stats, GitHub stars/issues, release cadence, community activity | Adoption signals |
For each finding: record the source, the claim, and the confidence level.
Gate: Cannot proceed with fewer than 3 independent data points for key claims. Single-source claims must be flagged as low confidence.
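The triangulation gate can be made mechanical by recording each finding with its sources and deriving confidence from the count of independent sources. The record shape below is a sketch; the field names are assumptions, not a prescribed format.

```python
from dataclasses import dataclass, field

@dataclass
class Finding:
    claim: str
    sources: list[str] = field(default_factory=list)  # one entry per independent source

    @property
    def confidence(self) -> str:
        # Gate: fewer than 3 independent data points → flag as low confidence.
        return "high" if len(self.sources) >= 3 else "low"

f = Finding(
    claim="Library X dropped support for the project's Node version",
    sources=["official changelog", "issue tracker entry", "release notes"],
)
# f.confidence → "high"; a single-source Finding would report "low"
```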
Gate: Every recommendation must cite at least one strong evidence source. Recommendations with only weak evidence must be flagged.
| Strong evidence | Weak evidence |
|---|---|
| Official documentation (version-matched) | Blog post without benchmarks |
| Benchmarks with methodology | "In my experience..." |
| Existing codebase pattern (you read the code) | Training data memory ("I know that...") |
| Issue tracker with reproduction steps | Stack Overflow answer (may be outdated) |
| Package download stats + release dates | GitHub stars alone |
| Verified working example in codebase | "Should work" without testing |
| Changelog/release notes | Second-hand reports |
Training data is ALWAYS weak evidence. Your training data is a snapshot of the internet at a point in time. Verify externally before presenting as fact.
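The strong/weak split above can be applied as a simple flagging check: a recommendation is flagged when none of its evidence falls in a strong category. The category labels are illustrative condensations of the table, not a fixed taxonomy.

```python
# Condensed from the strong-evidence column above (labels are illustrative).
STRONG = {
    "official-docs", "benchmark", "codebase-pattern", "issue-repro",
    "download-stats", "verified-example", "changelog",
}

def needs_flag(evidence_kinds: list[str]) -> bool:
    """True when a recommendation rests on weak evidence only.

    Training data is always weak, so it never satisfies the gate by itself.
    """
    return not any(kind in STRONG for kind in evidence_kinds)

# needs_flag(["training-data", "blog-post"]) → True (weak only: flag it)
# needs_flag(["blog-post", "benchmark"]) → False (at least one strong source)
```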
Check every source for staleness (publication date, version match against the project's actual dependencies), and guard against these research biases:
| Bias | Trap | Antidote |
|---|---|---|
| Confirmation | Searching for evidence that supports your initial preference | Search for evidence AGAINST your top recommendation first |
| Familiarity | Recommending tools/patterns you "know" from training data | Include at least one option you haven't previously recommended |
| Authority | Treating popular opinion as truth ("React is best for...") | Evaluate on project-specific criteria, not general reputation |
| Anchoring | First technology found becomes the default, others compared to it | Evaluate each option independently before comparing |
| Recency | Newest library/version assumed best | Check stability, community size, production readiness |
| Survivorship | Only looking at successful projects using X | Search for failure stories, migration-away-from posts |
When multiple research agents produce output (phase-research, init), the synthesizer must:
1. Read all outputs completely. Not just summaries or first paragraphs.
2. Identify agreements. Where all agents converge → high confidence.
3. Surface conflicts. Where agents disagree → present both sides with evidence.
4. Cross-validate. If Stack recommends X but Pitfalls warns about X → highlight the tension.
5. Rank recommendations by evidence strength, not by word count or confidence language.
6. Build Impact Analysis. Compare research findings against the original approach (from context inputs: PROJECT.md, ROADMAP.md, REQUIREMENTS.md). For each key aspect, categorize as KEEP (compatible), REPLACE (incompatible — state alternative), ADD (needed but not in original), or REMOVE (in original but no longer needed). This gives users immediate visibility into what changed, what stayed, and why — not just what went wrong.
Produce SUMMARY.md with THREE clearly separated sections:
## Findings (Reference Material)
[Key findings, evidence, comparisons — informational only]
## Impact Analysis
| # | Aspect | Original Approach | Research Recommends | Change | Reason |
|---|--------|-------------------|---------------------|--------|--------|
| 1 | [aspect] | [from PROJECT/ROADMAP] | [finding] | KEEP/REPLACE/ADD/REMOVE | [why] |
## Decisions Requiring Confirmation
[Each decision that needs user choice before it can be applied to REQUIREMENTS.md or ROADMAP.md]
| # | Decision | Research Recommends | Alternatives | Status |
|---|----------|-------------------|-------------|--------|
| 1 | Project structure | Turborepo monorepo | Single app, Polyrepo | Pending user choice |
| 2 | Database | Supabase PostgreSQL | PlanetScale, Neon | Pending user choice |
The calling command (init, phase-research) is responsible for presenting these decisions to the user. SUMMARY.md only extracts and lists them.
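The KEEP/REPLACE/ADD/REMOVE categorization can be sketched as a comparison between the original aspects and the researched recommendations. This is a simplification: real KEEP-vs-REPLACE calls are a judgment about compatibility, not string equality, and the aspect names below are hypothetical.

```python
def impact(original: dict[str, str], recommended: dict[str, str]) -> dict[str, str]:
    """Categorize each aspect as KEEP, REPLACE, ADD, or REMOVE.

    `original` maps aspect → approach from PROJECT.md/ROADMAP.md;
    `recommended` maps aspect → approach from the research findings.
    """
    changes: dict[str, str] = {}
    for aspect in original.keys() | recommended.keys():
        if aspect not in recommended:
            changes[aspect] = "REMOVE"   # in original, no longer needed
        elif aspect not in original:
            changes[aspect] = "ADD"      # needed, but not in original plan
        elif original[aspect] == recommended[aspect]:
            changes[aspect] = "KEEP"     # compatible with the original approach
        else:
            changes[aspect] = "REPLACE"  # incompatible; alternative stated
    return changes

# Hypothetical aspects:
# impact({"db": "SQLite", "auth": "sessions"},
#        {"db": "SQLite", "auth": "JWT", "cache": "Redis"})
# → db: KEEP, auth: REPLACE, cache: ADD
```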
Frame output as findings, not instructions. Follow research-boundaries rules for output framing, language choice (descriptive not prescriptive), and header templates. See core-principles/references/research-boundaries.md for full rules and examples.
Gate: SUMMARY.md must address conflicts. If it mentions none, the synthesizer missed something — there are always trade-offs.
These thoughts mean you are about to violate the methodology:
| Thought | What to do instead |
|---|---|
| "I already know the best approach" | Your training data is a guess. Verify against current sources. |
| "Research isn't needed for this" | The command already decided research is needed. Do it properly. |
| "Let me quickly mention a few options" | Quick = shallow. Follow the protocol. Investigate each option. |
| "X is the industry standard" | Says who? Cite the source. Industry standards change. |
| "Everyone uses X" | Popularity is not evidence of fitness. Check trade-offs for THIS project. |
| "Based on my knowledge..." | Your knowledge is training data. Find a current source. |
| "I'll research this later / in more detail" | Research happens NOW. This IS the research step. |
| "The user probably wants X" | Present options. User decides. Your job is to inform, not assume. |
| "This technology is better because it's newer" | Newer is not better. Compare on actual criteria. |
| "Let me skip this area, it's obvious" | Selected areas are not skippable. Even "obvious" domains have surprises. |
| "This finding is strong enough to be a MUST requirement" | Research findings are suggestions. See core-principles/references/research-boundaries.md. |
| Excuse | Reality |
|---|---|
| "Simple project doesn't need research" | The command determined research is needed. Depth varies, methodology doesn't. |
| "I'm an AI, I already know this domain" | You know what your training data said. The ecosystem may have changed. |
| "User is in a hurry" | Shallow research leads to wrong decisions. 20 min research saves days of rework. |
| "There's really only one good option" | Then research will confirm that quickly. If there's truly one option, proving it takes 5 minutes. |
| "Research is done, just need to write it up" | Writing IS research. Synthesis reveals gaps. If you can't write it clearly, you haven't understood it. |
| "The codebase already uses X, so we should keep using X" | Consistency has value, but verify X is still the right choice. Present the trade-off. |
| "I found a great article that covers everything" | One source is not research. Cross-reference with at least 2 more. |
| "The official docs say to do it this way" | Docs describe one way. Are there alternatives? What are the trade-offs? |
IRON LAW:
RESEARCH BEFORE DECISIONS. EVIDENCE BEFORE ASSUMPTIONS.
Training data is not evidence. Cite sources or flag as assumption.
AREAS (dynamic from catalog):
Select areas based on trigger + brownfield conditions.
Core: STACK, LANDSCAPE, ARCHITECTURE, PITFALLS
Domain: SECURITY, PERFORMANCE, ACCESSIBILITY, DATA, INTEGRATION
Custom: max 2, requires user confirmation, max 8 total
WAVE ORDER (dynamic from dependencies):
wave = max(wave[deps]) + 1
Areas with no deps → Wave 1. Spawn all per wave in one message.
PROTOCOL:
1. Scope (bound the question)
→ gate: specific research question
2. Investigate (3+ sources, 2+ categories)
→ gate: 3+ independent data points per key claim
3. Evaluate (rank evidence, surface conflicts)
→ gate: every recommendation cites strong evidence
4. Synthesize (options, trade-offs, unknowns)
DEPTH:
Deep (phase-research, init): dynamic areas from catalog, parallel agents, full output
Medium (brainstorm): 2 rounds, inline
Light (plan): 1 area, focused, 1-2 pages
SOURCES (min 2 of 3 categories):
Web (docs, benchmarks, issues) | Codebase (patterns, deps) | Ecosystem (stats, releases)
NEVER:
Single option without alternatives | Claims without sources |
Skip areas at Deep depth | Trust training data alone |
Resolve conflicts silently | Skip codebase scan
| Mistake | Fix |
|---|---|
| Recommending a technology without checking alternatives | Always present 2-3 options with trade-offs. Even if one is clearly better, show why. |
| Citing training data as fact ("React 18 introduced...") | Verify via web search or docs. Training data may be wrong or outdated. |
| Skipping codebase scan | Existing patterns are the strongest evidence. The project already uses tools — check them first. |
| Research output is a knowledge dump, not actionable | Every finding must answer: "So what? What should the user DO with this?" |
| Resolving conflicts between sources silently | Surface conflicts explicitly. User decides. Research informs, doesn't decide. |
| All recommendations are the same technology/approach | Check for familiarity bias. Force yourself to evaluate at least one unfamiliar option. |
| Research is broad but shallow (mentions many things, investigates none) | Better to deeply investigate 3 options than shallowly mention 10. |
| Pitfalls section is generic ("watch out for performance") | Pitfalls must be specific to THIS stack, THIS architecture, THIS domain. |
| Landscape section only lists competitors without analysis | Compare on criteria relevant to the project, not just list names. |
| SUMMARY.md has no conflicts section | There are ALWAYS trade-offs. No conflicts = missed something. |
| Research uses MUST/SHOULD as if setting requirements | See core-principles/references/research-boundaries.md for output language rules |
This section defines the complete orchestration flow for deep research (used by /st:init and /st:phase-research). Commands delegate to this flow instead of implementing their own.
The orchestration accepts parameters from the calling command:
- `context_inputs`: files to read for research context (e.g. PROJECT.md, CONTEXT.md, ROADMAP.md, REQUIREMENTS.md, ARCHITECTURE.md)
- `output_dir`: where to save research results (e.g. `.superteam/research/`, `.superteam/phases/[name]/research/`)
- `research_context`: label for the research session (e.g. "init", "phase 3: Authentication", "Q2 Stack Review")
- `commit_message`: format for the final commit (e.g. "research: {research_context} — {areas}")

Planning steps:
1. Read `context_inputs` files: extract domain, tech decisions, constraints, greenfield/brownfield status
2. Select areas from the catalog (`references/research-catalog.md`), evaluating trigger conditions and `needs` fields
3. Group areas into waves by dependency: `wave = max(wave[deps]) + 1`
4. Save `RESEARCH-PLAN.md` to `output_dir` before presenting to user:

# Research Plan
Created: [date]
Context: [research_context]
Status: planning
## Selected Areas
| Area | Focus | Wave | Status |
|------|-------|------|--------|
| STACK | [focus] | 1 | pending |
| ARCHITECTURE | [focus] | 2 | pending |
...
## Wave Structure
Wave 1: [areas] → Wave 2: [areas] → ...
## Decisions
- [area] included because: [reason]
- [area] skipped because: [reason]
RESEARCH PLAN — [research_context]
Wave [N] (parallel, [M] agents):
├─ [AREA]: [focus description]
└─ [AREA]: [focus description]
...
Total: [X] agents, [Y] waves
Saved: [output_dir]/RESEARCH-PLAN.md
Adjust areas or proceed?
If `config.research_auto_approve` is true: display the plan and proceed immediately (EXCEPT: if custom areas are proposed, always pause for confirmation).

Execution steps:
1. Update `RESEARCH-PLAN.md` status to `in-progress`
2. Spawn each wave's agents in parallel. Never use `run_in_background: true`. All agents MUST run in foreground so the orchestrator can read outputs immediately.
3. Give each agent: `context_inputs`, relevant prior wave outputs, and its specific focus area
4. After each wave, update `RESEARCH-PLAN.md` — set completed areas' status to `done`
5. Spawn the synthesizer, which writes `SUMMARY.md` to `output_dir` (with both "Findings" and "Decisions Requiring Confirmation" sections)
6. Present the summary to the user:

RESEARCH SUMMARY — [research_context]
Areas researched: [list of areas]
Key findings (reference material — auto-saved):
1. [finding]
2. [finding]
Decisions requiring confirmation (NOT yet applied):
1. [decision] — [recommended option] vs [alternatives]
2. [decision] — [recommended option] vs [alternatives]
Conflicts found:
[if any: describe + suggest resolution]
Present findings as reference material. Decisions are listed but NOT confirmed here — the calling command (init step 5.5, phase-research) is responsible for presenting each decision individually for user choice.
Wait for user review, answer follow-up questions if needed.
Completion steps:
1. All area outputs were already saved to `output_dir` during execution
2. Update `RESEARCH-PLAN.md` status to `completed`
3. Commit via `superteam:atomic-commits`, using the `commit_message` format provided by the calling command

Resume: if `RESEARCH-PLAN.md` exists with status `in-progress`, compare area statuses (`done` vs `pending`) and re-run only the pending areas. This makes research resilient to session interruptions — the plan on disk is the source of truth.
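As a sketch, resuming reduces to filtering the plan's area rows by status. The row shape assumes `RESEARCH-PLAN.md` has been parsed into records with `area` and `status` keys, which is an assumption about the plan format, not a spec.

```python
def areas_to_resume(plan_rows: list[dict]) -> list[str]:
    """Given parsed RESEARCH-PLAN.md area rows, return the areas still pending.

    Completed areas ("done") are never re-spawned; everything else is re-run.
    """
    return [row["area"] for row in plan_rows if row["status"] != "done"]

# With STACK done and ARCHITECTURE pending, only ARCHITECTURE is re-spawned:
rows = [
    {"area": "STACK", "status": "done"},
    {"area": "ARCHITECTURE", "status": "pending"},
]
# areas_to_resume(rows) → ["ARCHITECTURE"]
```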
| File | When to Load | Trigger |
|---|---|---|
| SKILL.md | Always | Skill invocation (via init, phase-research, brainstorm, or plan) |
| references/research-catalog.md | When planning research | Area selection: triggers, dependencies, brownfield, guardrails |
| references/research-areas.md | On demand | Deep research execution guidance: search strategies, templates, scope |
Rule: Light research (/st:plan) resolves with SKILL.md alone. Deep research (/st:phase-research, /st:init) loads references/research-catalog.md for area selection and references/research-areas.md for execution guidance.
Used by:
- /st:init — dynamic-wave research (areas selected from catalog, grouped by dependencies)
- /st:phase-research — dynamic research agents from catalog + synthesizer
- /st:brainstorm — 2-round inline research (broad → focused)
- /st:plan — optional focused research when AI recommends it

Skills that pair with research-methodology:
- superteam:project-awareness — provides framework detection for codebase-aware research
- superteam:wave-parallelism — parallel research agents follow the wave protocol (dynamic waves from catalog dependencies)
- superteam:verification — research findings verified before feeding into plans
- superteam:handoff-protocol — research state (sources, findings, conflicts) captured on pause

Agents:
- phase-researcher — spawned by phase-research and init. Each instance covers one research area. Follows this skill's methodology.
- research-synthesizer — spawned after all researchers complete. Follows the synthesis protocol from this skill.