Structured interview and project-spec workflow for new research ideas.
40:T16d2,
Formalize a research idea into a concrete project specification with testable hypotheses and empirical strategies.
Input: $ARGUMENTS — a topic, phenomenon, dataset, or "start fresh" for open-ended exploration.
This skill runs in three phases. Phase 1 is conversational — ask one or two questions at a time and wait for responses. Phases 2 and 3 run automatically after the interview.
Goal: Draw out the researcher's thinking and establish a clear research question.
Ask questions one or two at a time. Build on each answer before moving to the next phase. Use conversational prompts, not a separate question tool. A good interview runs 4–6 exchanges.
The Puzzle (start here):
Why It Matters:
Theoretical Motivation:
Data and Setting:
Identification:
Expected Results + Contribution:
Move to Phase 2 when you have:
If after 3 exchanges the user keeps giving vague answers, move to Phase 2 anyway and flag the open questions.
Goal: Generate 3–5 structured research questions covering the full range from descriptive to causal.
Announce the transition: "Great — I have enough to generate a structured set of research questions. Let me build that out now."
Then generate 3–5 research questions ordered by type:
| Type | What It Asks |
|---|---|
| Descriptive | What are the patterns? How has X evolved? |
| Correlational | What factors are associated with X, controlling for Z? |
| Causal | What is the causal effect of X on Y? |
| Mechanism | Through what channel does X affect Y? |
| Policy | Would intervention X improve outcome Y? |
For each RQ, develop:
Rank the questions by feasibility × contribution:
| RQ | Feasibility | Contribution | Priority |
|---|---|---|---|
| 1 | High | High | ★★★ |
| 2 | High | Medium | ★★ |
| ... | ... | ... | ... |
Produce the unified project spec document and save it.
Save to: quality_reports/project_spec_[sanitized_topic].md
# Research Project: [Working Title]
**Date:** [YYYY-MM-DD]
**Researcher:** [from PROJECT_CONTEXT.md if available]
---
## Research Question
[Single clear sentence]
## Motivation
[2–3 paragraphs: why this matters, theoretical context, policy relevance, what the answer would change]
## Research Questions
### RQ1: [Question] — Priority: ★★★ (Feasibility: High / Contribution: High)
**Type:** Causal
**Hypothesis:** [Testable prediction with expected sign]
**Identification Strategy:**
- **Method:** [e.g., Staggered DiD with Sun–Abraham estimator]
- **Treatment:** [What varies and when]
- **Control group:** [Comparison units]
- **Key assumption:** [e.g., Parallel pre-trends conditional on controls]
- **Robustness:** [Pre-trends test, placebo outcomes, alternative control groups]
**Data Requirements:**
- [Dataset or data type needed]
- [Key variables: treatment proxy, outcome, controls]
- [Time period and geography]
**Key Pitfalls:**
1. [Threat + mitigation]
2. [Threat + mitigation]
**Related Work:** [Author (Year)], [Author (Year)]
---
[Repeat for RQ2–RQ5]
---
## Priority Empirical Strategy
[1 paragraph recommending the single highest-priority RQ and why, with the specific identification approach]
## Open Questions
[Issues raised in the interview that need further thought before committing to a strategy]
---
## Suggested Next Steps
1. **`lit-review [topic]`** — Search the literature for related work and citation chains
2. **`data-finder [topic]`** — Find and assess datasets for the priority RQ
3. Once data is secured: **`data-analysis`** to begin analysis
Tell the user:
quality_reports/project_spec_[topic].mdlit-review [topic] to build the literature foundationdata-finder [topic] to identify and assess data sources