Use this skill when a user wants to create, design, build, develop, or draft a Target Trial Emulation (TTE) protocol or study. Trigger for any request involving structuring a causal inference study as a TTE — whether the user starts from an existing RCT they want to emulate, a PICO question, or a vague research idea about comparing treatments using observational data. Covers the full protocol: eligibility, treatment strategies, assignment, follow-up, outcomes, causal contrasts, statistical methods, and confounder identification. Do NOT trigger for reviewing an existing protocol (use /tte-assess), interpreting TTE results, checking data feasibility (use /tte-data), fixing analysis code, or building dashboards.
Guide researchers through designing a Target Trial Emulation protocol using the 5-step framework from Ren et al. 2026 (JAMA Network Open, 237-study survey).
This skill is conversational and interactive. Ask one question at a time. Prefer multiple choice when possible. Be transparent about confidence levels and flag when you're making suggestions versus stating evidence-based guidance.
Ask the user:
What are you starting with?
Entry point determines which steps below are pre-filled vs. walked through:
Skip or abbreviate if Entry A or B provides a clear causal question.
Help the user formulate an explicit causal question:
The question must be answerable in principle by a randomized trial. If it isn't, help the user refine until it is. A poorly defined causal question leads to ambiguous design choices downstream.
Read ../tte-foundation/references/framework.md for the full Step 1 guidance.
Present the three scenario types and help the user classify their study. Read ../tte-foundation/references/scenarios.md for the full definitions and sub-scenarios.
| Scenario | When It Applies |
|---|---|
| 1: Replication/Prediction | A relevant RCT exists with matching PICO. Goal is to reproduce, validate, or predict findings. |
| 2: Extension | Related RCTs exist but differ in population, comparator, outcome, or follow-up. Goal is to go beyond existing evidence. |
| 3: RCTs Infeasible | No relevant RCT exists or can be conducted due to ethical/logistical constraints. |
Use the decision logic from ../tte-foundation/references/scenarios.md:
Recommend a classification based on context, and ask the user to confirm. Also identify the specific sub-scenario (e.g., 1d, 2a, 2c) — this affects downstream design choices.
Walk through each element one question at a time. Partially pre-fill if Entry A or B provides elements.
The PICO must be precise enough to operationalize with real-world data.
The approach depends on the scenario classified in Step 2.
For all scenarios: Search for relevant RCTs. If the ClinicalTrials.gov MCP tools are available, use search_trials with the PICO elements to find candidate trials. Present key details (NCT ID, title, enrollment, status, primary endpoints) and let the user select which to reference.
Scenario 1 (Replication/Prediction):
Scenario 2 (Extension):
Scenario 3 (No RCT Feasible):
Walk through each of the 7 core methodologic components. Read ../tte-foundation/references/components.md for definitions, quality standards, and common pitfalls for each component.
For each component, explain what the paper recommends, ask the user for their choices, and flag potential issues.
Recommend methods based on treatment strategy type:
| Strategy Type | Recommended Methods |
|---|---|
| Point treatment | Propensity score methods (matching, IPTW), standardization |
| Sustained treatment | G-methods, marginal structural models, time-dependent PS |
| Cannot distinguish at baseline | Clone-censor-weight approach |
Describe the approach to emulate randomization and specify balance diagnostics (e.g., SMD < 0.1).
Help the user choose and justify the estimand:
| Contrast | Best For | Considerations |
|---|---|---|
| ITT | Treatment policy effects, point treatments | Robust to non-adherence; may be diluted |
| Per-protocol | Sustained treatment effects | Requires IPTW/IPCW; vulnerable to selection bias if naive |
| As-treated | Descriptive analysis of received treatment | Susceptible to confounding by indication; use cautiously |
Align methods with the causal contrast:
Specify:
This is a critical part of protocol development. Follow the 5-layer workflow from ../tte-foundation/references/confounders.md:
Layer 1 — RCT-Informed (high confidence): Pull baseline covariates from RCTs identified in Step 4. These are evidence-based confounders. Present them to the user: "Here are confounders from related trials: [list]. These form the backbone of the adjustment set."
Layer 2 — LLM-Suggested (medium confidence, with guardrails): Based on disease area and intervention, suggest additional candidate confounders. Each suggestion must:
Present grouped by category (demographic, clinical history, medications, lab values, socioeconomic, lifestyle).
Layer 3 — DAG Prompting (user-driven): Ask structured questions to elicit domain knowledge:
Organize into: treatment-outcome confounders, treatment-selection confounders, outcome-related prognostic factors.
Layer 4 — Classification (conceptual):
For each confounder: classify as measured / proxy available / unmeasured. This is conceptual — actual data verification is /tte-data's job.
Layer 5 — Sensitivity Analysis Plan: Based on Layer 4 classifications:
Document the sensitivity analysis plan as part of the statistical analysis section.
Prompt the user to consider and document three causal identification assumptions:
For each, discuss plausibility in the context of this specific study.
Generate a complete TTE protocol document with two sections:
Full protocol following the narrative format from ../tte-foundation/references/protocol-template.md, covering all 7 components plus the confounder plan and causal assumptions.
Compact two-column table (Target Trial Specification | Emulation Approach) following the tabular format from ../tte-foundation/references/protocol-template.md.
Save the protocol to the project directory as tte-protocol.md.
End with:
Run
/tte-assessto verify the methodologic quality of this protocol. Run/tte-datawhen ready to check data feasibility for implementation.
The following files in ../tte-foundation/references/ contain detailed guidance:
framework.md — 5-step design frameworkscenarios.md — 13 sub-scenarios across 3 types, with decision logiccomponents.md — 7 core components with quality standards and pitfall ratesconfounders.md — 5-layer confounder identification workflowprotocol-template.md — Narrative and tabular protocol templatesRead scenarios.md during Step 2. Read components.md during Step 5. Read confounders.md during Step 5.8. Read protocol-template.md when generating output.