Overview

A DAG (directed acyclic graph) is a causal diagram. Arrows represent hypothesized causal mechanisms, not correlations. The purpose is to make causal assumptions explicit before modeling: identify which variables to control for (confounders), which to leave alone (mediators, colliders), and which are unmeasured.

Pre-condition

Confirm that the exposure and outcome are already established in the conversation. If not, say so and stop — this process cannot start without them.

DAG-Building Process

Follow these steps in order. One step at a time — do not rush through them.

Step 1 — Draw the spine

"Let's build your causal diagram together. We know: [exposure] → [outcome]. That's our starting arrow.

Now, thinking about ALL relevant variables — both in your dataset and any you know about from epidemiology even if they're not measured — which ones do you think CAUSE or influence ?"
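As a rough illustration of how the spine could be recorded once the user confirms it, the sketch below stores the diagram as a list of (cause, effect) pairs and renders it as Mermaid flowchart text. The function name `to_mermaid` and the example variables are hypothetical and are not part of this process; only the `flowchart LR` and `-->` syntax comes from Mermaid itself.

```python
def to_mermaid(edges):
    """Render a list of (cause, effect) pairs as Mermaid flowchart text."""
    # Collect node names in first-seen order so the output is stable.
    nodes = []
    for cause, effect in edges:
        for name in (cause, effect):
            if name not in nodes:
                nodes.append(name)

    # Give each node a short id so labels may contain spaces.
    ids = {name: f"n{i}" for i, name in enumerate(nodes)}

    lines = ["flowchart LR"]
    for name in nodes:
        lines.append(f'    {ids[name]}["{name}"]')
    for cause, effect in edges:
        lines.append(f"    {ids[cause]} --> {ids[effect]}")
    return "\n".join(lines)


# Hypothetical exposure and outcome, used only for illustration.
spine = [("smoking", "lung cancer")]
print(to_mermaid(spine))
```

Each later step can then append pairs to the same list, so the diagram grows one arrow at a time.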

DAG Rules (teach as needed during the conversation)

Rule	Explanation
Arrows = causal mechanisms	Not correlations; only draw an arrow if [A] plausibly causes [B]
Time ordering	Causes must precede effects; don't draw arrows backward in time
Confounders	Common causes of exposure and outcome; control for these
Mediators	On the causal path from exposure to outcome; do NOT control when estimating the total effect (adjusting blocks part of the effect being studied)
Colliders	Common effects of exposure and outcome; do NOT control (conditioning on them opens a spurious path)
No cycles	If A → B, there can be no directed path from B back to A
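The rules above can be checked mechanically once the arrows are written down. The following is a minimal sketch, assuming the diagram is a list of (cause, effect) pairs; the helper names and the example variables are hypothetical, and the role labels are simplified heuristics rather than a full adjustment-set algorithm.

```python
from collections import defaultdict


def build_graph(edges):
    """Map each node to the set of nodes it points to."""
    children = defaultdict(set)
    for cause, effect in edges:
        children[cause].add(effect)
        children[effect]  # touch the key so every node is present
    return children


def descendants(children, node, skip=None):
    """All nodes reachable from `node`, never walking into `skip`."""
    seen = set()
    stack = [c for c in children[node] if c != skip]
    while stack:
        cur = stack.pop()
        if cur in seen:
            continue
        seen.add(cur)
        stack.extend(c for c in children[cur] if c != skip)
    return seen


def has_cycle(children):
    """A node that can reach itself means the graph is not acyclic."""
    return any(n in descendants(children, n) for n in list(children))


def classify(edges, exposure, outcome):
    """Label every other variable by its role relative to the spine."""
    children = build_graph(edges)
    if has_cycle(children):
        raise ValueError("graph has a cycle, so it is not a DAG")
    desc = {n: descendants(children, n) for n in list(children)}
    roles = {}
    for node in desc:
        if node in (exposure, outcome):
            continue
        causes_exposure = exposure in desc[node]
        # Reaches the outcome by a path that does not pass through the exposure.
        causes_outcome_directly = outcome in descendants(children, node, skip=exposure)
        if causes_exposure and causes_outcome_directly:
            roles[node] = "confounder"   # common cause: control
        elif node in desc[exposure] and outcome in desc[node]:
            roles[node] = "mediator"     # on the causal path: do not control
        elif node in desc[exposure] and node in desc[outcome]:
            roles[node] = "collider"     # common effect: do not control
        else:
            roles[node] = "other"
    return roles


# Hypothetical variables, used only for illustration.
edges = [
    ("smoking", "lung cancer"),
    ("age", "smoking"),
    ("age", "lung cancer"),
    ("smoking", "tar deposits"),
    ("tar deposits", "lung cancer"),
    ("smoking", "hospitalization"),
    ("lung cancer", "hospitalization"),
]
print(classify(edges, "smoking", "lung cancer"))
```

Time ordering is not checked here, since it depends on when each variable was measured rather than on the graph structure.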

Draw A Dag

Overview

Pre-condition

DAG-Building Process

Step 1 — Draw the spine

Step 2 — Identify confounders

Step 3 — Identify mediators

Step 4 — Identify colliders (brief)

Step 5 — Generate the mermaid diagram

Step 6 — Validate

DAG Rules (teach as needed during the conversation)

Common Mistakes
