Designs and runs DuneSQL regression queries that compare CI `test_schema` tables against production spells after pipeline-only dbt changes. Once the SQL is tailored to the feature branch and CI table name, it uses the Dune MCP to execute the queries and validate parity. Use for prod-vs-CI regression, lineage parity checks, or row/metric validation when data should not drift; invoke manually per branch, not on every edit.
CI table name: `test_schema.git_dunesql_<hash>_<schema>_<alias>`.

- **Hash:** the dbt workflows run on `pull_request` (e.g. `tokens.yml`). For those runs, `${{ github.sha }}` in `dbt_run.yml` is the merge commit GitHub builds for `refs/pull/<N>/merge`, not the PR branch head shown in the PR commits list. So the `origin/<branch>` tip (e.g. `4ed0409…`) will not match the CI hash (e.g. `8d61409…`). Reproduce it locally with `git fetch origin pull/<PR_NUMBER>/merge`, then `git rev-parse FETCH_HEAD | tr - _ | cut -c1-7`; or copy the hash from the CI output (workflow run / logs). Workflows triggered by `push` (if any) would use the pushed commit; then `origin/<branch>` matches.
- **Schema/alias:** `{custom_schema}_{alias}` from the model config (tokens.transfers → tokens_transfers). Confirm in the dbt run initial model(s) when unsure.

Spellbook note: see also `.cursor/skills/debug-ci/SKILL.md` for CI context. Never embed API keys in the skill.
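The table-name pieces above can be sketched as small helpers. These are illustrative, not part of the repo: `ci_table_name` and `merge_commit_short_hash` are hypothetical names, and the real hash should always be cross-checked against the workflow logs.

```python
import subprocess

def ci_table_name(git_hash: str, model_ref: str, schema_prefix: str = "test_schema") -> str:
    """Build the CI table name from a 7-char merge-commit hash and a dbt
    model reference like 'tokens.transfers' (-> tokens_transfers)."""
    schema_alias = model_ref.replace(".", "_")
    return f"{schema_prefix}.git_dunesql_{git_hash}_{schema_alias}"

def merge_commit_short_hash(pr_number: int) -> str:
    """Mirror of: git fetch origin pull/<N>/merge && git rev-parse FETCH_HEAD,
    truncated to 7 chars (requires a checkout with 'origin' configured)."""
    subprocess.run(["git", "fetch", "origin", f"pull/{pr_number}/merge"], check=True)
    full = subprocess.run(["git", "rev-parse", "FETCH_HEAD"],
                          check=True, capture_output=True, text=True).stdout.strip()
    return full[:7]
```

`ci_table_name("8d61409", "tokens.transfers")` yields the fully qualified CI table for the merge-commit example above.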
After the SQL is built from branch context (PR merge SHA or the correct `github.sha` source, plus `{schema}_{alias}` and filters), execute it through the Dune MCP tools:
Check `mcps/user-dune/tools/` (or the enabled Dune server's tool descriptors) and open the JSON for each tool you call so arguments match the contract.

- `createDuneQuery`: create a temporary query (`is_temp` defaults to true) with a clear name, the full DuneSQL query text, and an optional description. Capture the returned `query_id`.
- `executeQueryById`: run with that `query_id` (and `performance` if needed). Capture `execution_id` from the response.
- `getExecutionResults`: pass `executionId` (ULID), increase `timeout` for heavy scans, use `limit` for previews. Interpret `state`: COMPLETED → check `data.rows` and `resultMetadata.totalRowCount`; FAILED → use `errorMessage` / `errorMetadata` to fix the SQL and repeat.

Run exploratory SQL (min dates, raw counts) and the final diff query through the same pipeline. If MCP is unavailable, fall back to `python scripts/dune_query.py` (repo root, `DUNE_API_KEY` in `.env`).
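The execute-then-poll loop can be sketched without tying it to a specific client; `fetch_state` below stands in for whatever call returns a `getExecutionResults`-style dict (the function and parameter names are assumptions for illustration):

```python
import time

def wait_for_results(fetch_state, max_attempts: int = 30, delay_s: float = 2.0):
    """Poll a callable returning a getExecutionResults-style dict.
    Returns data.rows on COMPLETED; raises with errorMessage on FAILED."""
    for _ in range(max_attempts):
        resp = fetch_state()
        state = resp.get("state")
        if state == "COMPLETED":
            return resp["data"]["rows"]
        if state == "FAILED":
            raise RuntimeError(resp.get("errorMessage", "query failed"))
        time.sleep(delay_s)  # PENDING / EXECUTING: keep polling
    raise TimeoutError("execution did not finish in time")
```

The same loop serves exploratory queries and the final diff query; only the SQL changes.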
Partitioning: Dune tooling expects filters on partition columns (e.g. block_date) where applicable—keep regression windows as tight as the comparison allows.
Use `git diff`, `dbt compile`, `_schema.yml`, and the model SQL to list relevant columns (partition keys, `block_date`, `block_time`, `block_number`, metrics like `amount_usd`, etc.). When in doubt, inspect the Dune table schema or compiled SQL for CI and prod.

- **Core metrics:** `count(1)` (rows) and `sum(amount_usd)` (or the spell's primary USD column). That pair usually surfaces pipeline issues quickly. Extend or swap metrics from the schema (e.g. `sum(amount_raw)`, volume fields) per lineage; tokens-style spells are the template, not a universal rule.
- **Grain:** pick a grain (`group by` / join keys) that exists on both CI and prod. It is not always `blockchain`: common Spellbook grains include `blockchain`, `block_month`, `block_date`, `project`, or combinations (e.g. `blockchain + block_date`). Match what the spell actually keys on for the question you are answering.
- **Aggregate first:** start with aggregate grains (`block_month` / `block_date` slices) to confirm totals and distributions before joining on a row-level unique key (`unique_key`, `tx_hash + evt_index`, etc.). Cheap grain checks catch most pipeline regressions; unique-key diffs are for pinpointing survivors. Compare `count(1)` and other numeric columns that make sense from the schema.
- **Dynamic filters (required):** do not copy example `block_date` / `block_number` literals from docs or chat. Always run phase 1 on Dune (or MCP), read the result set, then substitute the returned bounds into every later query. Examples in reference.md use placeholders `<MIN_BLOCK_DATE>`, `<MIN_BLOCK_NUMBER>` for that reason.
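The core-metrics check can be generated mechanically once the metric column is known. A minimal sketch (`raw_totals_sql` is a hypothetical helper; run the output on both the CI and prod tables with the same `where`):

```python
def raw_totals_sql(table: str, metric: str, shared_where: str) -> str:
    """Raw totals for one side of the comparison: row count plus a sum
    over the spell's primary metric column, under the shared filters."""
    return (
        f"select count(1) as total_rows, sum({metric}) as total_metric\n"
        f"from {table}\n"
        f"where {shared_where}"
    )
```

Swap `metric` per lineage (e.g. `amount_raw` instead of `amount_usd`) without touching the rest of the query.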
1. Phase 1 (bounds): `select min(block_date) as min_block_date, min(block_number) as min_block_number from <ci_table> where block_date != current_date`.
2. Shared `where` fragment (same on all CTEs): `block_date != current_date and block_date >= date '<MIN_BLOCK_DATE>' and block_number >= <MIN_BLOCK_NUMBER>`; adjust or drop `block_number` if the model is not chain-scoped that way.
3. Raw totals: `count(1)` and `sum(<metric>)` on prod and CI with the identical `where`; row count and metric totals should match before trusting a grain inner join.
4. Grain aggregates: same `where`, same `group by` grain, same metrics (e.g. `count(1)`, `sum(amount_usd)`).
5. Diff: `inner join` on the full grain (all `group by` columns); use `full outer join` temporarily to inspect missing keys. Filter on `abs(diff_rows) > 0 or abs(diff_usd) > <tolerance>` (e.g. 5 for float noise) only after spot-checking unfiltered join output.

Build the shared `where` from phase 1 bounds (see workflow). Keep it on every CTE; join only on grain keys.
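Substituting the phase 1 bounds into the shared fragment can be sketched as follows (`shared_where` is a hypothetical helper; the bounds must come from an actual phase 1 run, never from example literals):

```python
from datetime import date
from typing import Optional

def shared_where(min_block_date: date, min_block_number: Optional[int] = None) -> str:
    """Render the shared where fragment from phase 1 bounds. The
    block_number predicate is dropped when the model is not scoped that way."""
    parts = [
        "block_date != current_date",
        f"block_date >= date '{min_block_date.isoformat()}'",
    ]
    if min_block_number is not None:
        parts.append(f"block_number >= {min_block_number}")
    return " and ".join(parts)
```

Paste the returned fragment verbatim into every CTE so CI and prod are filtered identically.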
| Output | Meaning |
|---|---|
| CI row count | count(1) on CI with full shared where. |
| Prod row count | Same on prod. |
| Row-count diff | prod - ci (expect 0 before grain join). |
| CI raw metric | e.g. sum(amount_usd) on CI with the same where (spell-dependent column). |
| Prod raw metric | Same on prod. |
| Raw metric abs diff | e.g. abs(prod_usd - ci_usd) (use tolerance for floats). |
| Inner-join grain count | count(1) from prod_agg inner join ci_agg without diff filter. |
| Inner-join metric sums | sum(prod grain total_usd) vs sum(ci grain total_usd) over that join (should match raw totals when every row maps to one grain and keys align). |
| Diff grain count | count(1) from the join with diff filter on rows / USD. 0 ⇒ pass. |
Optional: count(distinct …) per grain column on each side vs inner-join count.
See reference.md for phase 1 SQL, placeholder filters, grain diff query, and a rollup that returns counts + USD in one row.
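The pass/fail reading of the final row can be stated as a one-line check (a sketch; `regression_verdict` is an illustrative name, and `diff_rows` is the row list from the filtered diff query):

```python
def regression_verdict(diff_rows: list) -> str:
    """Interpret the filtered diff output: zero surviving grains means
    CI matches prod under the shared filters."""
    if not diff_rows:
        return "pass"
    return f"fail: {len(diff_rows)} diverging grain(s)"
```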
```sql
with ci as (
    select
        <grain_columns>
        , count(1) as total_rows
        , sum(<metric_column>) as total_metric
    from
        test_schema.git_dunesql_<GIT_HASH>_<schema>_<alias>
    where
        block_date != current_date
        and block_date >= date '<MIN_BLOCK_DATE>'
        and <optional_aligned_predicates_e_g_block_number>
    group by
        <grain_columns>
)
, prod as (
    select
        <grain_columns>
        , count(1) as total_rows
        , sum(<metric_column>) as total_metric
    from
        <prod_catalog>.<prod_schema>.<prod_alias>
    where
        block_date != current_date
        and block_date >= date '<MIN_BLOCK_DATE>'
        and <same_optional_predicates_as_ci>
    group by
        <grain_columns>
)
select
    <compare_columns>
from prod
inner join ci
    on <join_keys>
where
    (
        abs(prod.total_rows - ci.total_rows) > 0
        or abs(prod.total_metric - ci.total_metric) > <tolerance>
    )
order by
    1
```
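Filling the template from branch context can be automated; a minimal sketch, assuming a `grain_diff_sql` helper (hypothetical name) that takes the CI/prod tables, grain columns, metric, and the shared `where` fragment:

```python
def grain_diff_sql(ci_table: str, prod_table: str, grain: list,
                   metric: str, where: str, tolerance: float = 5) -> str:
    """Render the grain-diff template: aggregate both sides on the same
    grain, then keep only grains whose row or metric totals diverge."""
    g = ", ".join(grain)
    on = " and ".join(f"prod.{c} = ci.{c}" for c in grain)

    def agg(table: str) -> str:
        # one side of the comparison, aggregated on the shared grain
        return (f"select {g}, count(1) as total_rows, sum({metric}) as total_metric "
                f"from {table} where {where} group by {g}")

    return (
        f"with ci as ({agg(ci_table)})\n"
        f", prod as ({agg(prod_table)})\n"
        f"select {', '.join(f'prod.{c}' for c in grain)}\n"
        f", prod.total_rows - ci.total_rows as diff_rows\n"
        f", prod.total_metric - ci.total_metric as diff_metric\n"
        f"from prod inner join ci on {on}\n"
        f"where abs(prod.total_rows - ci.total_rows) > 0\n"
        f"   or abs(prod.total_metric - ci.total_metric) > {tolerance}\n"
        f"order by 1"
    )
```

Generating the SQL keeps the `where` fragment and grain columns identical across both CTEs, which is the property the whole comparison depends on.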
The same workflow applies in any dbt + Dune (or Trino) repo: adjust prod relation (catalog.schema.table), CI schema/table prefix, and column names. Copy this folder to ~/.cursor/skills/dbt-prod-ci-regression/ for a personal default.
See reference.md for a filled tokens.transfers example and exploratory snippets.