An L2 Truth-Execution skill. It doesn't run tests; it decides the order in which regression-test-runner (or a human operator) should run them, and it caps the plan to a time budget that matches the CI stage the user is in. Fast feedback is a product; exhausting the test suite on every commit isn't.

The risk model is deterministic and documented in references/risk-model.md. The mode budgets are in references/mode-budgets.md. Both files are load-bearing — a weak priority is how "the tests I should have run" becomes "the bug I shipped".

When You're Invoked

PIPELINE-2 step 1: before `regression-test-runner` on an incremental PR run. The plan produced here becomes the runner's `--scope=incremental` input.
PIPELINE-5 step 1: before the pre-release regression run. The plan is used as a scheduling hint so the highest-risk tests run earliest and a failure trips the gate before the long tail.
On demand, as `/vibeflow:test-priority-engine [--mode <m>] [--since <sha>]`.
From `regression-test-runner`, when it needs the ordering for its own Step 1 scope resolution.

| Input | Required | Notes |
| --- | --- | --- |
| Changed files list | yes | From `git diff --name-only <since>..HEAD` or an explicit file list. Empty list → "no risk signal"; the skill emits a full-suite plan sorted by priority-only fallback and WARNs. |
| `regression-baseline.json` | optional but preferred | Used for baseline fail counts + per-test duration + tags. Absent → cold-start fallback (see mode-budgets.md §4). |
| `scenario-set.md` | optional | Links tests to scenarios; scenarios carry `priority` which can override file-level priority. |
| `.vibeflow/artifacts/observability/flakiness.json` | optional | Latest `ob_track_flaky` output. When present, flake score feeds the risk model. |
| `codebase-intel` MCP | optional | `ci_dependency_graph` gives the transitive import graph so affected-set calculation isn't limited to directory proximity. |
| Mode | optional | One of `quick / smart / full`. Default derived from trigger (see §4). |
| Budgets override | optional | `--time-budget <seconds>` / `--count-budget <n>`; if present, always tightens the defaults, never loosens. |
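The two most mechanical rows of the input contract, the changed files list and the budgets override, can be sketched as follows. This is a minimal illustration, not the skill's actual implementation: the `Budget` type and the function names `changed_files` and `apply_overrides` are hypothetical, and the real default budgets live in references/mode-budgets.md, so the numbers in the usage note are placeholders only.

```python
import subprocess
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Budget:
    """Hypothetical container for a mode's limits (not defined by the skill)."""
    time_seconds: int
    count: int


def changed_files(since: str) -> list[str]:
    """Paths changed between `since` and HEAD.

    An empty result corresponds to the "no risk signal" case in the table.
    """
    result = subprocess.run(
        ["git", "diff", "--name-only", f"{since}..HEAD"],
        check=True,
        capture_output=True,
        text=True,
    )
    return [line for line in result.stdout.splitlines() if line.strip()]


def apply_overrides(
    default: Budget,
    time_budget: Optional[int] = None,
    count_budget: Optional[int] = None,
) -> Budget:
    """Apply --time-budget / --count-budget so they can only tighten.

    Taking the minimum of default and override means a larger override
    is silently ignored rather than loosening the plan.
    """
    time_s = default.time_seconds
    if time_budget is not None:
        time_s = min(time_s, time_budget)
    count = default.count
    if count_budget is not None:
        count = min(count, count_budget)
    return Budget(time_s, count)
```

With a placeholder default of `Budget(300, 200)`, passing `time_budget=120` yields `Budget(120, 200)`, while passing `time_budget=900` leaves the time limit at 300 because an override never loosens the default.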

| Trigger | Default mode |
| --- | --- |
| `pr` / `push` | `quick` |
| `release` | `smart` |
| `manual` | `smart` |
| PIPELINE-5 pre-release | `full` (regardless of trigger) |
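The trigger-to-mode defaults above amount to a small lookup. The sketch below is illustrative only: `resolve_mode` and its parameter names are hypothetical. It also makes two assumptions the table does not state: an explicitly supplied mode takes precedence over any default, and an unknown trigger is treated as an error.

```python
from typing import Optional

VALID_MODES = ("quick", "smart", "full")

# Defaults copied from the trigger table.
DEFAULT_MODE_BY_TRIGGER = {
    "pr": "quick",
    "push": "quick",
    "release": "smart",
    "manual": "smart",
}


def resolve_mode(
    trigger: str,
    explicit_mode: Optional[str] = None,
    pipeline5_pre_release: bool = False,
) -> str:
    """Pick the mode for a run.

    Assumption: an explicit mode wins over every default. The source
    only says the default is derived from the trigger.
    """
    if explicit_mode is not None:
        if explicit_mode not in VALID_MODES:
            raise ValueError(f"unknown mode: {explicit_mode!r}")
        return explicit_mode
    if pipeline5_pre_release:
        # PIPELINE-5 pre-release defaults to full regardless of trigger.
        return "full"
    try:
        return DEFAULT_MODE_BY_TRIGGER[trigger]
    except KeyError:
        # Assumption: fail loudly instead of guessing a mode.
        raise ValueError(f"unknown trigger: {trigger!r}") from None
```

For example, `resolve_mode("pr")` gives `"quick"`, and `resolve_mode("push", pipeline5_pre_release=True)` gives `"full"`.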

Test Priority Engine

When You're Invoked

Input Contract

Algorithm

Step 1 — Resolve the mode + budgets

Step 2 — Derive the affected set

Step 3 — Score every candidate

Step 4 — Enforce the P0 mandatory set

Step 5 — Budget-fit the non-P0 tail

Step 6 — Write the plan

Output Contract

`priority-plan.md`

Machine-readable counterpart

Gate Contract

Non-Goals

Downstream Dependencies
