Hypothesis-Driven Development — treat ADRs as the source of truth, not code. Form testable hypotheses, stage ADRs, implement to test them, validate against predictions, then accept or reject based on evidence. Prevents repeated mistakes by documenting failures. Enables agent-to-agent collaboration through explicit architectural reasoning. This is the canonical HDD entry point — it replaces the thin router at .claude/commands/hdd/hdd.md.
Run Hypothesis-Driven Development for: $ARGUMENTS
Code is an implementation artifact. ADRs are the source of truth.
Before writing any code, form testable hypotheses about what you expect to happen, document them in a staging ADR, implement the minimum needed to test them, validate whether reality matched predictions, then accept or reject based on evidence.
Parse $ARGUMENTS for:
- adr_number (required): The ADR number, e.g. 008
- title (required): Brief title, e.g. "Task Endpoint Implementation"
- --resume (optional): Resume an existing HDD session instead of starting fresh
- --phases=N-M (optional): Run only phases N through M. Used by /workflow to invoke Phases 1-2 (research + stage docs) as its Stage 2, then take back control for its own planning/implementation/review stages. Default: 1-5 (full lifecycle).

```
Research --> Stage Docs --> Implement --> Validate --> Decide
   |             |              |            |           |
checkpoint  checkpoint          |       checkpoint  checkpoint
   v             |              v            v           v
hypotheses  spec + ADR    code + tests   evidence  accept/reject
```
Each phase has a user checkpoint. You do NOT skip checkpoints.
When /workflow dispatches to /hdd at Stage 2, it passes --phases=1-2. This produces
the spec and staging ADR, then returns control to /workflow for its own planning,
implementation, and review stages. When invoked standalone (the default), /hdd runs
all 5 phases as a self-contained lifecycle.
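For example, the invocation modes look like this (the arguments shown are illustrative):

```
/hdd 008 "Task Endpoint Implementation"               # standalone: full 5-phase lifecycle
/hdd 008 "Task Endpoint Implementation" --phases=1-2  # dispatched by /workflow as its Stage 2
/hdd 008 "Task Endpoint Implementation" --resume      # pick up an interrupted session
```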
Check for existing HDD sessions:
```shell
bd list --label=hdd --type=epic --json
```
If an active session exists for this ADR number, warn and ask whether to resume or start fresh.
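A minimal sketch of that check, with the bd output stubbed in a variable for illustration (the real call is the bd list command above; the JSON shape — an array of objects with id and title — is an assumption to verify against your bd version):

```shell
# Stubbed `bd list --label=hdd --type=epic --json` output (assumed shape).
SESSIONS='[{"id":"bd-12","title":"ADR-008: Task Endpoint Implementation"},
 {"id":"bd-07","title":"ADR-005: Cache Layer"}]'

# Find an epic whose title starts with this ADR number.
EXISTING=$(printf '%s' "$SESSIONS" | jq -r \
  --arg adr "ADR-008" '.[] | select(.title | startswith($adr)) | .id')

if [ -n "$EXISTING" ]; then
  echo "Active HDD session found: $EXISTING (resume or start fresh?)"
fi
```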
Create an epic and phase tasks in beads:
```shell
EPIC_ID=$(bd create --title="ADR-${adr_number}: ${title}" \
  --description="HDD session — hypotheses to be defined during research" \
  --type=epic --priority=2 --json | jq -r '.id')
```
Create phase tasks with dependencies (Phase 2 depends on Phase 1, etc.):
If --phases=1-2, only create tasks for Phases 1 and 2.
Write state to .hdd/state.json:
```json
{
  "workflow": "hdd",
  "version": "2.1",
  "adr_number": "<number>",
  "title": "<title>",
  "phase": "research",
  "phases_requested": [1, 2, 3, 4, 5],
  "epicId": "<bead-id>",
  "phaseIds": {
    "research": "<id>",
    "stage": "<id>",
    "implement": "<id>",
    "validate": "<id>",
    "decide": "<id>"
  },
  "status": "in_progress",
  "artifacts": [],
  "hypotheses": [],
  "staging_adr_path": null,
  "spec_path": null,
  "open_risks": [],
  "reconciliation_flags": [],
  "updated_at": "<ISO timestamp>"
}
```
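Phase transitions can be applied to the state file atomically with jq; a minimal sketch (field names match the schema above; the initial file here is a stub):

```shell
# Create a minimal state file, then advance phase + timestamp atomically.
mkdir -p .hdd
printf '%s' '{"phase":"research","updated_at":null}' > .hdd/state.json

jq --arg phase "stage" --arg ts "$(date -u +%Y-%m-%dT%H:%M:%SZ)" \
   '.phase = $phase | .updated_at = $ts' .hdd/state.json > .hdd/state.json.tmp
mv .hdd/state.json.tmp .hdd/state.json

jq -r '.phase' .hdd/state.json
```

Writing to a temp file and renaming avoids a truncated state file if a crash lands mid-write.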
Show the session dashboard and begin Phase 1.
State file: .hdd/state.json

## Phase 1: Research

Goal: Understand the problem space and form testable hypotheses.
Dispatch one Explore-type sub-agent using the Agent tool:
Agent tool call:
subagent_type: Explore
prompt: |
You are the HDD Research agent for ADR-${adr_number}: ${title}.
## Task
Build context and form testable hypotheses for this architectural decision.
## What to read
- Accepted ADRs: .adr/accepted/ (constraints and patterns)
Also check docs/adr/ — legacy ADRs predate the .adr/ convention
- Rejected ADRs: .adr/rejected/ (prior failure modes)
Also check docs/adr/rejected/ — legacy location
- Active staging: .adr/staging/ (in-flight work)
- Specs: specs/ (implementation contracts)
## ADR Reconciliation
For each accepted ADR related to this domain, apply these 7 signals:
1. Context mismatch — ADR describes codebase state that no longer matches
2. Broken references — files/modules mentioned have moved or been deleted
3. Dependency drift — major version change in a dependency the ADR relied on
4. Contradicting direction — this new work conflicts with the ADR's decision
5. Unrealized consequences — ADR predicted outcomes that never materialized
6. Rejected alternative resurfacing — a rejected approach is being proposed again
7. Orphaned dependency chain — ADR depends on a retired/rejected ADR
If any signal fires, include it in your output with specific evidence.
## Hypotheses
Form SOFT hypotheses — Specific, Observable, Falsifiable, Testable.
Each hypothesis must name an exact behavior, metric, or outcome.
## Return format
```
RESEARCH SUMMARY
================
Domain: <what area of the codebase this touches>
EXISTING CONSTRAINTS
- <constraint from accepted ADR-NNN>: <what it means for this work>
PRIOR FAILURES
- <lesson from rejected ADR-NNN>: <what to avoid>
RECONCILIATION FLAGS
- ADR-NNN: <signal name> — <specific evidence>
Recommended disposition: STILL VALID | NEEDS AMENDMENT | SUPERSEDED | INVALIDATED
(or: No reconciliation flags.)
HYPOTHESES
H1: <specific testable claim>
Prediction: <exact observable outcome>
Validation: <how to test>
H2: ...
UNKNOWNS
- <anything that blocks staging>
OPEN RISKS
- <anything that could derail implementation>
```
Checkpoint: Present the research summary and draft hypotheses to the user. Wait for approval before advancing.
Gate: At least one SOFT hypothesis with evidence-backed context. User approved.
When a reconciliation signal fires, the research agent recommends one of four dispositions. The orchestrator presents these to the user during the checkpoint for a decision:

- **STILL VALID** — no action; the ADR stands as-is.
- **NEEDS AMENDMENT** — update the ADR's content during Phase 5.
- **SUPERSEDED** — move the ADR to .adr/retired/ with a forward reference during Phase 5.
- **INVALIDATED** — move the ADR to .adr/retired/ with an explanation. If the domain still needs a decision, the current HDD session serves as that replacement.

Do NOT act on dispositions during Phase 1. Record them. Execute them in Phase 5 alongside the primary ADR migration.
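A recorded flag could look like this inside the reconciliation_flags array of .hdd/state.json (the field names here are an assumed shape, not a fixed schema):

```json
{
  "adr": "ADR-004",
  "signal": "broken references",
  "evidence": "a module path cited in ADR-004 no longer exists",
  "recommended_disposition": "NEEDS AMENDMENT",
  "disposition_executed": false
}
```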
## Phase 2: Stage Docs

Goal: Document WHAT we're implementing (spec) and WHY (ADR) before writing code.
Dispatch one sub-agent:
Agent tool call:
prompt: |
You are the HDD Staging agent for ADR-${adr_number}: ${title}.
## Context
Research summary (from Phase 1):
<paste the full research output here>
Approved hypotheses:
<paste the user-approved hypotheses>
## Task
Create two documents:
### 1. Spec (WHAT)
Path: specs/${slug}.md
The spec describes the implementation contract. It answers: what does the
system look like after this work is done? Include:
- Data structures / types being added or changed
- API surface (new endpoints, tool operations, parameters)
- Behavior descriptions (what happens when X)
- Acceptance criteria (testable conditions for "done")
Do NOT include reasoning about WHY — that belongs in the ADR.
### 2. Staging ADR (WHY)
Path: .adr/staging/ADR-${adr_number}-${slug}.md
Sections:
- **Status**: Proposed
- **Context**: Problem, current state, constraints from research
- **Decision**: Chosen approach and why (reference alternatives considered)
- **Consequences**: Positive outcomes, tradeoffs, follow-ups
- **Hypotheses**: Each in SOFT format with validation plan (use format below)
- **Spec**: Link to specs/${slug}.md
- **Links**: Related ADRs, files, rejected approaches
### Hypothesis format in ADR
```markdown
### Hypothesis N: [Specific testable claim]
**Prediction**: [Exact observable outcome]
**Validation**: [Commands, observations, or measurements]
**Outcome**: PENDING
```
### Dependency/conflict notes
If any reconciliation flags were raised in Phase 1, note them in the ADR's
Context section with planned dispositions.
## Return format
```
STAGING SUMMARY
===============
Spec path: specs/${slug}.md
ADR path: .adr/staging/ADR-${adr_number}-${slug}.md
Hypotheses staged: N
Dependencies noted: [list or "none"]
Conflicts noted: [list or "none"]
```
After receiving the sub-agent output, update .hdd/state.json with staging_adr_path
and spec_path.
Checkpoint: Present both the spec and the staging ADR to the user. Wait for approval before implementation.
Gate: Spec exists in specs/. Staging ADR exists in .adr/staging/. All sections
complete. Each hypothesis is SOFT. User approved both documents.
If --phases=1-2: Stop here. Return the spec path, ADR path, and hypotheses to the
calling workflow. Update state to phase: "stage-complete".
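Both artifact paths share a ${slug}. Assuming the usual lowercase-hyphenated convention (not specified elsewhere in this document), it can be derived from the title like this:

```shell
title="Task Endpoint Implementation"
# Lowercase, squeeze runs of non-alphanumerics into single hyphens, trim edges.
slug=$(printf '%s' "$title" | tr '[:upper:]' '[:lower:]' \
  | tr -cs 'a-z0-9' '-' | sed 's/^-*//; s/-*$//')
echo "$slug"
```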
## Phase 3: Implement

Goal: Build the minimum code needed to test hypotheses.
Dispatch one or more sub-agents depending on implementation scope. For each sub-agent:
Agent tool call:
prompt: |
You are an HDD Implementation agent.
## Scope
ADR: <paste staging ADR path and content>
Spec: <paste spec path and content>
## Task
Implement the minimum changes required to exercise the hypotheses in the ADR.
Rules:
1. Read the spec for WHAT to build. Read the ADR for WHY.
2. Write tests tied to each hypothesis — tests validate the PREDICTION,
not just that code exists.
3. Run build and type checks after implementation.
4. Track any deviation from spec with justification.
5. Do NOT commit. Changes stay uncommitted until after validation.
## Return format
```
IMPLEMENTATION SUMMARY
======================
Files modified: [list with paths]
Files created: [list with paths]
Tests written: N
Tests passing: N/N
Build: pass | fail
Type check: pass | fail
HYPOTHESIS COVERAGE
H1: covered by test <test name>
H2: covered by test <test name>
...
DEVIATIONS FROM SPEC
- <deviation>: <justification>
(or: None.)
TEST TARGETS
Command: <exact test command>
Files: <test file paths>
```
After receiving summaries, update .hdd/state.json with artifacts and test targets.
No user checkpoint — advance directly to validation.
Gate: Implementation and hypothesis-linked tests exist. Build passes. Type checks pass.
## Phase 4: Validate

Goal: Test whether predictions match reality.
Dispatch one sub-agent:
Agent tool call:
prompt: |
You are an HDD Validation agent.
## Context
ADR: <staging ADR path and content>
Test targets: <from implementation summary>
## Task
For each hypothesis in the ADR:
1. Run the specific validation described in the hypothesis
2. Capture concrete evidence (test output, command output, observations)
3. Classify: VALIDATED | INVALIDATED | INCONCLUSIVE
Also run full quality checks:
- Build: npm run build (or equivalent)
- Tests: npm test (or equivalent)
- Linter: if configured
- Type checker: tsc --noEmit (or equivalent)
## Return format
```
VALIDATION REPORT
=================
Quality checks: build [pass|fail], tests [pass|fail], types [pass|fail]
HYPOTHESIS RESULTS
H1: "<claim>"
Prediction: <what we expected>
Evidence: <what actually happened — include command output>
Classification: VALIDATED | INVALIDATED | INCONCLUSIVE
Notes: <analysis of match/mismatch>
H2: ...
MANUAL VERIFICATION NEEDED
- <hypothesis or aspect that requires human testing>
(or: None — all hypotheses verified automatically.)
OVERALL: ALL VALIDATED | SOME INVALIDATED | MIXED
```
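The quality checks listed in the prompt can be collected without aborting on the first failure. A sketch with stand-in commands — true/false here replace the real npm and tsc invocations:

```shell
# Run a named check, report pass/fail, never abort the sweep.
run_check() {
  name=$1; shift
  if "$@" >/dev/null 2>&1; then echo "$name: pass"; else echo "$name: fail"; fi
}

run_check build true    # stand-in for: npm run build
run_check tests true    # stand-in for: npm test
run_check types true    # stand-in for: tsc --noEmit
run_check lint  false   # stand-in for a failing linter
```

Collecting every result lets the report show all failing gates at once instead of just the first.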
Checkpoint: Present the validation report to the user. For each hypothesis, show prediction vs. evidence. Ask the user to confirm each classification and to complete any items listed under MANUAL VERIFICATION NEEDED.
Gate: Every hypothesis has an explicit outcome. User confirmed results.
## Phase 5: Decide

Goal: Accept or reject the ADR based on validation evidence.

If ALL VALIDATED — accept:

1. Move the ADR: mv .adr/staging/ADR-NNN-*.md .adr/accepted/
2. Execute recorded reconciliation dispositions: superseded ADRs move to .adr/retired/ with a forward reference; invalidated ADRs move to .adr/retired/ with an explanation.
3. Keep the spec in specs/ (it now describes shipped behavior).
4. Commit:

   ```
   feat(<scope>): <description>

   ADR-NNN accepted — all hypotheses validated.
   ```

5. Update .hdd/state.json.

If SOME INVALIDATED — reject. Checkpoint: Present the rejection analysis. The user MUST approve the rejection. Then:

1. Move the ADR: mv .adr/staging/ADR-NNN-*.md .adr/rejected/
2. Trash specs/${slug}.md (it describes something we didn't build).
3. Revert uncommitted implementation: git checkout -- <files>
4. Commit:

   ```
   docs(adr): reject ADR-NNN — [hypothesis] invalidated

   [Brief explanation of what was learned]
   ```

5. Update .hdd/state.json.

If MIXED (some hypotheses validated and others didn't): present the split to the user and decide per hypothesis — for example, amend the ADR to narrow its scope to the validated hypotheses, or reject and re-stage with revised hypotheses.
Render after every phase transition:

```
HDD SESSION: ADR-<number> — <title>
Epic: <epicId> | Branch: <branch>
Phases: <1-5 or 1-2>

Phase              Status    Artifact
-----              ------    --------
1. Research        [status]  hypotheses: N, flags: N
2. Stage Docs      [status]  spec: specs/<slug>.md
                             adr: .adr/staging/ADR-NNN-*.md
3. Implementation  [status]  files: N, tests: N
4. Validation      [status]  validated: N, invalidated: N
5. Decision        [status]  accepted | rejected | pending

Open risks: <count>
Reconciliation: <count> flags, <count> pending dispositions
```
Status symbols: [ ] pending, [~] in progress, [x] complete, [!] blocked
Before leaving Phase 1, every hypothesis MUST pass the SOFT check: Specific, Observable, Falsifiable, Testable.
Reject hypotheses that are vague ("performance is good"), implementation-focused ("we will use class X"), or unfalsifiable ("code will be maintainable").
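A hypothesis that passes the check, written in the ADR format from Phase 2 (the endpoint and test names are illustrative, not from a real session):

```markdown
### Hypothesis 1: POST /tasks persists and returns the created task
**Prediction**: POST /tasks with {"title": "x"} returns HTTP 201 and a body
containing an id; a subsequent GET /tasks/<id> returns the same title.
**Validation**: integration test "creates and fetches a task", run via npm test.
**Outcome**: PENDING
```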
Persist .hdd/state.json and beads after each phase completes. Crashes should be recoverable.

The /hdd command router at .claude/commands/hdd/hdd.md is deprecated. The module briefs in .claude/commands/hdd/modules/ remain as reference material for sub-agent context.

Always use for: New features, architectural changes, protocol implementations, refactoring that changes behavior, performance optimizations with measurable goals.
Skip for: Trivial bug fixes, documentation-only changes, test-only additions, formatting/linting fixes.
```
.adr/
  staging/    # Work in progress — under validation
  accepted/   # Validated — source of truth
  rejected/   # Failed — documented lessons
  retired/    # Superseded — historical reference
.hdd/
  state.json  # Current session state (gitignored)
specs/        # Implementation specs (WHAT — paired with ADRs)
```
See also:

- docs/WORKFLOW-MASTER-DESCRIPTION.md
- .claude/commands/hdd/modules/ (sub-agent reference material)
- .claude/commands/hdd/state.md
- docs/adr/000-template.md (if it exists)