Design the runtime environment around AI agents — constraints, guardrails, verification, sandboxing, feedback loops, and recovery mechanisms that make agents reliable at scale.
define agent boundaries → design guardrails → configure sandboxing → build verification pipeline → implement feedback loops → add recovery mechanisms → set up session continuity → validate end-to-end
context management → what the agent sees (repo-local, versioned artifacts only)
permission boundary → what the agent can access (files, tools, commands)
execution sandbox → where the agent runs (isolated, rollback-capable)
validation pipeline → how outputs are verified (deterministic + semantic)
architectural guard → structural rules the agent must not violate
feedback loop → how failures inform next attempts
recovery mechanism → how the system handles and corrects errors
session continuity → how state bridges across context resets
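the permission boundary above can be sketched as a least-privilege path check — a minimal illustration with hypothetical names, not a full sandbox:

```python
from pathlib import Path

class PermissionBoundary:
    """Least-privilege file access: the agent may only touch allow-listed roots."""

    def __init__(self, allowed_roots: list[str]):
        self.roots = [Path(r).resolve() for r in allowed_roots]

    def can_access(self, target: str) -> bool:
        # Resolve first so ../ traversal cannot escape an allowed root.
        p = Path(target).resolve()
        return any(p == root or root in p.parents for root in self.roots)
```

everything outside the allow-list is denied by default — trust is granted, never assumed.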
deps flow: types → config → repository → service → runtime → presentation
structural tests validate layer compliance automatically
agent cannot bypass constraints — enforcement is mechanical, not instructional
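mechanical enforcement of the layer order can be as small as a structural test over the import graph — a sketch assuming import edges are extracted elsewhere (e.g. by static analysis):

```python
# Layer order from the dependency flow above: lower layers first.
LAYERS = ["types", "config", "repository", "service", "runtime", "presentation"]

def import_allowed(importer_layer: str, imported_layer: str) -> bool:
    # A module may only import from layers at or below its own position.
    return LAYERS.index(imported_layer) <= LAYERS.index(importer_layer)

def check_imports(edges: list[tuple[str, str]]) -> list[tuple[str, str]]:
    """Return the (importer, imported) edges that violate the layer order."""
    return [e for e in edges if not import_allowed(*e)]
```

run in CI, this fails the build on any upward import — the agent cannot talk its way past it.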
repository-centered → all agent-accessible knowledge lives in versioned artifacts
map over manual → concise pointers, not exhaustive instructions
scoped context → load only what the current task requires
knowledge promotion → convert external knowledge into a repo artifact before agent use
adaptive compaction → progressively reduce older observations to preserve recent context
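adaptive compaction can be sketched as: keep the newest observations verbatim, shrink older ones — here by truncation for illustration; a real harness would summarize with a model:

```python
def compact(observations: list[str], keep_recent: int = 3, summary_len: int = 40) -> list[str]:
    """Progressively reduce older observations to preserve recent context.

    Truncation stands in for model-based summarization in this sketch.
    """
    old, recent = observations[:-keep_recent], observations[-keep_recent:]
    summarized = [o[:summary_len] + "…" if len(o) > summary_len else o for o in old]
    return summarized + recent
```

the invariant to preserve: recent context stays lossless, older context degrades gracefully instead of being dropped wholesale.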
deterministic → static analysis, structural tests, schema validation
semantic → model-based correctness, intent alignment, output quality
composite → deterministic first (fast, cheap), semantic second (nuanced)
generator-evaluator → separate generation from evaluation in adversarial loop
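the composite ordering — deterministic first, semantic second — can be sketched as a short-circuiting pipeline; `is_valid_json` is a stand-in deterministic check, and the semantic evaluators would call a model in practice:

```python
import json
from typing import Callable

def is_valid_json(s: str) -> bool:
    """Deterministic check: fast, cheap, no model call."""
    try:
        json.loads(s)
        return True
    except ValueError:
        return False

def composite_validate(output: str,
                       deterministic: list[Callable[[str], bool]],
                       semantic: list[Callable[[str], bool]]) -> tuple[bool, str]:
    """Run cheap deterministic checks first; only if all pass,
    run the slower semantic evaluators."""
    for check in deterministic:
        if not check(output):
            return False, f"deterministic check failed: {check.__name__}"
    for check in semantic:
        if not check(output):
            return False, f"semantic check failed: {check.__name__}"
    return True, "ok"
```

failing fast on the deterministic tier keeps expensive semantic evaluation off the hot path.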
low risk → automated validation only, no human gate
medium risk → automated validation + async human review
high risk → halt until explicit human approval before execution
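the three tiers above reduce to a small execution gate — a sketch with hypothetical policy names:

```python
def may_execute(risk: str, approved: bool = False) -> bool:
    """High risk halts until explicit human approval; low and medium
    proceed to automated validation immediately."""
    if risk == "high":
        return approved
    return True

def review_mode(risk: str) -> str:
    """Human-review policy per tier (medium queues an async review)."""
    return {"low": "none", "medium": "async", "high": "blocking"}[risk]
```

the gate is checked before execution, not after — a high-risk action never runs on optimism.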
progress artifact → structured file tracking completed work and next steps
initializer session → first run sets up environment and initial state
incremental session → subsequent runs read progress, advance, update artifact
context reset → clear context between sessions to prevent drift
automatic compaction → compress history when context limit approaches
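initializer and incremental sessions can share one entry point keyed on whether the progress artifact exists — a minimal sketch where "doing the work" is elided:

```python
import json
from pathlib import Path

def run_session(progress_path: str, all_steps: list[str]) -> str:
    """Initializer session creates the artifact; incremental sessions
    read it, advance one step, and write it back. Returns the step
    completed this session (the actual work is omitted in this sketch)."""
    path = Path(progress_path)
    if not path.exists():                        # initializer session
        state = {"completed": [], "next": all_steps}
    else:                                        # incremental session
        state = json.loads(path.read_text())     # all_steps is ignored here
    if state["next"]:
        step = state["next"].pop(0)
        state["completed"].append(step)
    path.write_text(json.dumps(state))
    return state["completed"][-1] if state["completed"] else ""
```

because state lives in the versioned artifact rather than the context window, a context reset between calls loses nothing.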
single agent → one agent handles full task (prefer when sufficient)
plan-execute split → one agent plans, another executes (reduce reasoning load)
generator-evaluator → one agent produces, another critiques in adversarial loop
specialist routing → classify input, dispatch to domain-specific agent
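specialist routing is classify-then-dispatch; this sketch uses keyword matching where a real router would use a model, and the specialist names are hypothetical:

```python
from typing import Callable

def classify(task: str) -> str:
    """Toy classifier: keyword dispatch stands in for model-based routing."""
    if "SELECT" in task or "database" in task:
        return "sql_specialist"
    if "def " in task or "refactor" in task:
        return "code_specialist"
    return "generalist"

SPECIALISTS: dict[str, Callable[[str], str]] = {
    "sql_specialist":  lambda t: f"[sql] {t}",
    "code_specialist": lambda t: f"[code] {t}",
    "generalist":      lambda t: f"[gen] {t}",
}

def dispatch(task: str) -> str:
    return SPECIALISTS[classify(task)](task)
```

the generalist fallback matters: routing that can only fail loudly beats routing that silently misassigns.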
capture → log every tool call, decision, and intermediate output
analyze → cluster failure patterns across runs
detect → identify doom loops (repeated edits to same target without progress)
convert → turn recurring failure patterns into new constraints or middleware
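doom-loop detection over the captured log can be a simple counter of edits to the same target with no intervening progress signal — a sketch with an assumed event shape (`type`, `target` keys):

```python
from collections import Counter

def detect_doom_loop(tool_log: list[dict], threshold: int = 3) -> list[str]:
    """Flag targets edited `threshold` times without an intervening
    passing test — repeated edits to the same target without progress."""
    edits_since_pass: Counter = Counter()
    flagged = []
    for event in tool_log:
        if event["type"] == "edit":
            edits_since_pass[event["target"]] += 1
            if edits_since_pass[event["target"]] == threshold:
                flagged.append(event["target"])
        elif event["type"] == "test_pass":      # progress resets the count
            edits_since_pass.clear()
    return flagged
```

flagged targets feed the convert step: each becomes a candidate constraint or middleware rule rather than a one-off patch.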
each harness component encodes an assumption about model limitations
when models improve → re-test assumptions → strip what is no longer needed
prefer removing complexity over adding it — simplification beats sophistication
harness must remain model-agnostic — swappable without structural changes
prompt-only safety → relying on instructions instead of mechanical enforcement
unbounded context → dumping all docs into context instead of targeted selection
trust-by-default → granting full access instead of least-privilege boundaries
manual-only review → human review as sole quality gate instead of automated verification
fix-and-forget → patching individual failures instead of creating reusable constraints
optimistic self-eval → agent evaluating own work without adversarial challenge
model-coupled harness → harness assumptions hardcoded to specific model behavior
session amnesia → no structured handoff between context windows
complexity accumulation → adding harness components without re-testing necessity