Tenet Skill — Autonomous Development Loop Brain

Execute this file as an operational program. Be decisive, deterministic, and checkpoint-driven.

Core invariants (never violate)

Fresh session per job is default defense against compounding errors.
Context is always compiled per job (tenet_compile_context), never raw file dumps.
Generation and validation are separated (author flow vs critic flow).
Eval is a HARD BLOCKING GATE. A job that fails eval MUST be retried or blocked. You MUST NOT proceed to the next job saying "the next job will fix it" or "this is not blocking." If eval fails, the job is not done. Period.
Harness enforcement is mandatory in all modes.
Persistent human-readable state lives in .tenet/ markdown files.
Operational runtime state is MCP server SQLite; do not manually manage runtime IDs.
Use server-side continuation (tenet_continue()), not ad-hoc ID reconstruction.
Keep wrong turns in active job context (within that session) to prevent repetition.

Tenet Skill — Autonomous Development Loop Brain

Execute this file as an operational program. Be decisive, deterministic, and checkpoint-driven.

Core invariants (never violate)

Fresh session per job is default defense against compounding errors.
Context is always compiled per job (tenet_compile_context), never raw file dumps.
Generation and validation are separated (author flow vs critic flow).
Eval is a HARD BLOCKING GATE. A job that fails eval MUST be retried or blocked. You MUST NOT proceed to the next job saying "the next job will fix it" or "this is not blocking." If eval fails, the job is not done. Period.
Harness enforcement is mandatory in all modes.
Persistent human-readable state lives in .tenet/ markdown files.
Operational runtime state is MCP server SQLite; do not manually manage runtime IDs.
Use server-side continuation (tenet_continue()), not ad-hoc ID reconstruction.
Keep wrong turns in active job context (within that session) to prevent repetition.

# jobs_completed_since_last_health = 0 while True: # 1. Steering checkpoint steer = tenet_process_steer() IF steer.has_emergency: HALT — cancel active jobs, process emergency, wait for user IF steer.has_directive: apply directive (reorder queue, add/remove jobs, update spec) # 2. Get next job from server-managed DAG continuation = tenet_continue() IF continuation.all_done: BREAK — run complete IF continuation.all_blocked: BREAK — report blocked jobs, wait for user steer job = continuation.next_job # 3. Compile bootstrap context for this job compiled_context = tenet_compile_context(job_id=job.id) # 4. Dispatch registered job for execution run = tenet_start_job(job_id=job.id) # 5. Brief user and start background status check TELL USER: "Dispatched: {job.name}. I'll monitor in the background." TELL USER: "You can send messages or steer directives while this runs." check = BACKGROUND tenet_job_wait(job_id=run.job_id) # 6. When background check returns (instant — no blocking): # - If is_terminal=false: check steer, brief user, wait, then re-check # - If is_terminal=true: proceed to result collection # Wait strategy: start at 30s, increase by 1.5x each cycle, cap at 120s poll_delay = 30 WHILE check result is not terminal: result = COLLECT check tenet_process_steer() TELL USER: "{job.name}: {result.progress_line}" SLEEP poll_delay seconds poll_delay = min(poll_delay * 1.5, 120) check = BACKGROUND tenet_job_wait(job_id=run.job_id, cursor=result.cursor) # 7. Retrieve full output output = tenet_job_result(job_id=run.job_id) # 8. Dispatch evaluation (code critic + test critic + Playwright eval) eval = tenet_start_eval(job_id=job.id, output=output) # This dispatches THREE jobs: code_critic, test_critic, playwright_eval code_check = BACKGROUND tenet_job_wait(job_id=eval.code_critic_job_id) test_check = BACKGROUND tenet_job_wait(job_id=eval.test_critic_job_id) playwright_check = BACKGROUND tenet_job_wait(job_id=eval.playwright_eval_job_id) # Wait for all three eval jobs eval_delay = 30 WHILE any of (code_check, test_check, playwright_check) not terminal: SLEEP eval_delay seconds eval_delay = min(eval_delay * 1.5, 120) IF code_check not terminal: code_check = BACKGROUND tenet_job_wait(job_id=eval.code_critic_job_id) IF test_check not terminal: test_check = BACKGROUND tenet_job_wait(job_id=eval.test_critic_job_id) IF playwright_check not terminal: playwright_check = BACKGROUND tenet_job_wait(job_id=eval.playwright_eval_job_id) code_output = tenet_job_result(job_id=eval.code_critic_job_id) test_output = tenet_job_result(job_id=eval.test_critic_job_id) playwright_output = tenet_job_result(job_id=eval.playwright_eval_job_id) # 9. Act on eval results — ALL THREE must pass # ⛔ EVAL IS A HARD BLOCKING GATE — DO NOT PROCEED TO THE NEXT JOB IF EVAL FAILS # "The next job will fix it" is NEVER acceptable. Retry THIS job until it passes. IF code_output.passed AND test_output.passed AND playwright_output.passed: tenet_update_knowledge(type="journal", job_id=job.id, findings=output.findings) ELIF NOT playwright_output.passed: # Playwright e2e failed — actual app behavior is broken # Create fix job with the screenshots and findings as evidence create_fix_job(job, playwright_output.exploratory_findings) # DO NOT continue to next job — wait for fix job to complete, then re-eval ELIF NOT test_output.passed: # Test critic failed — tests are insufficient, create fix job to strengthen tests create_test_fix_job(job, test_output.missing_tests) # DO NOT continue to next job — wait for fix job to complete, then re-eval ELSE: # Code critic failed — retry the job (preferred) or create new job if approach is wrong tenet_retry_job(job_id=job.id) # preferred over creating new job # DO NOT continue to next job — wait for retry to complete, then re-eval # 11. Post-job steering checkpoint tenet_process_steer() # 12. Periodic health audit (every 3 completed jobs) jobs_completed_since_last_health += 1 IF jobs_completed_since_last_health >= 3: tenet_health_check() jobs_completed_since_last_health = 0

Tag	Meaning
`[implemented-and-tested]`	Code exists and passes tests
`[implemented-not-tested]`	Code exists but tests are missing or incomplete
`[decision-only]`	Agreed approach, not yet coded
`[scanned-not-verified]`	Extracted from existing code during brownfield scan, not validated

Tenet

Tenet Skill — Autonomous Development Loop Brain

Core invariants (never violate)

Tenet

Tenet Skill — Autonomous Development Loop Brain

Core invariants (never violate)

Boot sequence (must run on skill load)

Scale-adaptive mode selection

Signals

Mode rules

Full mode (default for significant work)

Standard mode

Quick mode

Full-mode crystallization phase

A) Interview protocol (includes clarity gate)

B) Visual artifact generation

C) Scenario + anti-scenario criteria

D) Pre-spec research (mandatory)

E) Spec + Harness generation

E.5) Implementation readiness gate (hard block before decomposition)

F) DAG decomposition

Standard-mode prep

Quick-mode prep

YOLO mode (upfront phases only)

Pre-execution confirmation gate

Core autonomous loop (all modes)

Bootstrap compiler contract

Evaluation pipeline (6 stages)

Stage 1 — Mechanical

Stage 1.5 — Smoke Check (mandatory for dev jobs)

Stage 2 — Property-based

Stage 3 — Code critic (independent context)

Stage 4 — Test critic (independent context)

Stage 5 — Playwright E2E (independent context, two layers)

Knowledge vs Journal

Knowledge (.tenet/knowledge/)

Journal (.tenet/journal/)

Cascade checks (three types)

Type 1 — Document-to-document alignment

Type 2 — Code-to-document alignment

Type 3 — Trajectory-to-purpose alignment (drift detection)

Eval failure handling

Stagnation detection and persona rotation

Async user steering protocol

Creating steer messages

Processing steer messages

Lifecycle tracking

Source priority

Job-targeted steer

MCP server recovery

Safety and resilience gates

State management contract

Agent routing and runtime adjustments

Health and status cadence

Termination conditions

Prose

Coding Agent (bash-first)

Create Prompt

Strategic Compact

Strategic Compact

Strategic Compact

Knowledge (`.tenet/knowledge/`)

Journal (`.tenet/journal/`)