Testing Guidelines

Fail fast: Write test first, expect it to fail.
Fix bug: reproduce the bug first, then fix the root cause.

Core Principle

Tests are living documentation. They describe what the product does, not how the code is wired internally. A test that breaks on a refactor (while behavior is unchanged) is a bad test -- it was coupled to implementation, not behavior.

Behavior Over Implementation

Test observable behavior from the user/caller's perspective. Do not test internal mechanics.

Ask before writing any test:

Does this test describe a product behavior, or does it just mirror the code structure?
If I refactor the internals without changing what the user sees, does this test survive?
Can a human read the test name and understand what requirement it verifies?

Test Type	Example	Verdict
Behavior	"rejected tool call shows error in red"	Good -- tests what user sees
Behavior	"CJK character at buffer edge stays in bounds"	Good -- tests an invariant

Testing Guidelines

Fail fast: Write test first, expect it to fail.
Fix bug: reproduce the bug first, then fix the root cause.

Core Principle

Behavior Over Implementation

Test observable behavior from the user/caller's perspective. Do not test internal mechanics.

Ask before writing any test:

Does this test describe a product behavior, or does it just mirror the code structure?
If I refactor the internals without changing what the user sees, does this test survive?
Can a human read the test name and understand what requirement it verifies?

Test Type	Example	Verdict
Behavior	"rejected tool call shows error in red"	Good -- tests what user sees
Behavior	"CJK character at buffer edge stays in bounds"	Good -- tests an invariant

Tdd

Testing Guidelines

Core Principle

Behavior Over Implementation

Tdd

Testing Guidelines

Core Principle

Behavior Over Implementation

When Tests Fail - Analysis Framework

Modification Guidelines

Pre-Modification Checklist

Red Flags - Stop and Reconsider

Agent Workflow Verification

Github

Openclaw Parallels Smoke

Feature Flags

Test

Azure Pipelines

Update Screenshots