Enforce evidence-first discipline in all reports, plans, and diagnostic documents. No unverified claims.
Every factual claim in a report, plan, or diagnostic document MUST have a corresponding evidence source collected BEFORE the claim is written.
Violation of this rule produces garbage reports that waste the user's time and destroy trust.
| Claim Type | Required Evidence | NOT Acceptable |
|---|---|---|
| Runtime state (process running, port open, env var value) | ps aux, docker inspect, env, curl output | Reading source code and guessing what runs |
| Database content (row exists, column value) | psql / API query result | Inferring from code that writes to the table |
| Configuration (which file sets a value) | grep the actual file + confirm via docker inspect or process cmdline | Seeing one config file and assuming it's the only one |
| Code behavior (function does X) | Direct code citation with file path and line number | Paraphrasing from memory |
| Log evidence (event happened at time T) | docker compose logs grep output | "It probably logged something" |
| Connection state (WS connected, API reachable) | curl / ps aux / backend log grep | UI screenshot alone (could be cached/stale) |
| Network identity (IP belongs to X) | host <IP> or nslookup reverse DNS output | Guessing from IP range ownership |
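The evidence classes in the table can be collected with a short pre-claim script. A minimal sketch, assuming a hypothetical process name (`nosuchprocxyz`) and container name (`mycontainer`), with `docker` guarded since it is not installed everywhere:

```shell
# Hypothetical process name — substitute the one under investigation.
PROC="nosuchprocxyz"

# Runtime state: capture the raw ps output, even if empty, and paste it.
ps_out=$(ps aux | grep "$PROC" | grep -v grep || true)
echo "ps evidence: ${ps_out:-<empty — no $PROC process found>}"

# Configuration: only meaningful where docker is installed; guard it.
if command -v docker >/dev/null 2>&1; then
  docker inspect mycontainer 2>/dev/null || echo "container 'mycontainer' not found"
else
  echo "docker not available on this machine"
fi
```

The point is that the pasted output (even an empty one) is the evidence; the script itself proves nothing until its output is in the report.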
WRONG: "The function falls back to the first workspace because the resolver has a fallback path."
This reads code logic and assumes runtime behavior. The actual DB table was never queried.
RIGHT: Query the table first, then write the claim with evidence inline.
WRONG: "Setting X is configured in Dockerfile."
Only one config file was checked. Docker Compose merges multiple layers (Dockerfile, docker-compose.yml, overrides, .env). The actual process CMD was never verified.
RIGHT: Run docker inspect <container> | jq '.[0].Config.Cmd' to see the actual CMD, then trace back to the source file.
WRONG: "Process X is not running" (without running pgrep or ps aux)
RIGHT: Run ps aux | grep <process>, paste the output (even if empty), then write the claim.
WRONG: "Client connects from <IP> which belongs to <cloud provider>, therefore it's a cloud-hosted service."
This infers the client's identity from IP range ownership alone. The IP could be a CDN, a reverse proxy, a tunnel endpoint, or anything else.
RIGHT: Run host <IP> to get reverse DNS, AND ps aux | grep <relevant_process> on all candidate machines to find where the process actually runs. Only then identify the client.
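A minimal sketch of the reverse-DNS half of that check, using the loopback address as a stand-in for the unknown client IP; `host` and `nslookup` are guarded because neither is guaranteed to be installed:

```shell
IP="127.0.0.1"  # stand-in for the unknown client IP

if command -v host >/dev/null 2>&1; then
  rdns=$(host "$IP" || true)
elif command -v nslookup >/dev/null 2>&1; then
  rdns=$(nslookup "$IP" || true)
else
  rdns=$(getent hosts "$IP" || true)   # fallback: local resolver database
fi
echo "reverse DNS evidence: ${rdns:-<lookup unavailable>}"

# Reverse DNS alone is NOT enough — still run ps aux on each candidate machine.
```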
WRONG: "Env var X is not set in the container, so the subprocess can't reach the backend."
Investigated the Docker container's env vars and network, but the actual process runs on the HOST machine. Never ran ps aux on the host to verify where the process executes.
RIGHT: Before investigating ANY container internals, first determine WHERE the process runs:
```shell
# ALWAYS run on the HOST first
ps aux | grep <process_name>
# If found on host: investigate the host environment
# If NOT found on host: THEN check containers
```
WRONG: "Field X is empty in the DB, so runtime event Y did not happen."
Interpreted a DB field value without first checking the code that writes it. The writer function may never populate that field — the empty value could be a default added downstream by a different layer.
RIGHT: Before interpreting any DB field value as evidence, trace the code path that writes it: find every writer of the field, read those functions, and confirm whether the field is ever populated before treating its value as meaningful.
WRONG: "Root cause: env var X defaults to wrong value. Fix: set it in the container."
Declared root cause after checking ONE config layer (container env vars) without checking other layers (settings files, process environment, CLI flags) that may already set the correct value. Also did not verify that the claimed cause actually produces the observed symptom.
RIGHT: A root cause declaration requires: (1) checking every layer that can set the value (settings files, process environment, CLI flags, container env vars), and (2) verifying that the claimed cause actually produces the observed symptom.
WRONG: Investigated container internals → container networking → container env vars. Concluded a subprocess can't reach a service from inside Docker.
The actual execution path ran entirely on the host machine — Docker was never in the execution path. The subprocess chain (client → executor → bridge → CLI → MCP server) all ran on the host.
RIGHT: Before investigating any specific hop, trace the FULL execution path first:
FOR a task execution investigation:
1. ps aux | grep <process> on HOST — find where execution starts
2. Read the process's env vars (ps eww -p <PID>)
3. Read the code that spawns the next subprocess — find the command + env
4. For each subprocess in the chain, verify: what binary, what cwd, what env, what config files
5. Only investigate network/connectivity AFTER you know which machine the process runs on
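Steps 1–2 of that checklist can be sketched as a small helper that locates a process on the host and dumps its environment. The process name is a placeholder, and the `/proc/<PID>/environ` read is Linux-specific (`ps eww -p <PID>` is the portable alternative mentioned above):

```shell
# Step 1: locate the process on the HOST; step 2: read its env.
# "nosuchprocxyz" is a placeholder name; substitute the real executor.
trace_process() {
  pid=$(pgrep -x "$1" | head -n 1 || true)
  if [ -z "$pid" ]; then
    echo "not-on-host"   # step 1 failed here → check containers next
    return 0
  fi
  echo "host pid: $pid"
  # Step 2: environment of the running process (Linux /proc interface)
  tr '\0' '\n' < "/proc/$pid/environ" | head -n 5
}
trace_process "nosuchprocxyz"
```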
WRONG: "See foo.py:108-116 for the fallback logic." (line range written from memory, never re-read)
RIGHT: Before citing any line number, view_file that range and verify the content matches your claim.
WRONG: "Validation will use data from index X." (never checked what index X actually contains)
RIGHT: Before writing any design that references a data source, verify: query the source, inspect what index X actually contains, and confirm the fields the design depends on are present and populated.
WRONG: "Model has field X — we can use it to build mappings." (field exists on the schema but is never populated)
RIGHT: A model field definition only proves the schema exists. To claim the data is usable:
Query actual rows and confirm the field holds real values. If the field is declared with `default_factory=list` or `default=None`, assume empty unless proven otherwise.
WRONG: "Function X is dead code — no callers found." (searched only one subdirectory)
RIGHT: When claiming "X is not called anywhere":
Search the ENTIRE repository, not a single subdirectory, and state the searched scope in the claim, e.g. "grep across backend/app/ — 0 callers".
WRONG: "Function returns N items." (number inferred from code, never verified at runtime)
RIGHT: Any specific runtime quantity (row counts, list sizes, process counts) requires a runtime verification command. Run the query and paste the output before citing the number.
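A sketch of sourcing a quantity from real output rather than from reading code; the `workspaces` table name is hypothetical, and `psql` is guarded since the DB client may live on another machine:

```shell
# Generic principle: derive the number from real output, never from code reading.
proc_count=$(ps aux 2>/dev/null | wc -l)
echo "process count evidence: $proc_count lines of ps output"

# Row counts the same way — guarded, since psql may not be installed here.
# "workspaces" is a hypothetical table name.
if command -v psql >/dev/null 2>&1; then
  psql -tAc "SELECT count(*) FROM workspaces;" || echo "query failed — paste the error"
else
  echo "psql not available — run the count on the DB host and paste the output"
fi
```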
When writing any report, plan, or investigation document:
FOR EACH factual claim you are about to write:
1. STOP writing
2. Run the verification command (DB query, grep, ps, curl, docker inspect, etc.)
3. Read the output
4. Write the claim WITH the evidence inline or as a citation
5. If the evidence contradicts your expectation, UPDATE your understanding — do NOT ignore it
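Steps 2–4 of this loop can be mechanized: run the verification command, capture its output, and emit a ready-to-paste evidence block. A minimal sketch (the `evidence` helper is hypothetical, not part of any tool):

```shell
# Run a verification command and print a ready-to-paste evidence block.
evidence() {
  cmd="$*"
  out=$(sh -c "$cmd" 2>&1 || true)
  echo "> **Evidence**: \`$cmd\`"
  echo '> ```'
  printf '%s\n' "$out" | sed 's/^/> /'
  echo '> ```'
}
evidence "echo 42"
```

For `evidence "echo 42"` this prints the quoted command line, then the fenced output `> 42`, so the claim and its evidence land in the document together.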
When investigating "why does X not work at runtime":
1. LOCATE the process: ps aux | grep <name> on HOST first, then containers
2. READ its env: ps eww -p <PID> | grep <VAR> (or /proc/<PID>/environ on Linux)
3. READ its config files: find the settings file the process actually reads
4. TEST connectivity FROM the correct machine: curl from where the process runs
5. ONLY THEN form a hypothesis and verify it
NEVER skip step 1. Investigating the wrong machine wastes all subsequent effort.
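Steps 1 and 4 can be sketched together: establish the location first, and only then test connectivity from it. The process name `myworker` and the health URL are placeholders, and `curl` is guarded:

```shell
# Step 1 of the checklist: locate the process on the HOST before anything else.
# "myworker" is a placeholder process name.
if pgrep -x "myworker" >/dev/null 2>&1; then
  location="host"
else
  location="container-or-elsewhere"
fi
echo "step 1 — process location: $location"

# Step 4 only makes sense once the location is known; guard curl.
if [ "$location" = "host" ] && command -v curl >/dev/null 2>&1; then
  curl -s --max-time 2 "http://localhost:8000/health" || echo "unreachable"
fi
```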
Use one of these formats in the document:
Inline evidence block:
> **Evidence**: `<command that was run>`
> ```
> <actual output pasted here>
> ```
Code citation:
> **Evidence**: [filename.py:L120-L129](file:///path/to/file#L120-L129)
> ```python
> def relevant_function(...):
>     ...
> ```
Before delivering any report or plan to the user, verify:
- Every runtime-state claim has `ps aux` or `pgrep` output
- Every configuration claim is confirmed at the effective layer (`docker inspect`, env dump, process cmdline)
- Every code citation was re-read with `view_file` to confirm the content matches the claim
- Process location was established with `ps aux` BEFORE investigating env/config/network

After applying a fix, verify at EVERY layer in the execution path, not just the layer you changed:
FOR a fix that changes tool/service availability:
1. UNIT: The function you changed now returns the expected output
→ Run the function directly in the same import context as production
2. API: The API endpoint returns the corrected response
→ curl/POST the endpoint and confirm the change is reflected
3. CONSUMER: The consumer of the API (e.g., MCP gateway, CLI) receives the corrected data
→ Check the consumer's tool list or config
4. END-TO-END: The original symptom is resolved
→ Trigger the same user-facing action that was failing
Do NOT declare a fix verified after checking only one layer. A fix at the function level may not propagate if there is caching, a stale process, or a different code path at the API layer.
| Fixed Layer | Often Missed |
|---|---|
| Python function | API serves from a different worker with stale imports |
| Backend API | MCP gateway caches tool list from previous startup |
| Config file | Process needs restart to pick up new config |
| Docker image | Container uses mounted volume that overrides the image |
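The four-layer check can be sketched as a script that records which layers were actually verified. The changed function, endpoint, and URL are placeholders; the API probe is guarded because the service may not be reachable from where the script runs:

```shell
# Layered fix verification — collect which layers were actually checked.
layer_results=""

# 1. UNIT: call the changed function directly (placeholder shell function).
changed_function() { echo "fixed-output"; }
[ "$(changed_function)" = "fixed-output" ] && layer_results="$layer_results unit:ok"

# 2. API: hit the endpoint; guarded, since curl/the service may be absent here.
if command -v curl >/dev/null 2>&1; then
  curl -s --max-time 2 "http://localhost:8000/api/tools" >/dev/null 2>&1 \
    && layer_results="$layer_results api:ok" \
    || layer_results="$layer_results api:unverified"
fi

echo "layers checked:$layer_results"
# 3–4. CONSUMER and END-TO-END follow the same pattern on their own hosts;
# a fix is verified only when every layer reports ok.
```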
This skill was created after reports contained multiple unverified claims that wasted the user's time and destroyed trust.
Three errors from a single diagnostic report:
Among them: a claim that a process was not running, written without `ps aux` output.

Root cause: writing conclusions before collecting evidence.
Five errors from a tool availability investigation:
Among them: a client identified from IP range ownership without reverse DNS (`host` or `nslookup`), and container internals investigated before locating the process (`ps aux` on host).

Root cause: investigating the wrong machine because `ps aux` on the host was never run first.
Five errors from one analysis report: unchecked line citations, design referencing unverified data source, schema field assumed to have data, narrow grep declaring dead code, runtime quantity from memory.
Root cause: writing claims before collecting evidence, and insufficient verification scope for negation claims.