Purpose

Evaluates a completed first-principles thinking (FPT) cycle in two phases: a structured self-evaluation that scores each artifact against its quality criteria, and a human evaluation that captures implementation outcomes and narrative feedback. Both phases produce artifacts in the project directory and append a structured entry to first-principals-thinking/evals/eval-log.json, a running JSON record that enables pattern analysis across projects over time.

This skill turns individual FPT cycles into a learning system.
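
The append to the eval log can be sketched as follows. This is a minimal illustration, not the skill's actual implementation: it assumes the log is a top-level JSON array, since the schema of eval-log.json is not given here. The function name `append_eval_entry` and the entry fields in the usage note are hypothetical. Treating a missing log file as an empty list is also an assumption.

```python
import json
from pathlib import Path


def append_eval_entry(log_path: Path, entry: dict) -> list:
    """Append one entry to a JSON log file and return the updated list.

    Assumption: the log holds a top-level JSON array. The real
    eval-log.json schema is not specified in this document.
    """
    if log_path.exists():
        entries = json.loads(log_path.read_text(encoding="utf-8"))
    else:
        # Assumption: a missing log is treated as an empty one.
        entries = []
    if not isinstance(entries, list):
        raise ValueError(f"{log_path} does not contain a JSON array")
    entries.append(entry)
    log_path.parent.mkdir(parents=True, exist_ok=True)
    log_path.write_text(json.dumps(entries, indent=2) + "\n", encoding="utf-8")
    return entries
```

A call might look like `append_eval_entry(Path("first-principals-thinking/evals/eval-log.json"), {"project": "demo", "phase": "self-eval"})`, where both keys are placeholders rather than the skill's real fields.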

Inputs

Required:
- A project name that matches an existing directory at first-principals-thinking/projects/{project-name}/
- That directory should contain the six artifacts from /fpt-decompose and /fpt-reconstruct: 01-problem-statement.md through 06-decision-brief.md
- The existing first-principals-thinking/evals/eval-log.json

Optional:
- The user's stated assessment of how implementation went (for the human-eval phase)
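
A pre-flight check on these inputs could look like the sketch below. Only the first and last artifact names are given above, so the check matches on the numeric prefixes 01- through 06- and does not guess the names in between. The helper name `find_missing_artifacts` is hypothetical.

```python
from pathlib import Path

# Prefixes "01-" through "06-"; the full filenames of artifacts 02-05
# are not listed in this document, so only the prefix is checked.
EXPECTED_PREFIXES = [f"{n:02d}-" for n in range(1, 7)]


def find_missing_artifacts(project_dir: Path) -> list:
    """Return the numeric prefixes that have no matching .md file."""
    if not project_dir.is_dir():
        raise FileNotFoundError(f"project directory not found: {project_dir}")
    missing = []
    for prefix in EXPECTED_PREFIXES:
        # glob() yields lazily; any() stops at the first match.
        if not any(project_dir.glob(f"{prefix}*.md")):
            missing.append(prefix)
    return missing
```

An empty return value means all six numbered artifacts are present. A non-empty list names the prefixes to report back to the user.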

Fpt Evaluate

Steps

1. Load Artifacts and Determine Eval Phase

2. Phase 1 — Self-Evaluation

3. Phase 2 — Human Evaluation

4. Update the Eval Log

5. Pattern Analysis (If 3+ Entries Exist)
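
The gate in this step's title can be sketched as a simple count check. It assumes the log entries have already been loaded into a list. The names `MIN_ENTRIES_FOR_PATTERNS` and `should_run_pattern_analysis` are hypothetical, and what the analysis itself computes is not specified here.

```python
# Threshold taken from the step title: "If 3+ Entries Exist".
MIN_ENTRIES_FOR_PATTERNS = 3


def should_run_pattern_analysis(entries: list) -> bool:
    """Return True once the eval log holds enough entries to compare."""
    return len(entries) >= MIN_ENTRIES_FOR_PATTERNS
```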

Output Format

Error Handling

Do Not
