Name: Debug Agent Tests
Author: echomodel

Debug Agent Tests | Skills Pool

./agent test <agent-name> -k <test_name> --debug

./agent test <agent-name> --debug

./agent test <agent-name> -n 5

tail -f /tmp/privacy-guard-tests/*.log

Check the harness log:

cat /tmp/privacy-guard-tests/<repo-name>.log

Is structured JSON present? Look for the privacy-guard-result fenced block in the raw output. If missing, the agent didn't follow its output instructions.
JSON present but wrong findings? Compare matched_value and category fields against what was planted in the test fixture (see conftest.py for fixture definitions).
Agent doing unexpected things? Check the Claude debug log:
```
cat /tmp/privacy-guard-tests/<repo-name>.claude-debug.log
```
Look for: tool calls the agent made, whether it read files it shouldn't have, whether it chained commands with &&.
Fix the agent .md or the test fixture, not both at once.

Agent	Test dir	What it tests
`privacy-guard`	`tests/integration/privacy_guard/`	Pre-push scope: staged diffs, unstaged diffs, unpushed commits
`privacy-audit`	`tests/integration/privacy_audit/`	Full audit: git history, pattern detection across files and commits

Agent	Test dir	What it tests
`privacy-guard`	`tests/integration/privacy_guard/`	Pre-push scope: staged diffs, unstaged diffs, unpushed commits
`privacy-audit`	`tests/integration/privacy_audit/`	Full audit: git history, pattern detection across files and commits

File	Contents
`<repo-name>.log`	Test harness log: commands run, raw agent output, parsed JSON
`<repo-name>.claude-debug.log`	Claude internals: tool calls, model responses

Debug Agent Tests

Debug Agent Integration Tests