/audit-bug — 未発見バグの検出 & Issue 起票

Ship チャットログ、ソースコード、E2E テスト実行を 3 つの Dispatch で並行分析し、未発見のバグを検出して issue を起票する。

前提

Commander（Dock）から呼び出される
ソースコードの調査は全て Dispatch 経由（Commander はコードを直接読まない）
Issue 起票は Commander が gh issue create で行う（Dispatch は起票しない）

Step 1: 3 Dispatch を並行起動

3 つの Dispatch を順次発行し、全完了を待つ。

1a. チャットログ分析（Dispatch Y）

Ship の ship-log.jsonl / escort-log.jsonl を読み、潜在バグのパターンを検出する。

curl -s -X POST http://localhost:$VIBE_ADMIRAL_ENGINE_PORT/api/dispatch \
  -H 'Content-Type: application/json' \
  -d '{
    "fleetId": "<fleet-id>",
    "parentRole": "dock",
    "name": "audit-bug-chatlog",
    "type": "investigate",
    "cwd": "<repo-path>",
    "prompt": "You are a Dispatch agent analyzing Ship chat logs for potential bugs.\n\nRepo: <repo-path>\n\n## Task\n\nAnalyze Ship and Escort chat logs to find evidence of bugs.\n\n### Steps\n\n1. Find all worktree directories:\n   ls -d ../.worktrees/feature/*/ 2>/dev/null || ls -d .worktrees/feature/*/ 2>/dev/null\n2. Randomly select 3-5 worktrees that have log files\n3. For each selected worktree, read:\n   - `.claude/ship-log.jsonl` — Ship activity log\n   - `.claude/escort-log.jsonl` — Escort gate review log\n4. Analyze for these bug indicators:\n   - **Error messages**: unhandled exceptions, stack traces, unexpected errors\n   - **Abnormal transitions**: phase transitions that skip steps or go backwards unexpectedly\n   - **Retry loops**: the same operation retried more than 3 times\n   - **Unhandled exceptions**: errors that were not caught or recovered from\n   - **Escort reject patterns**: repeated rejections for the same reason (indicates a systemic issue)\n   - **Process crashes**: processDead events, unexpected exits\n   - **Timeout patterns**: operations that consistently hit timeouts\n\n### Output format\n\n```\n## Chat Log Analysis Results\n\n### Analyzed Ships\n| Ship (worktree) | Issue # | Logs Found |\n|-----------------|---------|------------|\n\n### Potential Bugs Found\n\n#### Bug Y-1: <title>\n- **Source**: <worktree> / <log file>\n- **Evidence**: <relevant log excerpt (max 5 lines)>\n- **Category**: ERROR | ABNORMAL_TRANSITION | RETRY_LOOP | UNHANDLED_EXCEPTION | ESCORT_PATTERN | CRASH | TIMEOUT\n- **Severity**: critical | high | medium | low\n- **Description**: <what the bug appears to be>\n- **Suspected Root Cause**: <hypothesis based on log evidence>\n\n### Escort Reject Patterns\n| Reject Reason | Frequency | Affected Ships |\n|---------------|-----------|----------------|\n```\n\nDo NOT create issues or make changes. Only investigate and report."
  }'

Step 1: 3 Dispatch を並行起動

3 つの Dispatch を順次発行し、全完了を待つ。

1a. チャットログ分析（Dispatch Y）

Ship の ship-log.jsonl / escort-log.jsonl を読み、潜在バグのパターンを検出する。

curl -s -X POST http://localhost:$VIBE_ADMIRAL_ENGINE_PORT/api/dispatch \ -H 'Content-Type: application/json' \ -d '{ "fleetId": "<fleet-id>", "parentRole": "dock", "name": "audit-bug-e2e", "type": "investigate", "cwd": "<repo-path>", "prompt": "You are a Dispatch agent auditing E2E test coverage, enhancing tests, and running them to discover bugs.\n\nRepo: <repo-path>\n\n## Task\n\nExecute three sequential sub-steps (W-1 \u2192 W-2 \u2192 W-3). Do NOT skip W-1/W-2 even if tests appear comprehensive.\n\n### W-1: Test Coverage Analysis\n\n1. List all existing E2E test files:\n ls e2e/*.spec.ts 2>/dev/null || find . -name \"*.spec.ts\" -path \"*/e2e/*\"\n2. For each test file, read it and summarize what flow/feature it validates\n3. Cross-reference with the current feature set:\n - Ship lifecycle (sortie / stop / resume / abandon / retry)\n - Fleet CRUD (create / switch / delete / settings)\n - Commander operations (Flagship / Dock chat, Dispatch launch)\n - Gate flow (planning-gate / implementing-gate / acceptance-test-gate)\n - Phase transitions and UI updates\n4. Identify untested features/flows and output a coverage-gap table\n\n### W-2: Test Enhancement\n\n1. For identified coverage gaps, prioritize:\n - **Regression tests for past bugs** \u2014 query recent closed `type/bug` issues:\n gh issue list --state closed --label type/bug --limit 20 --json number,title,body\n For each past bug, assess whether an E2E regression test exists. If not, add one.\n - **Uncovered critical feature flows** \u2014 Fleet CRUD, Ship sortie/stop/resume, Commander chat, Dispatch launch\n2. Write new tests under `e2e/`, following existing test conventions\n3. Commit the added tests:\n git add e2e/<new-test-files>\n git commit -m \"test: add regression/coverage tests for audit-bug\"\n4. Only add tests that are actually executable and meaningful. Do NOT fabricate tests for features that do not exist.\n\n### W-3: Test Execution\n\n1. Check for playwright config:\n ls playwright*.config.* 2>/dev/null\n2. Run all E2E tests (existing + newly added):\n npx playwright test --config playwright.e2e.config.ts 2>&1\n3. If tests fail, run each failed test individually for detailed output:\n npx playwright test --config playwright.e2e.config.ts <test-file> 2>&1\n4. Categorize each failure:\n - **BUG**: Application code is broken (the test correctly catches a real bug)\n - **TEST_ISSUE**: Test itself is broken (selector changed, timing issue, bad assertion)\n - **FLAKY**: Intermittent failure (timing-dependent, environment-dependent)\n5. For BUG category failures, trace the root cause in the application code\n\n### Output format\n\n```\n## E2E Audit Results\n\n### W-1: Coverage Analysis\n\n#### Existing Tests\n| Test File | Covered Flow/Feature |\n|-----------|----------------------|\n\n#### Coverage Gaps\n| Untested Area | Risk Level | Description |\n|---------------|-----------|-------------|\n\n### W-2: Test Enhancement\n\n#### Added Tests\n| Test File | Target | Type (regression / coverage) | Related Past Bug |\n|-----------|--------|------------------------------|-------------------|\n\n#### Commit\n- Commit SHA: <sha>\n- Message: <commit message>\n\n### W-3: Execution Summary\n- **Total**: N tests\n- **Passed**: N\n- **Failed**: N\n- **Skipped**: N\n\n#### Failures\n\n##### Bug W-1: <title>\n- **Test**: `e2e/<file>.spec.ts` \u2014 \"<test name>\"\n- **Category**: BUG | TEST_ISSUE | FLAKY\n- **Error**: <error message>\n- **Stack**: <first 5 lines of stack trace>\n- **Root Cause**: <analysis of why this fails>\n- **Severity**: critical | high | medium | low\n\n## Summary\n| Category | Count |\n|----------|-------|\n| Coverage Gaps (W-1) | N |\n| Added Tests (W-2) | N |\n| BUG | N |\n| TEST_ISSUE | N |\n| FLAKY | N |\n```\n\nYou MAY write and commit new test files under `e2e/` for W-2. Do NOT modify application source code and do NOT create issues. Only audit, enhance tests, run tests, and report." }'

Audit Bug

/audit-bug — 未発見バグの検出 & Issue 起票

前提

Step 1: 3 Dispatch を並行起動

1a. チャットログ分析（Dispatch Y）

Audit Bug

/audit-bug — 未発見バグの検出 & Issue 起票

前提

Step 1: 3 Dispatch を並行起動

1a. チャットログ分析（Dispatch Y）

1b. ソースコード分析（Dispatch Z）

1c. E2E テストカバレッジ分析・増強・実行（Dispatch W）

Dispatch 完了待ち

Step 2: 結果の統合・分析

2a. 結果の取得

2b. 統合・重複排除

2c. 既存 issue との照合

Step 3: Issue 起票

起票ルール

Step 4: E2E テスト増強 issue の起票

起票ルール

Step 5: サマリ報告

Session Logs

OpenClaw Test Heap Leaks

Node Connect

Openclaw Qa Testing

Openclaw Secret Scanning Maintainer

Flags