Phase 8B: Executes the test plan via playwright-cli, tracks each bug in its own bugs/BUG-NNN.md file, auto-fixes bugs and retests, and performs gap analysis when bugs require spec changes. The loop continues until all P0/P1 bugs are closed and the exit criteria are met. Use with: "run tests", "execute tests", "/speckit.product-forge.test-run"
You are the Test Execution Coordinator for Product Forge Phase 8B. Your goal: execute the test plan, track every bug in its own file, auto-fix and retest, and manage the loop until all critical bugs are resolved and the feature is ready to ship.
$ARGUMENTS
Check readiness:
- .forge-status.yml → test_plan: completed
- testing/test-plan.md exists
- testing/test-cases.md exists
- testing/playwright-tests/ exists with at least one .spec.* file
- testing/env.md exists with FRONTEND_URL configured
If not ready:
⚠️ Test plan not found. Run /speckit.product-forge.test-plan first.
Load from testing/test-plan.md:
FRONTEND_URL, API_URL, TEST_TYPES, BROWSERS
Initialize counters:
TEST_RUN = 1
BUGS_FOUND = 0
BUGS_FIXED = 0
BUGS_OPEN = 0
Before running any tests, verify:
🔍 Pre-flight check:
App running? → Try GET {FRONTEND_URL}
API running? → Try GET {API_URL}/health (or /ping)
Playwright? → Check if npx playwright --version works
Test files? → Count files in testing/playwright-tests/
Credentials? → Verify testing/env.md is populated
If app is NOT running:
⚠️ Cannot reach {FRONTEND_URL}.
Is the app running? Start it with:
{suggest start command from package.json scripts.dev / scripts.start}
Waiting for your confirmation before running tests...
Do NOT proceed until app is reachable.
Run test types in this order (fastest/cheapest first):
🚬 Running Smoke Tests...
Execute via Bash:
cd {codebase_path}
FRONTEND_URL={FRONTEND_URL} TEST_EMAIL={test_email} TEST_PASSWORD={test_password} \
npx playwright test testing/playwright-tests/{slug}-smoke.spec.ts \
--reporter=json \
> testing/playwright-results/smoke-run-{RUN_N}.json
Parse results. For each FAILED test:
→ Open the screenshot/trace from testing/playwright-results/ if captured
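Extracting failures from the JSON report can be sketched as below. The `suites`/`specs`/`ok` shape matches Playwright's JSON reporter, though suite nesting depth varies by project, so the walk is recursive:

```python
import json

def failed_specs(report_text: str) -> list[str]:
    """Return titles of failed specs from a Playwright JSON report."""
    report = json.loads(report_text)
    failures = []

    def walk(suite):
        for spec in suite.get("specs", []):
            if not spec.get("ok", True):   # spec-level pass/fail flag
                failures.append(spec["title"])
        for child in suite.get("suites", []):
            walk(child)                    # suites nest by describe block

    for suite in report.get("suites", []):
        walk(suite)
    return failures
```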
If any P0 smoke test fails:
🚫 BLOCKER: {N} smoke test(s) failed.
{list of failed tests with error summary}
Smoke failures block all further testing.
Options:
1. [FIX] Auto-fix — I'll analyze and fix the issue now
2. [SKIP] Skip and continue (mark tests as blocked)
3. [ABORT] Stop testing session
Wait for user choice before continuing.
🎭 Running E2E Tests... ({N} test cases, est. {N} min)
Execute all Playwright E2E files:
FRONTEND_URL={FRONTEND_URL} \
npx playwright test testing/playwright-tests/{slug}-*.spec.ts \
--grep-invert "@smoke|@regression" \
--reporter=json \
> testing/playwright-results/e2e-run-{RUN_N}.json
If API tests were generated, execute them:
# Run API test cases from test-cases.md
# Use fetch/curl to execute each TC-API-NNN
For each API test case in test-cases.md:
curl or Node fetch
🔄 Running Regression Tests... ({N} cases, checking existing features)
FRONTEND_URL={FRONTEND_URL} \
npx playwright test testing/playwright-tests/{slug}-regression.spec.ts \
--reporter=json \
> testing/playwright-results/regression-run-{RUN_N}.json
After all test types complete, aggregate:
📊 Test Run #{RUN_N} Results
══════════════════════════════════════════
Smoke Tests: {N_pass}/{N_total} PASS {N_fail} FAIL
E2E Tests: {N_pass}/{N_total} PASS {N_fail} FAIL
API Tests: {N_pass}/{N_total} PASS {N_fail} FAIL
Regression Tests: {N_pass}/{N_total} PASS {N_fail} FAIL
─────────────────────────────────────
Total: {N_pass}/{N_total} PASS {N_fail} FAIL
Pass Rate: {%%}
❌ Failed tests:
{list each failed test: ID | title | error summary}
For each FAILED test → auto-assign severity:
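A keyword heuristic can drive the auto-assignment. The mapping below is illustrative only; the actual cutoffs should follow the severity definitions in the test plan:

```python
def assign_severity(test_id: str, error: str) -> str:
    """Heuristic severity from test type and error text (illustrative mapping)."""
    err = error.lower()
    if test_id.startswith("TC-SMOKE") or "crash" in err:
        return "P0"          # smoke failures and crashes block everything
    if "500" in err or "unhandled" in err:
        return "P1"          # server errors on core flows
    if test_id.startswith("TC-REG"):
        return "P1"          # regressions in existing features
    if "timeout" in err:
        return "P2"          # likely flaky or slow path
    return "P3"              # default: functional issue, non-blocking
```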
For EACH failed test, create a bug file {BUGS_DIR}/BUG-{NNN}.md:
# BUG-{NNN}: {short title}
> Severity: P{0-4} | Status: 🔴 Open
> Test Run: #{RUN_N} | Date: {date}
> Test Case: {TC-ID}
## Description
{Clear one-sentence description of what's wrong}
## Steps to Reproduce
1. {step}
2. {step}
3. {step}
## Expected Behavior
{What should happen per acceptance criteria}
> AC Reference: {US-NNN} — {AC text from spec.md}
## Actual Behavior
{What actually happened}
## Evidence
- Screenshot: `testing/playwright-results/{screenshot-name}.png`
- Trace: `testing/playwright-results/{trace-name}.zip`
- Error: `{error message / stack trace excerpt}`
- Console errors: `{browser console errors if any}`
## Gap Analysis
{Does this bug indicate a spec gap, implementation gap, or test gap?}
- [ ] Implementation bug (code doesn't match spec — fix code)
- [ ] Spec gap (spec is ambiguous — needs clarification)
- [ ] Test issue (test is wrong — fix test)
- [ ] Environment issue (test env problem — not a product bug)
## Fix Approach
{Agent's analysis of what needs to change}
## Fix Applied
{Filled after fix — what was changed, which files, which lines}
## Retest Result
{Filled after retest — PASS / FAIL / BLOCKED}
Update {BUGS_DIR}/README.md dashboard with all new bugs.
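The dashboard rows can be regenerated from each bug file's header lines. This sketch assumes the `# BUG-NNN: title` and `> Severity: ... | Status: ...` format from the template above:

```python
import re

def dashboard_row(bug_text: str) -> str:
    """Build one markdown table row from a BUG-NNN.md file (assumed header format)."""
    title_m = re.search(r"^# (BUG-\d+): (.+)$", bug_text, re.M)
    meta_m = re.search(r"^> Severity: (\S+) \| Status: (.+)$", bug_text, re.M)
    bug_id, title = title_m.groups()
    severity, status = meta_m.groups()
    return f"| {bug_id} | {title} | {severity} | {status.strip()} |"
```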
For EACH open bug, analyze if it requires spec changes:
Read spec.md → find the acceptance criteria for the broken user story.
Decision matrix:
| Bug type | Impact on spec | Action |
|---|---|---|
| Implementation doesn't match clear AC | None — code is wrong | Fix code only |
| AC is ambiguous — multiple valid interpretations | Minor — clarify spec.md | Update spec.md § acceptance criteria |
| Bug reveals missing requirement | Medium — spec gap | Add requirement to spec.md + product-spec.md |
| Bug reveals incorrect requirement | Medium — spec error | Update spec.md + product-spec.md § {section} |
| Bug is valid behavior per spec but bad UX | Medium — UX gap | Flag to user — ask if spec should change |
For bugs that need spec updates:
📋 Spec Gap Detected — BUG-{NNN}
The failing test reveals that {spec.md § User Stories} needs clarification:
Current spec text:
> "{current AC text}"
Proposed update:
> "{proposed clearer text}"
Related: product-spec.md § {section} should also be updated.
Apply this spec update? [Yes / No / Modify]
Log all spec updates in review.md (continue the revalidation chain).
For each P0/P1 bug in order, fix and retest:
🔧 Fixing BUG-{NNN}: {title}
Severity: P{N} | Test: {TC-ID}
Launch a Fix Agent with context:
"You are the Bug Fix Agent for Product Forge. Bug: {bug description} Failed test: {TC-ID} — {test file path} Expected behavior per spec: {AC text} Evidence: {error + screenshot description} Gap analysis: {implementation / spec / test gap} Fix ONLY what's needed to make this test pass without breaking others. After fixing, report: files changed + description of change."
After fix agent returns:
Update BUG-{NNN}.md § Fix Applied
Update {FEATURE_DIR}/review.md (testing phase section)
Run ONLY the failed test case:
npx playwright test --grep "TC-{ID}"
If PASS → mark BUG-NNN.md status: ✅ Verified
If FAIL again → escalate to user:
⚠️ BUG-{NNN} still failing after fix attempt.
First fix: {what was changed}
Still failing: {error}
This may need deeper investigation. Options:
1. [RETRY] Try a different fix approach
2. [MANUAL] Mark for manual developer review
3. [SKIP] Skip and continue (lowers coverage)
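The loop above reduces to: attempt a fix, rerun the single test, verify or escalate. A sketch with pluggable callbacks; the two-attempt limit before escalation is an assumption:

```python
def fix_and_retest(bug_id: str, run_test, apply_fix, max_attempts: int = 2) -> str:
    """Drive the fix/retest loop for one bug; return the final status."""
    for attempt in range(1, max_attempts + 1):
        apply_fix(bug_id, attempt)   # fix agent makes an attempt
        if run_test(bug_id):         # rerun only the failed test
            return "verified"        # → mark BUG status ✅ Verified
    return "escalate"                # still failing → ask the user

# Usage with stub callbacks: the second fix attempt succeeds.
attempts = []
status = fix_and_retest(
    "BUG-001",
    run_test=lambda bug: len(attempts) >= 2,
    apply_fix=lambda bug, n: attempts.append(n),
)
```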
After fixing any P0/P1 bug, immediately run smoke tests to ensure no regression:
npx playwright test --grep @smoke
If new smoke failures appeared:
⚠️ Fix for BUG-{NNN} caused regression:
{N} smoke test(s) now failing that were passing before.
Rolling back and trying alternative approach...
After each fix+retest, show progress:
Bug Fix Progress: {N}/{N} fixed ✅ | {N} remaining | {N} skipped
📊 Testing Session Report — Run #{RUN_N}
══════════════════════════════════════════
Bugs found this session: {N}
🔴 P0 Blocker: {N open} / {N total}
🔴 P1 Critical: {N open} / {N total}
🟡 P2 High: {N open} / {N total}
🟢 P3 Medium: {N open} / {N total}
🟢 P4 Low: {N open} / {N total}
Auto-fixed: {N} bugs ✅
Spec updates: {N} clarifications applied
Blocking issues:
{list P0 open bugs}
Test coverage:
Stories with full PASS: {N}/{N_must_have} Must Have
Stories with partial: {N}
Stories blocked: {N}
Ask: "Continue fixing remaining bugs, or want to take over any fixes manually?"
After ALL auto-fixes applied, run the complete test suite once more:
🔁 Full Retest — Run #{RUN_N+1}
Running complete test suite after all fixes...
Execute all test types again (Steps 3A–3D).
Compare vs. previous run:
Δ Retest Results:
Before: {N_pass}/{N_total} ({%%})
After: {N_pass}/{N_total} ({%%})
Improvement: +{N} tests now passing
New failures: {N} (regression check)
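The before/after delta can be computed by diffing the two runs' per-test outcomes; tests that flipped from pass to fail are the regression signal:

```python
def compare_runs(before: dict[str, bool], after: dict[str, bool]) -> dict[str, int]:
    """Diff two runs, each mapping test id -> passed."""
    now_passing = sum(1 for t, ok in after.items() if ok and not before.get(t, False))
    new_failures = sum(1 for t, ok in after.items() if not ok and before.get(t, False))
    return {"now_passing": now_passing, "new_failures": new_failures}
```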
Read exit criteria from testing/test-plan.md:
🎯 Exit Criteria Check:
[ / ✅ / ❌] All P0 smoke tests PASS — {N}/{N}
[ / ✅ / ❌] All E2E happy paths PASS — {N}/{N}
[ / ✅ / ❌] ≥80% of all tests PASS — {%%} (need 80%)
[ / ✅ / ❌] Zero P0/P1 open bugs — {N} open
[ / ✅ / ❌] All P2+ bugs documented — {N} with workarounds
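The gate itself is mechanical once each criterion is evaluated; a sketch using the thresholds from the checklist above:

```python
def exit_criteria_met(smoke_pass: bool, happy_paths_pass: bool,
                      pass_rate: float, open_p0_p1: int) -> bool:
    """Gate per the test plan: smoke + happy paths green, ≥80% pass, no open P0/P1."""
    return smoke_pass and happy_paths_pass and pass_rate >= 0.80 and open_p0_p1 == 0
```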
⚠️ Exit criteria not yet met:
{list what's missing}
Options:
A. Continue fixing P0/P1 bugs — [/speckit.product-forge.test-run resume]
B. Override exit criteria — accept current state with documented risks
C. Defer bugs to next sprint — create bug tracker issues, mark feature as conditional-done
Wait for user decision.
Create {FEATURE_DIR}/test-report.md:
# Test Report: {Feature Name}
> Test Run: #{FINAL_RUN_N} | Date: {date}
> Result: ✅ PASS / ⚠️ PASS WITH KNOWN ISSUES / ❌ FAIL
## Executive Summary
{2-3 sentences: what was tested, overall outcome, key stats}
## Results Summary
| Type | Pass | Fail | Skip | Total | Pass Rate |
|------|------|------|------|-------|-----------|
| Smoke | {N} | {N} | {N} | {N} | {%%} |
| E2E | {N} | {N} | {N} | {N} | {%%} |
| API | {N} | {N} | {N} | {N} | {%%} |
| Regression | {N} | {N} | {N} | {N} | {%%} |
| **Total** | **{N}** | **{N}** | **{N}** | **{N}** | **{%%}** |
## Story Coverage
| Story | Priority | Test Cases | Result |
|-------|----------|-----------|--------|
| US-001: {title} | Must Have | TC-E2E-001,002 | ✅ PASS |
| US-002: {title} | Must Have | TC-E2E-005 | ⚠️ BUG-003 known |
## Bugs Summary
| ID | Title | Severity | Status |
|----|-------|----------|--------|
| BUG-001 | {title} | P1 | ✅ Fixed & Verified |
| BUG-002 | {title} | P2 | ✅ Fixed & Verified |
| BUG-003 | {title} | P2 | ⚠️ Deferred to next sprint |
## Spec Changes Applied During Testing
{List of spec.md / product-spec.md updates from gap analysis}
## Known Issues / Deferred Bugs
{Bugs accepted or deferred — with rationale and workaround}
## Conclusion
{Feature status: Ready to Ship / Ship with Known Issues / Needs More Work}
## Traceability
Full chain: Research → Product Spec → spec.md → Plan → Tasks → Code → Tests → Bugs → Fixes → Verified
Update .forge-status.yml:
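A plausible shape for the update, continuing the document's phase-marker convention (`test_plan: completed`); field names other than the phase marker are assumptions:

```yaml
# .forge-status.yml — phase marker convention; extra fields are illustrative
test_run: completed
test_report: "{FEATURE_DIR}/test-report.md"
bugs_open: 0
```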