Crawl a website and auto-generate .abtest.ts files for shaka-perf visreg visual regression testing. Use this skill whenever the user wants to discover, generate, or scaffold AB tests for a URL — even if they just say "set up tests for localhost:3020", "generate tests for this site", or "create visreg tests".
Crawl a target site in Chrome, probe pages interactively to understand their behavior, then generate validated .abtest.ts files for shaka-perf visreg.
The goal is to produce tests that actually work — not just syntactically valid files. That's why each page is probed in the browser before writing any code: it avoids generating tests for interactions that don't exist, CSS overrides that don't work, or skeleton waits for elements that never appear.
| Path | When to use |
|---|---|
| `scripts/extract-links.js` | Run via `javascript_tool` to collect internal links from any page |
| `scripts/probe-lazy-load.js` | Run via `javascript_tool` to test whether scrolling triggers new content |
| `scripts/probe-sections.js` | Run via `javascript_tool` on tall pages (>2000px) to score candidate CSS selectors for section-based testing |
| `scripts/parse-report.py` | Run after compare to summarize pass/fail, diff %, whitespace metrics, and engine errors |
| `references/patterns.md` | Read when writing `.abtest.ts` files — contains code patterns and selector strategy |
| `references/api.md` | Read when you need the full `abTest()` config API or helpers reference |
Read these files as needed rather than trying to keep the full details in mind. The patterns and API reference are too detailed to hold mentally — just load them.
Parse from the user's message:
1. **Target URL** — normalize it (e.g. printivity.com → http://printivity.com)
2. **Crawl depth** — depth 1 = starting page only; depth 2 = starting page + linked pages; depth 3 = one more level out
3. **Output directory** — default `./ab-tests/`
4. **Concurrency** — number of browser tabs for parallel link extraction in Phase 1
5. **Mode** — default `twin-server`. Controls how tests run:
   - `twin-server` — compares control (e.g. localhost:3020) vs experiment (e.g. localhost:3030)
   - `single-server` — both `--controlURL` and `--experimentURL` set to the same URL; validates test structure without a real A/B pair

If no URL was provided, ask for it before proceeding.
Load mcp__claude-in-chrome__tabs_context_mcp first to get a tab, then navigate to the URL.
This phase ONLY extracts links — no probing, no testing. The goal is to build a list of pages to process.
Maintain throughout:
- `visited` — set of paths already link-extracted
- `queue` — `[{ path, depth }]`, initialized with `[{ path: '/', depth: 1 }]`
- `pageList` — ordered list of unique paths to process in Phase 2

Process the queue in BFS order, up to `concurrency` pages in parallel:

1. Dequeue up to `concurrency` entries not yet visited.
2. Open a tab per entry (`tabs_create_mcp` for tabs 2–N; reuse the existing tab for the first).
3. Run `scripts/extract-links.js` via `javascript_tool`. If depth < crawlDepth, enqueue new paths as `{ path, depth: depth + 1 }`.

Hard limit: max 40 unique paths.
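The crawl bookkeeping described above can be sketched as plain queue-and-set logic. This is an illustrative sketch only: `extractLinks` is a hypothetical stand-in for running `scripts/extract-links.js` in a browser tab via `javascript_tool`, and the real loop processes up to `concurrency` extractions in parallel, which is omitted here for clarity.

```javascript
// Sketch of the Phase 1 BFS bookkeeping. `extractLinks(path)` is a stand-in
// for running scripts/extract-links.js in a tab via javascript_tool.
const MAX_PATHS = 40; // hard limit on unique paths

function crawl(extractLinks, crawlDepth) {
  const visited = new Set();               // paths already link-extracted
  const queue = [{ path: '/', depth: 1 }]; // BFS frontier
  const pageList = [];                     // unique paths for Phase 2

  while (queue.length > 0 && pageList.length < MAX_PATHS) {
    const { path, depth } = queue.shift(); // BFS order: take from the front
    if (visited.has(path)) continue;
    visited.add(path);
    pageList.push(path);

    // Only follow links while we are above the depth limit.
    if (depth < crawlDepth) {
      for (const next of extractLinks(path)) {
        if (!visited.has(next)) queue.push({ path: next, depth: depth + 1 });
      }
    }
  }
  return pageList;
}
```

Note how depth 1 yields only the starting page, depth 2 adds its direct links, and so on, matching the crawl-depth semantics above.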
Skip only these:
- `tel:`, `mailto:`, and anchor-only links (`#section`)
- auth-required admin areas (`/admin` — but go to `/login` normally)
- OAuth callbacks (`/auth/callback`, `/oauth`)
- API endpoints (`/api/`, `.json` endpoints)
- duplicate pagination (`/products?page=2` when `/products` is already queued)

Do not skip pages just because they seem "boring" or static.
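The skip rules can be expressed as a small predicate. This is a hedged sketch — the patterns below are illustrative interpretations of the list above, not an authoritative filter:

```javascript
// Sketch of the link-skip rules. The bullet list above is authoritative;
// these regexes are one plausible encoding of it.
function shouldSkip(href, queuedPaths) {
  if (/^(tel:|mailto:|#)/.test(href)) return true;          // non-page links
  const [path, query] = href.split('?');
  if (/^\/(admin|api)\b/.test(path)) return true;           // admin + API routes
  if (/\/(auth\/callback|oauth)\b/.test(path)) return true; // OAuth callbacks
  if (path.endsWith('.json')) return true;                  // raw data endpoints
  // Duplicate pagination: /products?page=2 when /products is already queued.
  if (query && /(^|&)page=/.test(query) && queuedPaths.has(path)) return true;
  return false;
}
```

Note that `/login` deliberately passes through — only auth-*gated* areas like `/admin` are skipped.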
Process pages from pageList one at a time, sequentially. For each page, complete all five steps (A → B → C → D → E) before moving to the next page.
Maintain across pages:
- `claimedSections` — map of selector → path. When a section appears on multiple pages, only the first page to claim it gets a test for it.
- `knownLoadingSelectors` — set of CSS selectors for spinners/skeletons/loading indicators discovered on any page so far. Grows as new ones are found.

Navigate to the page in Chrome. Complete all probing steps in sequence before writing any code.
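The first-claim-wins deduplication behind `claimedSections` can be sketched in a few lines (field names here are illustrative):

```javascript
// Sketch of first-claim-wins dedup for shared sections.
const claimedSections = new Map(); // selector -> path that owns the test

function claimSection(selector, path) {
  if (claimedSections.has(selector)) {
    // Already tested elsewhere — record a skip, don't write a duplicate test.
    return { claimed: false, alreadyCoveredBy: claimedSections.get(selector) };
  }
  claimedSections.set(selector, path);
  return { claimed: true };
}
```

So if the homepage claims `footer`, every later page that also contains `footer` records a skip instead of generating a redundant test.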
A1. Check for lazy-loaded content (always, every page):
Use scripts/probe-lazy-load.js. Wait for networkidle first — probing during an in-flight API call gives false results. Then scroll incrementally using real mouse scroll actions (not window.scrollTo in JS) — IntersectionObserver-based lazy loaders only fire on genuine scroll events. Scroll 10 ticks at a time via mcp__claude-in-chrome__scroll, wait 500ms between each, until window.scrollY + window.innerHeight >= document.body.scrollHeight. Wait 2 more seconds, compare image count and scroll height to baseline. Record the result.
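The before/after comparison that `probe-lazy-load.js` performs boils down to this check (field names are assumptions for illustration, not the script's actual output schema):

```javascript
// Sketch of the lazy-load comparison: baseline metrics vs metrics after a
// full real-scroll pass. Field names are illustrative.
function detectLazyLoad(before, after) {
  return {
    newImages: after.imageCount - before.imageCount,
    heightGrowth: after.scrollHeight - before.scrollHeight,
    lazyLoadDetected:
      after.imageCount > before.imageCount ||
      after.scrollHeight > before.scrollHeight,
  };
}
```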
A2. Wait for loading indicators to clear (always, every page):
Check for spinners, skeleton screens, loading indicators. Use javascript_tool to look for: aria-label="Loading", role="progressbar", class names containing skeleton, spinner, loading, placeholder. Add any found to knownLoadingSelectors and wait for them to disappear. Also check all selectors already in knownLoadingSelectors. Do not proceed until all loading indicators are gone.
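The A2 heuristics can be sketched as a classifier over an element's attributes and class names. The element shape (`ariaLabel`, `role`, `className`) is an assumption for illustration; in practice this runs in the page via `javascript_tool`:

```javascript
// Sketch of the A2 heuristics: flag an element as a loading indicator.
// Token list mirrors the prose above.
const LOADING_TOKENS = ['skeleton', 'spinner', 'loading', 'placeholder'];

function isLoadingIndicator(el) {
  // el: { ariaLabel, role, className } — shape is illustrative.
  if (el.ariaLabel === 'Loading') return true;
  if (el.role === 'progressbar') return true;
  const cls = (el.className || '').toLowerCase();
  return LOADING_TOKENS.some((t) => cls.includes(t));
}
```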
A3. CSS animation overrides (if you see moving elements): inject via javascript_tool and screenshot to confirm it stopped. Only include in tests if the screenshot shows the element frozen.
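A generic freeze-everything override like the following is one way to probe A3. This is a hedged sketch: the CSS the tests should actually use comes from `references/patterns.md`; this snippet only illustrates what you might inject via `javascript_tool` to confirm an animation can be frozen.

```javascript
// Hypothetical A3 probe: a generic animation-freeze override. The
// project-preferred CSS lives in references/patterns.md.
const FREEZE_CSS =
  '*, *::before, *::after { animation: none !important; transition: none !important; }';

function buildOverrideSnippet(css) {
  // Returns a one-shot script that appends a <style> tag with the override.
  return (
    `(() => { const s = document.createElement('style');` +
    ` s.textContent = ${JSON.stringify(css)};` +
    ` document.head.appendChild(s); })()`
  );
}
```

Run the returned snippet in the page, then screenshot — only keep the override in a test if the screenshot shows the element frozen.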
A4. Page sections: run document.body.scrollHeight (after real scrolling in A1). If >~2000px, use both strategies to find the best CSS selectors for section-based testing:
Strategy 1 — Algorithmic probe: Run scripts/probe-sections.js via javascript_tool. It walks the DOM from the layout root, scores elements by size (100-800px = best), width, depth, semantic class name, heading inclusion, content density, and uniqueness. Returns up to 15 scored candidates with overlap removed. Elements >1000px tall are deprioritized so their children get picked instead.
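The scoring idea can be sketched roughly as follows. The weights and factors here are illustrative, not the script's actual values — the real `probe-sections.js` also scores depth, heading inclusion, content density, and removes overlaps:

```javascript
// Rough sketch of the probe-sections.js scoring idea. Numbers are
// illustrative, not the script's real weights.
function scoreCandidate(el) {
  // el: { height, width, viewportWidth, className } — shape is illustrative.
  let score = 0;
  if (el.height >= 100 && el.height <= 800) score += 3; // ideal section height
  else if (el.height > 1000) score -= 2;                // too tall: prefer children
  if (el.width >= 0.9 * el.viewportWidth) score += 1;   // full-width layout block
  if (/(hero|section|footer|nav|feature)/.test(el.className || '')) score += 2;
  return score;
}
```

The negative score for elements over 1000px is what pushes the probe toward their children instead.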
Strategy 2 — AI visual analysis: Scroll through the page and identify the natural visual sections a user would recognize — hero, content blocks, sidebars, forms, navigation, footer. For each, find the closest DOM element that wraps it. Evaluate: "If I capture just this element, will the screenshot show a recognizable, self-contained piece of UI?"
Merge and evaluate candidates from both strategies:
- Reject empty containers (e.g. a `<div>` placeholder with no iframe or canvas). Sections showing "empty state" UI (like "Reviews (0)" with a button) are real UI and should be tested.
- If a section's heading `<h2>` sits above the candidate, try the parent instead.
- If a candidate is hidden at some breakpoints, plan a `viewports` override for it.
A5. Catalog interactive elements: use javascript_tool to find all clickable/interactive elements on the page. Query for button, a[href] (non-navigation), input, select, textarea, [role="tab"], [aria-expanded], [data-toggle], .btn, etc. Record each with its selector, visible text, and location on the page.
A6. Test interactions: click each interactive element in Chrome and document what happens:
Button opens a modal or drawer? → record the modal's content and selectors
Checkbox changes visible state? → record
Button scrolls? → record
Button does something visible? → record
Tab reveals content or scrolls? → record
Anything produces validation errors? → record
Link navigates to another page? → record the destination (but don't write a navigation test — the destination page gets its own tests)
Form inputs found? → this is important. For every form on the page (whether inline or inside a modal), record ALL input fields with their selectors, types, labels, and what values to fill them with. This includes:
- text inputs (`input[type="text"]`, `input[name="..."]`)
- date pickers (`input[type="date"]`, calendar widgets)
- number inputs (`input[type="number"]`, guest counters with +/- buttons)
- dropdowns (`select`, custom dropdowns)

Try filling them during probing — actually type values into inputs, select dates on calendars, increment number fields, check checkboxes. This confirms what works and what doesn't before you write test code.
Fill before clicking action buttons. When a form has both inputs and a submit/action button (like "Book Now", "Search", "Apply"), the right test sequence is: fill all inputs first → capture the filled state → then click the button. A test that clicks "Book Now" without filling in dates and guests misses the most interesting UI state (the populated form) and may also miss validation behavior.
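The fill-then-capture-then-click ordering can be sketched as a step planner. The field and step shapes are illustrative assumptions, not a real shaka-perf API:

```javascript
// Sketch of the fill-before-click ordering: given probed form fields and an
// optional action button, produce test steps in the order described above.
function planFormTest(fields, actionButtonSelector) {
  const steps = fields.map((f) => ({
    action: 'fill',
    selector: f.selector,
    value: f.sampleValue,
  }));
  steps.push({ action: 'snapshot', label: 'filled form' }); // capture populated state
  if (actionButtonSelector) {
    steps.push({ action: 'click', selector: actionButtonSelector });
  }
  return steps;
}
```

Capturing the snapshot between filling and clicking is the point: it records the populated-form state that a click-first test would skip straight past.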
A7. Probe inside modals/expanded UI: when clicking reveals new UI (modal, drawer, expanded panel), probe THAT UI for its own interactive elements — buttons, forms, links within the modal. Keep going as long as new testable UI appears. For each form inside a modal, record all fields so you can write a form-fill test in Step B.
For every confirmed interaction, plan a test. For every form found, plan three tests:
This applies to inline forms too (forms that are already visible on the page without clicking anything). A booking form with date pickers and guest selectors, a search form with filters, a contact form — these all need fill tests. The filled state of a form is valuable test coverage because it exercises input rendering, validation UI, and date/number formatting.
A8. Check responsive behavior — this step is mandatory, not optional. Without it you'll write tests that fail on phone (selector doesn't exist) or miss mobile-only UI entirely. Every page gets A8, no exceptions.
After completing desktop probing (A1-A7), resize the browser to phone width and re-probe:
- Resize with `mcp__claude-in-chrome__resize_window`
- Re-run `scripts/probe-sections.js` at this width
- Use `javascript_tool` to query visibility:
const el = document.querySelector('.rate-form-wrapper');
el ? { display: getComputedStyle(el).display, height: el.getBoundingClientRect().height } : 'NOT FOUND'
- If an element is hidden on phone, plan its test with `viewports: [tablet, desktop]` or `[desktop]`.
- Check if there's a mobile-specific replacement (e.g., `.mobile-nav` replaces `.nav-tabs`). If a replacement exists, plan a phone-only test for it.

Gate: before proceeding to A9, write down what you found — even if the answer is "mobile layout is identical, no differences found." If you can't describe what the mobile layout looks like, you haven't done A8.
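Turning A8 visibility probes into a `viewports` plan can be sketched as follows. The viewport names follow the prose; the input shape is an illustrative assumption:

```javascript
// Sketch of turning A8 visibility results into a viewports override.
function planViewports(visibility) {
  // visibility: { phone: bool, tablet: bool, desktop: bool }
  const all = ['phone', 'tablet', 'desktop'];
  const visibleOn = all.filter((v) => visibility[v]);
  if (visibleOn.length === all.length) return null; // default viewports, no override
  return visibleOn;                                 // e.g. ['tablet', 'desktop']
}
```

A mobile-only replacement element (e.g. visible on phone only) naturally comes out as `['phone']` — a phone-only test.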
A9. Record findings for this page:
- preferred selectors: `data-cy` attributes, ids, and stable structural landmarks
- updates to `claimedSections` — for each section skipped because another page already claimed it, record `{ selector, skippedOn, alreadyCoveredBy }`
Create/open the .abtest.ts file for this page (e.g., homepage.abtest.ts). Write abTest() stubs with // TODO: comments describing each planned test. Document ALL findings from probing so nothing is lost:
import { abTest, TestType } from 'shaka-shared';
import { waitUntilPageSettled } from 'shaka-perf/visreg/helpers';
// TODO: Hero section snapshot
// - selector: [data-cy="hero"] or .hero-section
// - wait for: .skeleton (found in A2) to disappear
// - threshold: 0.05 (dynamic hero image)
abTest('Homepage Hero', { startingPath: '/', options: { visreg: {} } }, async () => {});
// TODO: Click "Contact Us" button → modal opens
// - confirmed in A6: clicking button.contact-cta opens modal .contact-modal
// - inside modal (A7): form with name, email, message fields
// - .contact-cta is display:none on phone viewport (A8)
// - need desktop-only viewports
abTest('Homepage Contact Modal', { startingPath: '/', options: { visreg: {} } }, async () => {});
// TODO: Fill contact form inside modal
// - fields: input[name="name"], input[name="email"], textarea[name="message"]
// - depends on: opening the modal first (chained interaction)
abTest('Homepage Contact Form Fill', { startingPath: '/', options: { visreg: {} } }, async () => {});
Before moving to Step C, verify every category below has at least one TODO stub (or an explicit "none found" note). This is a gate — do not proceed until you've checked each one:
- Viewport coverage — elements hidden on phone get `viewports` restricting them away from phone. Any mobile-only elements found in A8 get a phone-only test. If A8 found no mobile-specific elements, write "A8: no mobile-specific elements found" as a comment in the file.

Threshold guidance:

- `0.01` — static content (legal pages, about text, documentation)
- `0.05` — standard pages (hero images, structured layouts)
- `0.1` — highly dynamic content (listing cards, deal cards, pages with varying image counts)

Never raise a threshold to hide a real failure — fix the root cause.
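The threshold guidance is a simple lookup; the classification of a page as static vs dynamic is the judgment call made during probing. The category names here are illustrative:

```javascript
// Sketch of the threshold guidance as a lookup. Categories mirror the
// guidance above; deciding which one applies happens during probing.
function pickThreshold(contentKind) {
  switch (contentKind) {
    case 'static':  return 0.01; // legal pages, about text, documentation
    case 'dynamic': return 0.1;  // listing cards, varying image counts
    default:        return 0.05; // standard pages
  }
}
```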
Always call annotate('description') immediately before each action. When a test fails, the report shows "Failed while <description>" — without annotations the error is a raw stack trace.
Annotate waits, clicks, scrolls, fills, and state changes. Don't annotate every trivial await.
Implement each TODO stub directly in the real .abtest.ts file, then validate it using --filter to run only that test by name:
Implement the TODO stub — replace the empty async () => {} with the real test body
Run with --filter to execute only this test (the filter is a regex matched against the test name):
Twin-server mode:
cd <app-directory> && yarn shaka-perf visreg-compare --testFile ab-tests/<page>.abtest.ts --filter "Homepage Hero"
Single-server mode:
cd <app-directory> && yarn shaka-perf visreg-compare --testFile ab-tests/<page>.abtest.ts --filter "Homepage Hero" --controlURL <url> --experimentURL <url>
Quick check: read the screenshot to verify real content was captured (not blank)
If pass → move on to the next TODO stub
If fail → debug and fix (up to 3 attempts). If still failing, comment out the abTest() call (don't delete it) and add a // TODO: comment explaining what's broken and what was tried. This preserves the test code so it's easy to revisit later.
Important: shaka-perf visreg must be run from the directory containing visreg.config.ts. If the user specified an app directory, cd there first.
After every test run, execute these checks:
1. Parse report.json (includes whitespace and error detection):
python3 .claude/skills/discover-abtests/scripts/parse-report.py
This prints status, diff%, whitespace%, and engine errors per test. Act on these flags:
- `HIGH-WHITE` (whitePixelPercent > 90%) → selector likely captures empty space. Re-evaluate: try a child element, a sibling, or a different section entirely. Always read the screenshot to confirm — a 30px property-specs strip can be 94% white yet "pass" since both servers captured the same tiny fragment.
- `ENGINE-ERR` → check `engineErrorMsg`. Common: `clip.width = 0` means the element has no width at this viewport — add a `viewports` override to exclude that breakpoint.
- `BOT70W = true` → content concentrated at top of element; bottom is empty. Consider a tighter selector.

A test that passes (0 diff) can still be broken if both control and experiment captured blank/useless content. The `whitePixelPercent` field catches this — high whitespace on a passing test means the selector is wrong.
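The flagging rules reduce to a per-entry check like the one below. The field names (`whitePixelPercent`, `engineErrorMsg`, `bottom70White`) are assumed from the prose for illustration, not taken from the real report schema:

```javascript
// Sketch of the per-test flagging rules parse-report.py applies.
// Report field names are assumptions, not the real schema.
function flagsFor(entry) {
  const flags = [];
  if (entry.whitePixelPercent > 90) flags.push('HIGH-WHITE');
  if (entry.engineErrorMsg) flags.push('ENGINE-ERR');
  if (entry.bottom70White) flags.push('BOT70W');
  return flags;
}
```

Note that `flagsFor` is independent of pass/fail status — which is exactly why a passing test can still be flagged `HIGH-WHITE`.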
2. Inspect screenshots visually — use the Read tool on .png files:
- `visreg_data/html_report/experiment_screenshot/`
- `visreg_data/html_report/reference_screenshot/`
- `visreg_data/html_report/experiment_screenshot/failed_diff_*.png`

Always look at screenshots before deciding on a fix. Do not rely on diff percentage alone.
After all TODO stubs are implemented:
1. Run `shaka-perf visreg-compare --testFile ab-tests/<page>.abtest.ts` with ALL tests in the file
2. Run `parse-report.py` and check for HIGH-WHITE / ENGINE-ERR flags
3. Open the HTML report (`visreg_data/html_report/index.html`) in Chrome

A test only counts as PASS when all of the following are met:
| # | Criterion | Fix if failing |
|---|---|---|
| 1 | No loading indicators (spinners, skeletons) visible | Add waitForLoadState('networkidle') and/or waitUntilPageSettled |
| 2 | No mid-animation carousels — frames frozen at deterministic position | Add CSS override confirmed in probing |
| 3 | No missing lazy-loaded content | Add scroll + networkidle + scroll-to-top. Blank screenshot check is ground truth — add scrolling even if probing didn't detect it |
| 4 | Thresholds appropriate — 0.01 static, 0.05 dynamic, 0.1 very dynamic | Fix root cause |
| 5 | All selectors resolve without timeout | Inspect DOM in Chrome, use a more reliable selector |
| 6 | Every non-trivial action is annotated | Add missing annotate(...) calls |
| 7 | No auth-gated content (login page/access denied in screenshot) | Remove the test |
| 8 | No unconfirmed interactions — every click/scroll/CSS override validated in probing | Remove unconfirmed action |
| 9 | No high-whitespace screenshots (whitePixelPercent < 90 in report.json) | Re-evaluate selector: try child element, parent, or different section. Read screenshot to verify. |
Before attempting any fix, look at the diff screenshot. There are two failure types: a real A/B difference (the test is working — mark it PASS (A/B diff)) and a test-infrastructure issue (which must be fixed). The table below maps symptoms to fixes:
| Symptom | Fix |
|---|---|
| UI change on one server only | Real A/B diff — mark PASS (A/B diff), do not raise threshold |
| Skeleton/spinner visible | waitForLoadState('networkidle') |
| Content missing (lazy) | scroll + networkidle + scroll-to-top |
| Carousel moving | CSS override from probing |
| Selector timeout | Inspect DOM in Chrome, use more reliable selector |
| High diff on dynamic content | hideSelectors/removeSelectors or wait for settle — do not raise threshold |
| Selector not found on some viewports | Element is display:none at that breakpoint — split into viewport-scoped tests (see patterns.md) |
| High whitespace (whitePixelPercent > 90%) | Wrong selector — try child, parent, or different section. Read screenshot to verify. |
| Cannot type text into `input[type=number]` | Use numeric-only strings: `'5551234567'` not `'555-123-4567'` |
| `button:has-text("X")` matches multiple | Use `page.getByLabel()`, `page.getByRole()`, or a more specific CSS selector |
| Timeout 60000ms on `page.goto` | Server too slow — page takes too long to respond |
| `size: isDifferent` | Dynamic content changes height between renders; often unfixable in single-server mode, mark NEEDS REVIEW |
| Strict mode violation | Multiple elements match — use `.first()` or a more specific selector |
After all files are validated, print the report below and write it to {output}/DISCOVERY_REPORT.md.
The "Coverage decisions" section is important — it makes deduplication reasoning transparent so the user can verify nothing was accidentally omitted.
## Discovered AB Tests
| File | Tests | Status | Notes |
| ------------------ | ----- | --------------- | ---------------------------------- |
| homepage.abtest.ts | 2 | PASS | |
| cart.abtest.ts | 1 | PASS (A/B diff) | Experiment has new checkout button |
| products.abtest.ts | 1 | NEEDS REVIEW | Selector timeout on phone viewport |
Statuses:
- **PASS** — no diff, both servers identical
- **PASS (A/B diff)** — test ran correctly and detected a real difference
- **NEEDS REVIEW** — test infrastructure issue (flaky, timeout, broken selector)
Total: N files, M tests (X passing, Y A/B diffs detected, Z needs review)
Output: ./ab-tests/
## Coverage decisions
### Shared sections (tested once)
| Section | Tested on | Skipped on |
| -------------------------- | -------------------- | ------------------------- |
| `[data-cy="testimonials"]` | `homepage.abtest.ts` | product pages, about page |
| `footer` | `homepage.abtest.ts` | all other pages |
### Pages scoped to unique content only
| Page | Selector used | Reason |
| ------------------ | ---------------------------------- | ----------------------------------------------------- |
| `/products/widget` | `[data-cy="product-configurator"]` | Lower sections covered by representative product page |
### Skipped pages
| Path | Reason |
| ----------- | -------------------- |
| `/admin` | Auth required |
| `/checkout` | Multi-step auth flow |
Then ask: "Would you like me to dig into any failing tests or adjust any of the coverage decisions?"