A/B test hypothesis generation and prioritization using the ICE framework (Impact, Confidence, Ease). Generates complete test briefs with control and variant descriptions. Use when user says "A/B test", "split test", "test ideas", "hypothesis", "experiment", or "what should I test".
Generate, prioritize, and document A/B test hypotheses for any page or funnel. Uses the ICE framework to rank opportunities by expected ROI. Produces ready-to-implement test briefs with control/variant descriptions, traffic estimates, and duration calculations.
Fetch and parse the target page with
`${CLAUDE_SKILL_DIR}/../cro/scripts/fetch_page.py` and
`${CLAUDE_SKILL_DIR}/../cro/scripts/parse_cro.py`. Extract all conversion-relevant elements.

Load `${CLAUDE_SKILL_DIR}/../cro/references/testing-framework.md`.
This provides hypothesis structure, ICE scoring guidance, sample size
calculators, and MDE (Minimum Detectable Effect) reference tables.

Every hypothesis MUST follow this structure:
If we [CHANGE — specific, actionable change to a specific element],
then [METRIC — primary metric] will [DIRECTION — increase/decrease] by [ESTIMATE — percentage or range],
because [REASON — psychological principle, data point, or best practice that supports this].
Good example:
If we change the CTA text from "Submit" to "Get My Free Quote" on the contact form, then form completion rate will increase by 10-20%, because specific, benefit-oriented CTA text reduces uncertainty about what happens after clicking (Clarity Principle) and first-person language increases ownership.
Bad example:
If we improve the CTA, conversions will go up.
The bad example is too vague. It does not specify WHAT changes, by HOW MUCH, or WHY it would work.
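The bracketed slots in the template above map naturally onto a small data structure. A minimal Python sketch (the class and field names are illustrative, not part of the skill's file layout):

```python
from dataclasses import dataclass

@dataclass
class Hypothesis:
    change: str     # specific, actionable change to a specific element
    metric: str     # primary metric
    direction: str  # "increase" or "decrease"
    estimate: str   # percentage or range, e.g. "10-20%"
    reason: str     # psychological principle, data point, or best practice

    def statement(self) -> str:
        """Render the canonical if/then/because hypothesis sentence."""
        return (f"If we {self.change}, then {self.metric} will "
                f"{self.direction} by {self.estimate}, because {self.reason}.")
```

Forcing every idea through this shape makes the vague "bad example" impossible to write: each field must be filled in before a statement exists.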
Score each hypothesis on three dimensions (1-10 scale):
**Impact:** How much will this move the primary conversion metric if the variant wins?
| Score | Definition | Example |
|---|---|---|
| 9-10 | Transformative | Redesigning the entire above-the-fold section |
| 7-8 | High | Rewriting the headline and value proposition |
| 5-6 | Moderate | Changing CTA button text and color |
| 3-4 | Low | Adding a trust badge near the CTA |
| 1-2 | Minimal | Changing font size or minor spacing |
Guidance: Impact depends on how many visitors see the element AND how central it is to the conversion decision. Above-the-fold headline changes impact nearly 100% of visitors. A footer change impacts only the 10% who scroll that far.
**Confidence:** How sure are we that this change will produce a positive result?
| Score | Definition | Basis |
|---|---|---|
| 9-10 | Near certain | Multiple case studies showing consistent results for this exact change |
| 7-8 | High | Strong theoretical basis + some case study evidence |
| 5-6 | Moderate | Established best practice but no direct evidence for this context |
| 3-4 | Low | Logical reasoning but no supporting data |
| 1-2 | Speculative | Gut feeling, novel idea, no precedent |
Guidance: Base confidence on:
- `${CLAUDE_SKILL_DIR}/../cro/references/psychology-principles.md`
- `${CLAUDE_SKILL_DIR}/../cro/references/conversion-benchmarks.md`

**Ease:** How easy is this to implement and deploy?
| Score | Definition | Effort |
|---|---|---|
| 9-10 | Trivial | Text change, color change, hide/show element. < 1 hour. |
| 7-8 | Easy | Copy rewrite, button redesign, add trust badge. < 4 hours. |
| 5-6 | Moderate | Layout change, new section, form restructure. 1-2 days. |
| 3-4 | Hard | New page design, multi-step form, dynamic content. 3-5 days. |
| 1-2 | Very hard | Full redesign, backend changes, new functionality. 1+ weeks. |
Priority = (Impact + Confidence + Ease) / 3
| Priority Score | Category |
|---|---|
| 7.0+ | Quick Win -- implement first |
| 5.0-6.9 | Standard Test -- plan and schedule |
| 3.0-4.9 | Strategic Bet -- high risk, potentially high reward |
| < 3.0 | Avoid -- not worth the effort |
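The priority formula and category thresholds above reduce to a few lines of Python (function names are illustrative):

```python
def priority(impact: float, confidence: float, ease: float) -> float:
    """Average the three ICE dimensions (each scored 1-10)."""
    return round((impact + confidence + ease) / 3, 1)

def categorize(score: float) -> str:
    """Map a priority score onto the backlog categories."""
    if score >= 7.0:
        return "Quick Win"
    if score >= 5.0:
        return "Standard Test"
    if score >= 3.0:
        return "Strategic Bet"
    return "Avoid"
```

For example, a CTA copy change scored I=6, C=7, E=9 averages 7.3 and lands in the Quick Win bucket.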
**Quick Wins:** Changes that are easy to implement, likely to win, and have meaningful impact. These should be tested FIRST.
Common quick wins: benefit-oriented CTA text, higher-contrast CTA buttons, headline rewrites, and adding a trust badge or testimonial near the CTA.
**Strategic Bets:** Bigger changes that could produce outsized results but carry more uncertainty. Worth testing after quick wins are exhausted.
Common strategic bets: redesigning the entire above-the-fold section, restructuring a long form into multiple steps, and new page layouts with dynamic content.
**Avoid:** Tests not worth running. Either impact is too small, confidence is too low, or implementation is too complex relative to the expected gain.
Common avoid tests: minor font-size or spacing tweaks, changes to rarely seen footer elements, and anything requiring backend work for a marginal expected lift.
Generate a detailed brief for each of the top 5-10 hypotheses.
### Test [NUMBER]: [TEST NAME]
**Hypothesis:**
If we [change], then [metric] will [direction] by [estimate],
because [reason].
**Primary Metric:** [e.g., Form completion rate, Click-through rate, Purchase conversion]
**Secondary Metrics:** [e.g., Bounce rate, Time on page, Scroll depth, Revenue per visitor]
**Guardrail Metrics:** [Metrics that should NOT decrease: e.g., Average order value, Customer satisfaction]
**ICE Score:** [I: X, C: X, E: X] = [Average]
**Control (Current State):**
[Describe exactly what exists now. Include current text, layout, design details.
Be specific enough that someone could recreate the current state.]
**Variant A:**
[Describe exactly what changes. Be specific: new text, new layout, new design.
Include mockup description or wireframe notes if applicable.]
**Variant B (optional):**
[If testing more than one variant, describe the second variation.]
**Expected Impact:** [Estimated lift percentage with reasoning]
**Traffic Estimate:**
- Monthly page visitors: [estimate or "ask client"]
- Required sample size per variant: [calculate using MDE tables in testing-framework.md]
- Estimated test duration: [weeks needed to reach significance]
- Minimum Detectable Effect (MDE): [smallest meaningful difference to detect]
**Statistical Requirements:**
- Confidence level: 95%
- Statistical power: 80%
- One-tailed or two-tailed: [recommendation with reasoning]
**Implementation Notes:**
- [Tool recommendation: Google Optimize successor, VWO, Optimizely, custom]
- [Technical requirements: CSS only? JS needed? Backend changes?]
- [QA considerations: mobile, cross-browser, edge cases]
**Risk Assessment:**
- [What could go wrong?]
- [Audience segments that might react differently?]
- [Seasonal or timing considerations?]
Before generating hypotheses, document the current state.
| Question | What to Document |
|---|---|
| What is the page? | Page type, business type, primary purpose |
| Primary conversion goal | What action should the visitor take? (Buy, sign up, submit form, call, etc.) |
| Secondary goals | Newsletter signup, social follow, content download, etc. |
| Current conversion elements | List all CTAs, forms, trust signals, and persuasion elements present |
| Key metrics to track | Primary metric, secondary metrics, guardrail metrics |
| Traffic level | Estimate monthly visitors (affects test duration and MDE) |
Systematically check each category for optimization opportunities.
| Category | What to Look For |
|---|---|
| Headlines | Unclear, generic, feature-focused, missing unique mechanism |
| CTAs | Generic text, low contrast, poor placement, too many competing CTAs |
| Trust | Missing testimonials, no security badges, no guarantees, no social proof |
| Forms | Too many fields, poor labels, no inline validation, generic submit button |
| Copy | Feature-heavy, low readability, no emotional triggers, no urgency |
| Visual hierarchy | Cluttered layout, unclear focus, CTA not prominent, poor whitespace |
| Mobile | Poor responsive behavior, small tap targets, CTA hidden below fold |
| Speed | Slow LCP, high CLS, render-blocking resources |
| Pricing | Confusing tiers, no anchor pricing, no risk reducer, no social proof near price |
Order matters. Some tests should run before others.
Testing sequence principles: run Quick Wins first to bank early gains, schedule Standard Tests next, and save Strategic Bets until the safer backlog is exhausted. Avoid running two tests that touch the same element at the same time, since their results will confound each other.
This is NOT a score of the page -- it is a score of the test plan itself. Self-assess the quality of the generated hypotheses.
| Criterion | Weight |
|---|---|
| Hypothesis specificity (actionable, measurable?) | 25% |
| ICE scoring accuracy (well-calibrated?) | 25% |
| Coverage (all categories examined?) | 20% |
| Prioritization logic (correct order?) | 15% |
| Brief completeness (enough detail to implement?) | 15% |
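The rubric above reduces to a weighted average. A sketch (the dictionary keys are illustrative shorthand for the five criteria):

```python
# Weights from the self-assessment rubric; they sum to 1.0.
WEIGHTS = {
    "hypothesis_specificity": 0.25,
    "ice_scoring_accuracy":   0.25,
    "coverage":               0.20,
    "prioritization_logic":   0.15,
    "brief_completeness":     0.15,
}

def plan_quality(scores: dict) -> float:
    """Weighted average of per-criterion scores (each 0-10)."""
    return round(sum(scores[k] * w for k, w in WEIGHTS.items()), 2)
```

A plan scoring 10 on every criterion yields 10.0; weak briefs or missed categories pull the total down in proportion to their weight.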
## CRO Test Plan: [URL]
**Business Type:** [Detected type]
**Primary Conversion Goal:** [Goal]
**Date:** [Current date]
**Hypotheses Generated:** [Total count]
**Quick Wins:** [Count] | **Standard Tests:** [Count] | **Strategic Bets:** [Count] | **Avoid:** [Count]
### Current State Summary
[2-3 sentences describing the current page and its conversion elements]
### Prioritized Test Backlog
| Rank | Test Name | ICE (I/C/E) | Priority | Category |
|------|-----------|-------------|----------|----------|
| 1 | [Name] | [X/X/X] = [Avg] | Quick Win | [Headlines/CTA/Trust/etc.] |
| 2 | [Name] | [X/X/X] = [Avg] | Quick Win | |
| ... | ... | ... | ... | ... |
### Quick Wins (Priority >= 7.0)
[List with brief descriptions]
### Standard Tests (Priority 5.0-6.9)
[List with brief descriptions]
### Strategic Bets (Priority 3.0-4.9)
[List with brief descriptions]
### Avoid List (Priority < 3.0)
[List with reasoning for why these are not worth testing]
### Detailed Test Briefs
[Full briefs for top 5-10 tests using the template above]
### Implementation Timeline
| Phase | Tests | Duration | Traffic Needed |
|-------|-------|----------|----------------|
| Phase 1 (Quick Wins) | [Test names] | [X weeks] | [estimate] |
| Phase 2 (Standard) | [Test names] | [X weeks] | [estimate] |
| Phase 3 (Strategic) | [Test names] | [X weeks] | [estimate] |
### Notes and Assumptions
[Any caveats, data limitations, or assumptions made during analysis]
Run related sub-skills first where available; their findings feed hypothesis generation:

- cro-page sub-skill first to identify conversion issues that inform hypotheses
- cro-copy sub-skill for detailed copy findings to fuel headline/CTA test ideas
- cro-ux sub-skill for UX issues that suggest interaction-based tests
- cro-forms sub-skill for form-specific test hypotheses
- cro-trust sub-skill for trust signal gaps that could be tested

Load these references:

- `${CLAUDE_SKILL_DIR}/../cro/references/testing-framework.md` for MDE tables, sample size calculators, and ICE calibration guidance
- `${CLAUDE_SKILL_DIR}/../cro/references/psychology-principles.md` for the "because" in hypotheses
- `${CLAUDE_SKILL_DIR}/../cro/references/conversion-benchmarks.md` for expected lift estimates
- `${CLAUDE_SKILL_DIR}/../cro/references/proven-tests-general.md` for universal tests. Additionally load `${CLAUDE_SKILL_DIR}/../cro/references/proven-tests-ecommerce.md` for ecommerce/Shopify sites or `${CLAUDE_SKILL_DIR}/../cro/references/proven-tests-b2b.md` for B2B/SaaS sites. Load only the domain-specific file that matches the detected industry.