Name: Actionbook Scraper
Author: actionbook

Actionbook Scraper Skill

⚠️ CRITICAL: Two-Part Verification

Every generated script MUST pass BOTH checks:

Check	What to Verify	Failure Example
Part 1: Script Runs	No errors, no timeouts	`Selector not found`
Part 2: Data Correct	Content matches expected	Extracted "Click to expand" instead of name

┌─────────────────────────────────────────────────────┐
│   1. Generate Script                                │
│          ↓                                          │
│   2. Execute Script                                 │
│          ↓                                          │
│   3. Check Part 1: Script runs without errors?      │
│          ↓                                          │
│   4. Check Part 2: Data content is correct?         │
│      - Not empty                                    │
│      - Not placeholder text ("Loading...")          │
│      - Not UI text ("Click to expand")              │
│      - Fields mapped correctly                      │
│          ↓                                          │
│      ┌───┴───┐                                      │
│   BOTH Pass  Either Fails                           │
│      │           │                                  │
│      │           ↓                                  │
│      │       Is it an Actionbook data issue?        │
│      │           │                                  │
│      │       ┌───┴───┐                              │
│      │      Yes      No                             │
│      │       │       │                              │
│      │       ↓       ↓                              │
│      │    Log to   Fix script                       │
│      │    .actionbook-issues.log                    │
│      │       │       │                              │
│      │       └───┬───┘                              │
│      │           ↓                                  │
│      │       Retry (max 3x)                         │
│      ↓                                              │
│   Output Script                                     │
└─────────────────────────────────────────────────────┘
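
The flow above can be sketched as a small retry loop. This is a minimal illustration, not the skill's actual implementation: `verifyWithRetry` and every callback on `steps` (`generate`, `execute`, `validate`, `isActionbookIssue`, `logIssue`, `fix`) are hypothetical names supplied by the caller. Two further assumptions: "Retry (max 3x)" is read as one initial attempt plus up to three retries, and after an Actionbook data issue is logged the sketch regenerates the script, since the diagram does not say what the retry should change in that case.

```javascript
// Sketch only: all callbacks on `steps` are hypothetical, not Actionbook APIs.
async function verifyWithRetry(steps, maxRetries = 3) {
  let script = await steps.generate();

  for (let attempt = 0; attempt <= maxRetries; attempt++) {
    let failure = null;
    try {
      // Part 1: the script must run without errors or timeouts.
      const data = await steps.execute(script);
      // Part 2: the extracted data must be correct.
      const check = steps.validate(data);
      if (check.ok) {
        return { ok: true, script, data, attempts: attempt + 1 };
      }
      failure = { part: 2, detail: check.problems };
    } catch (error) {
      failure = { part: 1, detail: String(error.message ?? error) };
    }

    // Out of retries: report the last failure instead of outputting the script.
    if (attempt === maxRetries) {
      return { ok: false, script, failure, attempts: attempt + 1 };
    }

    if (steps.isActionbookIssue(failure)) {
      // Assumed to append to .actionbook-issues.log.
      await steps.logIssue(failure);
      // Assumption: regenerate the script after logging.
      script = await steps.generate();
    } else {
      script = await steps.fix(script, failure);
    }
  }
}
```

The script is only returned as verified when both parts pass in the same attempt; a Part 1 pass followed by a Part 2 failure still counts as a failed attempt.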

Check	What to Verify	Failure Action
1. Script Runs	No errors, no timeouts	Fix syntax/selector errors
2. Data Correct	Content matches expected fields	Fix extraction logic

Rule	Example Failure	Fix
Fields not empty	`name: ""`	Check selector targets correct element
No placeholder text	`name: "Loading..."`	Add wait for dynamic content
No UI text	`name: "Click to expand"`	Extract after expanding, not button text
Correct data type	`year: "View Details"`	Selector targets the wrong element; fix the field mapping
Reasonable count	Expected ~100, got 3	Add scroll/pagination handling
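
The data rules above could be checked with a small validator like the following sketch. The function name `validateRecords`, the option names, the two text lists, and the 50% count threshold are all assumptions chosen for illustration; the skill does not define them.

```javascript
// Hypothetical helper: text lists and thresholds are illustrative assumptions.
const PLACEHOLDER_TEXT = ["loading...", "loading", "please wait"];
const UI_TEXT = ["click to expand", "view details", "show more"];

function validateRecords(records, options) {
  const { requiredFields, patterns = {}, expectedCount, minRatio = 0.5 } = options;
  const problems = [];

  records.forEach((record, index) => {
    for (const field of requiredFields) {
      const value = String(record[field] ?? "").trim();
      const lower = value.toLowerCase();
      if (value === "") {
        problems.push({ index, field, rule: "empty" });
      } else if (PLACEHOLDER_TEXT.includes(lower)) {
        problems.push({ index, field, rule: "placeholder" });
      } else if (UI_TEXT.includes(lower)) {
        problems.push({ index, field, rule: "ui-text" });
      } else if (patterns[field] && !patterns[field].test(value)) {
        // The value is real text but has the wrong shape for this field.
        problems.push({ index, field, rule: "type" });
      }
    }
  });

  // Flag a suspiciously small result set (likely missing scroll/pagination).
  if (expectedCount !== undefined && records.length < expectedCount * minRatio) {
    problems.push({ rule: "count", expected: expectedCount, actual: records.length });
  }

  return { ok: problems.length === 0, problems };
}
```

A call such as `validateRecords(rows, { requiredFields: ["name", "year"], patterns: { year: /^\d{4}$/ }, expectedCount: 100 })` returns an `ok` flag plus a list of problems, each tagged with the rule it violated, so the matching fix from the table can be looked up.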

Operation	Primary Tool	Fallback	Notes
Find selectors for URL	`search_actions`	None	Search by domain/keywords
Get full selector details	`get_action_by_id`	None	Use action_id from search
List available sources	`list_sources`	`search_sources`	Browse all indexed sites
Generate agent-browser script	Agent (sonnet)	-	Default mode for /generate
Generate Playwright script	Agent (sonnet)	-	Use --standalone flag
Structure analysis	Agent (haiku)	-	Parse Actionbook response
Request new website	`agent-browser`	Manual	Submit to actionbook.dev (ONLY command that executes agent-browser)

Step	Action
1	Generate script with Actionbook selectors
2	Execute script to verify it works
3	If failed: analyze error, fix script, go to step 2
4	If success: output verified script + data preview

Error	Example	Fix
Extracted button text	`name: "Click to expand"`	Extract content after expanding
Extracted placeholder	`desc: "Loading..."`	Add wait for dynamic content
Empty fields	`name: ""`	Fix selector
Wrong field mapping	`year: "San Francisco"`	Fix selector for each field
Too few items	Expected 100, got 3	Add scroll/pagination

Command	Description	Agent
`/actionbook-scraper:analyze <url>`	Analyze page structure and show available selectors	structure-analyzer
`/actionbook-scraper:generate <url>`	Generate agent-browser scraper script	code-generator
`/actionbook-scraper:generate <url> --standalone`	Generate Playwright/Puppeteer script	code-generator
`/actionbook-scraper:list-sources`	List websites with Actionbook data	-
`/actionbook-scraper:request-website <url>`	Request new website to be indexed (uses agent-browser)	website-requester

Indicator	Page Type	Template
Scroll to load more	Dynamic/Infinite	playwright-js (with scroll)
Click to expand	Card-based	playwright-js (with click)
Pagination links	Paginated	playwright-js (with pagination)
Static content	Static	puppeteer or playwright
SPA framework detected	SPA	playwright-js (network idle)
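
As a rough illustration, the indicator table can be expressed as a lookup. The flag names (`infiniteScroll`, `clickToExpand`, `paginationLinks`, `spaFramework`) and the function `pickTemplates` are hypothetical; how the analyzer actually detects each indicator is not described here. The sketch returns every matching row rather than choosing one, because the table does not state a priority when several indicators apply.

```javascript
// Illustrative mapping of the table above; flag names are assumptions.
const TEMPLATE_RULES = [
  { flag: "infiniteScroll", pageType: "Dynamic/Infinite", template: "playwright-js", option: "scroll" },
  { flag: "clickToExpand", pageType: "Card-based", template: "playwright-js", option: "click" },
  { flag: "paginationLinks", pageType: "Paginated", template: "playwright-js", option: "pagination" },
  { flag: "spaFramework", pageType: "SPA", template: "playwright-js", option: "network idle" },
];

function pickTemplates(indicators) {
  const matches = TEMPLATE_RULES.filter((rule) => indicators[rule.flag]);
  if (matches.length === 0) {
    // No dynamic indicator detected: treat the page as static content.
    return [{ pageType: "Static", template: "puppeteer or playwright", option: null }];
  }
  return matches.map(({ pageType, template, option }) => ({ pageType, template, option }));
}
```

For example, a page with both `infiniteScroll` and `clickToExpand` set yields two entries, which suggests the generated script needs both scroll and click handling.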
