Product research and comparison shopping assistant. Use this skill whenever the user wants to find, compare, or filter products on any online shopping platform (Amazon, Rakuten, Yahoo Shopping, IKEA, Costco, Taobao, JD, etc.). Triggers include "find me a product", "search for X on Amazon", "help me buy Y", "compare products", "I need a [product]", "search [platform] for", "shop for", "looking for something to buy", any mention of shopping criteria like price range, star ratings, product specifications, or when the user names a product category and an online store. Also triggers when the user changes filtering criteria mid-search (e.g., "relax the rating to 3.9", "what if we increase the budget"). This skill caches all search results in a local SQLite database so criteria changes can be answered instantly without re-searching.
This skill requires agent-browser for all web interaction. Before starting any search:
1. Check whether the agent-browser skill is available in your skill list.
2. If it is, use the skill tool to load agent-browser.
3. If it is not installed: npx skills add agent-browser, then load the skill after installation.

Do NOT proceed with any search steps until agent-browser is confirmed working. Quick check:
agent-browser --version
If the CLI itself is missing, install it: brew install agent-browser or npm i -g agent-browser, then agent-browser install to download the browser.
Online product research is tedious: you search, open dozens of tabs, compare specs, lose track of what you've already checked, and when criteria change, you start over. This skill solves that by:
User request
↓
[1] Clarify requirements (platform, criteria, budget, language)
↓
[2] Search platform via agent-browser → extract product list
↓
[3] Cache basic info in SQLite → immediately filter by rating/price/review_count → candidate list
↓
[4] Verify candidates only (parallel batch — visit product pages, extract specs, store page_text)
↓
[5] Present results (table + recommendation)
↓
[6] User changes criteria? → Re-query cached DB, only re-search if data insufficient
Before searching, always ask the user:
| Question | Why it matters |
|---|---|
| Platform | Which site to search (Amazon.co.jp, Rakuten, etc.) |
| Product type | What they're looking for |
| Budget | Price ceiling |
| Key physical/technical criteria | Weight, size, material, features — whatever matters for this product |
| Rating threshold | Minimum star rating (default: suggest 4.0+) |
| Review requirements | e.g., "must have Japanese reviews", minimum review count |
| Quantity needed | How many candidates to present (default: 10) |
| Site language | What language should the site display in? (default: infer from platform — e.g., Amazon.co.jp → Japanese, Amazon.com → English). If unclear, ask the user. |
Use the user's language throughout. If they write in Chinese, respond in Chinese. If Japanese, respond in Japanese.
Use agent-browser to search the target platform. Construct search queries using the user's product description + key criteria keywords in the platform's language.
agent-browser starts with a clean browser (no cookies, no language preferences). Many sites default to English even for local domains (e.g., Amazon.co.jp shows English to new visitors). Before any search, switch the site to the correct language.
Amazon language switch examples:
- Japanese: https://www.amazon.co.jp/-/ja/ or append ?language=ja_JP to any URL
- Chinese: https://www.amazon.co.jp/-/zh/ or ?language=zh_CN
- English: https://www.amazon.co.jp/-/en/ (usually the default)

For other platforms: look for a language/locale selector in the page header or footer, or set the Accept-Language header if supported.
Why this matters:
- page_text stored in product_pages will be in the site language — if it's English, Japanese keyword searches on the cached data won't work.

Search target: 100+ products. Broad coverage here means better filtering in Step 3 and less re-searching when criteria change. Aim for 100 unique products before proceeding. Fallback rules:
Search strategy:
Track the running count via the insert command's total_in_db output (UNIQUE(platform, product_id) deduplicates, so this count reflects unique products even when queries overlap). Stop paginating once you reach 100+.

Extraction pattern (Amazon example):
// Run via agent-browser eval to extract search results.
// The rating/price/review selectors below are typical Amazon result-page
// classes at time of writing — verify on the live page and adapt per platform.
Array.from(document.querySelectorAll('[data-asin]'))
  .filter(el => el.dataset.asin.length > 3)
  .map(el => ({
    product_id: el.dataset.asin,
    name: (el.querySelector('h2 span') || {}).textContent || '',
    // alt text like "4.3 out of 5 stars" / "5つ星のうち4.3" — grab the decimal
    rating: parseFloat((((el.querySelector('.a-icon-alt') || {}).textContent || '').match(/\d+\.\d+/) || [''])[0]) || null,
    // "¥12,800" / "$128.00" → numeric
    price: parseFloat((((el.querySelector('.a-price .a-offscreen') || {}).textContent || '')).replace(/[^0-9.]/g, '')) || null,
    // review-count link text, e.g. "1,234" — class varies by layout
    review_count: parseInt((((el.querySelector('.s-underline-text') || {}).textContent || '')).replace(/[^0-9]/g, ''), 10) || null,
    url: 'https://www.amazon.co.jp/dp/' + el.dataset.asin
  }))
Adapt the selectors for each platform. The goal: get product_id, name, rating, price, review_count, and URL from the search results page itself.
CRITICAL: Write extracted data to a JSON file, not to LLM context. Large product lists (50+ items) will be truncated or corrupted if passed through the LLM. Use this pattern:
// In agent-browser eval: extract products and write JSON file directly
const products = Array.from(document.querySelectorAll('[data-asin]'))
.filter(el => el.dataset.asin.length > 3)
.map(el => ({ /* extraction logic */ }));
// Write to temp file
require('fs').writeFileSync('/tmp/products_page1.json', JSON.stringify(products));
products.length; // return count only
Then insert via file:
python3 product_db.py insert --db <DB_PATH> --json-file /tmp/products_page1.json --platform amazon.co.jp
The insert command outputs received, inserted, skipped counts. If received is much less than what you extracted, the JSON file was corrupted. If skipped is high, products were duplicates from overlapping searches (expected).
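The received/inserted/skipped accounting falls out of SQLite's INSERT OR IGNORE against the UNIQUE(platform, product_id) constraint. A minimal sketch of the idea — table reduced to the key columns, function name hypothetical; the real logic lives in product_db.py:

```python
import sqlite3

def insert_products(conn, products, platform):
    """Insert search results; duplicates are skipped via UNIQUE(platform, product_id)."""
    cur = conn.cursor()
    inserted = 0
    for p in products:
        cur.execute(
            "INSERT OR IGNORE INTO products (platform, product_id, name, price) "
            "VALUES (?, ?, ?, ?)",
            (platform, p["product_id"], p.get("name"), p.get("price")),
        )
        inserted += cur.rowcount  # rowcount is 0 when the row already existed
    conn.commit()
    return {"received": len(products), "inserted": inserted,
            "skipped": len(products) - inserted}

conn = sqlite3.connect(":memory:")
conn.execute("""CREATE TABLE products (
    platform TEXT NOT NULL, product_id TEXT NOT NULL,
    name TEXT, price REAL,
    UNIQUE(platform, product_id))""")
batch = [{"product_id": "B001", "name": "Table A", "price": 12800},
         {"product_id": "B002", "name": "Table B", "price": 9800},
         {"product_id": "B001", "name": "Table A (dup)", "price": 12800}]
print(insert_products(conn, batch, "amazon.co.jp"))
# → {'received': 3, 'inserted': 2, 'skipped': 1}
```

This is why a high skipped count after overlapping searches is expected rather than an error.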
Store basic info (name, price, rating, review_count, URL) for all extracted products in SQLite. This is cheap — just search result data, no page visits needed. Then immediately filter to identify candidates for detailed verification.
Database directory: ~/.cache/smart-shopper/
Filename format: {YYYY-MM-DD}-{topic-slug}-{4hex}.db
Example: 2026-03-29-garden-table-set-a3f1.db
Every DB contains a _meta table with created_at, last_accessed, description, and search_queries (JSON array). Use this for auto-matching.
Start of skill invocation
│
├─ Already have DB_PATH in this conversation?
│ └─ YES → reuse it, skip discovery
│
├─ Run: product_db.py discover --query "user's search terms" --auto-cleanup
│
├─ 0 databases found → create new DB
│
├─ Results with match_score > 0?
│ ├─ Exactly 1 match → auto-select, tell user: "Continuing with [description]"
│ └─ Multiple matches → show list, ask user to pick
│
├─ All results have match_score = 0 (no keyword overlap)?
│ ├─ Any DB accessed in last 24h → show it as option, ask if related
│ └─ Otherwise → create new DB
│
└─ discover also prunes DBs not accessed in 30+ days
The key rule: never auto-select a DB without query matching. Recency alone is not enough — a "keyboard" search must not silently attach to yesterday's "chair" DB.
Auto-discovery command (also prunes >30 day old DBs):
python3 /path/to/smart-shopper/scripts/product_db.py discover --cache-dir ~/.cache/smart-shopper --query "user's search terms" --auto-cleanup
This returns matching DBs sorted by relevance score, and silently removes any DB not accessed in 30+ days.
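The relevance score can be as simple as keyword overlap between the new query and each DB's stored search_queries. A hypothetical sketch of that scoring — the actual algorithm is internal to product_db.py:

```python
def match_score(query, stored_queries):
    """Count keyword overlap between a new query and a DB's past search queries."""
    query_words = set(query.lower().split())
    stored_words = set()
    for q in stored_queries:
        stored_words.update(q.lower().split())
    return len(query_words & stored_words)

# A "keyboard" search scores 0 against yesterday's "chair" DB → never auto-selected.
print(match_score("mechanical keyboard", ["garden chair set"]))  # → 0
print(match_score("garden table set", ["garden chair set"]))     # → 2
```

A score of 0 is what routes the flow to the "no keyword overlap" branch above.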
python3 /path/to/smart-shopper/scripts/product_db.py create --topic "garden table set"
This generates a safe filename internally (sanitized slug + UTC date + random hex) and returns the path in JSON:
{"status": "ok", "db": "/Users/you/.cache/smart-shopper/2026-03-29-garden-table-set-a3f1.db"}
Never construct DB filenames in shell commands — always use the create command to avoid injection risks.
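A sketch of the kind of sanitization the create command performs internally (assumed behavior inferred from the filename format above; the real implementation is in product_db.py):

```python
import re
import secrets
from datetime import datetime, timezone

def db_filename(topic):
    """Build {YYYY-MM-DD}-{topic-slug}-{4hex}.db from an arbitrary topic string."""
    # Reduce the topic to lowercase ASCII words joined by hyphens, capped in length
    slug = re.sub(r"[^a-z0-9]+", "-", topic.lower()).strip("-")[:40]
    date = datetime.now(timezone.utc).strftime("%Y-%m-%d")
    return f"{date}-{slug}-{secrets.token_hex(2)}.db"

print(db_filename("Garden Table Set!"))  # e.g. 2026-03-29-garden-table-set-a3f1.db
```

Because shell metacharacters never survive the sanitization, the resulting path is safe to pass to later commands.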
Always reuse the same DB. Multiple product categories can coexist (tracked by search_query column).
Cleanup is built into the discover command — when --auto-cleanup is passed, any DB with last_accessed older than 30 days is automatically removed. No separate cleanup step needed.
Use the helper script to manage the database:
python3 /path/to/smart-shopper/scripts/product_db.py create --topic "garden table set"
python3 /path/to/smart-shopper/scripts/product_db.py insert --db <DB_PATH> --json-file /tmp/products.json --platform amazon.co.jp
python3 /path/to/smart-shopper/scripts/product_db.py query --db <DB_PATH> --min-rating 4.2 --max-price 100000
python3 /path/to/smart-shopper/scripts/product_db.py update-detail --db <DB_PATH> --product-id "B09T97Z57Y" --page-text-file /tmp/page.txt --specs '{"weight": "16.74kg", ...}'
python3 /path/to/smart-shopper/scripts/product_db.py update-detail --db <DB_PATH> --product-id "B09T97Z57Y" --verified
python3 /path/to/smart-shopper/scripts/product_db.py query --db <DB_PATH> --spec-filter "table_weight_kg>=12" --spec-match "material=wood"
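The --spec-filter syntax maps naturally onto SQLite's json_extract. A hedged sketch of how such a numeric expression could be translated (function name and operator set assumed; --spec-match string equality would be analogous):

```python
import re

def spec_filter_sql(expr):
    """Translate e.g. 'table_weight_kg>=12' into a json_extract clause + parameter."""
    m = re.fullmatch(r"(\w+)\s*(>=|<=|>|<|=)\s*(.+)", expr)
    if not m:
        raise ValueError(f"bad spec filter: {expr}")
    key, op, value = m.groups()
    # Keep NULLs: products missing the spec shouldn't be silently dropped.
    clause = (f"(json_extract(specs_json, '$.{key}') {op} ? "
              f"OR json_extract(specs_json, '$.{key}') IS NULL)")
    return clause, float(value)

print(spec_filter_sql("table_weight_kg>=12"))
```

Parameter binding (the `?`) keeps user-supplied values out of the SQL string itself.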
Schema:
CREATE TABLE products (
id INTEGER PRIMARY KEY AUTOINCREMENT,
platform TEXT NOT NULL,
product_id TEXT NOT NULL,
name TEXT,
price REAL,
price_raw TEXT, -- original text e.g. "¥12,800" / "$128.00"
currency TEXT DEFAULT 'JPY',
rating REAL,
review_count INTEGER,
search_rank INTEGER, -- position in search results (quality signal)
url TEXT,
search_query TEXT,
created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
specs_json TEXT,
verified INTEGER DEFAULT 0,
UNIQUE(platform, product_id)
);
CREATE TABLE product_pages ( -- separate table to keep products lean
product_id TEXT NOT NULL,
platform TEXT NOT NULL,
page_text TEXT, -- full innerText for later extraction
fetched_at TIMESTAMP,
PRIMARY KEY (platform, product_id)
);
CREATE TABLE _meta (key TEXT PRIMARY KEY, value TEXT);
Key design decisions:
- product_pages is a separate table so filter queries on products stay fast (no 100KB blobs scanned).
- price_raw preserves the original display text; price is the parsed numeric value for SQL comparison.
- search_rank captures result position — a free quality signal.
- specs_json stores extracted specifications as a JSON object (e.g., {"table_weight_kg": 16.74, "chair_weight_kg": 7.64, "material": "PS wood"}).
- UNIQUE(platform, product_id) prevents duplicates when searches overlap.
- The verified flag tracks which products have been individually checked.

Right after inserting products, filter using the data already available from search results. Do NOT visit any product pages yet — this filter uses only what's in the products table.
SELECT * FROM products
WHERE rating >= 4.2 -- replace with user's threshold
AND price <= 100000 -- replace with user's budget
AND review_count >= 10 -- default minimum; replace with user's if specified
ORDER BY rating DESC, review_count DESC;
Default minimum review count is 10 unless the user specifies otherwise. Products with fewer than 10 reviews have unreliable ratings and waste page visits.
This produces the candidate list — only these products proceed to Step 4 (page visits + detailed verification). The remaining products stay in the DB for potential re-query if the user relaxes criteria later, but their pages are never visited.
Note: criteria that can't be expressed in SQL (e.g., "Japanese reviews exist", "chair weight per component") require visiting the product page — these are checked in Step 4, not here.
If insufficient products match, tell the user how many matched and suggest criteria relaxation. Show the distribution: "Found 3 products at ≥4.2★. Relaxing to ≥3.9★ would add 4 more."
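The distribution message can be computed with two cheap COUNT queries against the cached data — a sketch (function name and thresholds hypothetical):

```python
import sqlite3

def relaxation_preview(conn, strict, relaxed, max_price):
    """Products passing the strict rating threshold, plus how many a relaxed one adds."""
    q = "SELECT COUNT(*) FROM products WHERE rating >= ? AND price <= ?"
    at_strict = conn.execute(q, (strict, max_price)).fetchone()[0]
    at_relaxed = conn.execute(q, (relaxed, max_price)).fetchone()[0]
    return at_strict, at_relaxed - at_strict

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE products (rating REAL, price REAL)")
conn.executemany("INSERT INTO products VALUES (?, ?)",
                 [(4.5, 90), (4.3, 80), (4.2, 70), (4.0, 60), (3.9, 50), (3.5, 40)])
matched, added = relaxation_preview(conn, 4.2, 3.9, 100)
print(f"Found {matched} products at ≥4.2★. Relaxing to ≥3.9★ would add {added} more.")
# → Found 3 products at ≥4.2★. Relaxing to ≥3.9★ would add 2 more.
```

No page visits are needed for this preview — it runs entirely on the cached search-result rows.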
Step 3 filtered using basic search-result data (rating, price, review count). But many user criteria — weight, material, dimensions, review language — are only available on the product page. Step 4 has three phases:
Example flow:
61 candidates (from Step 3) → batch page visits → extract specs → detailed filter (weight≥12kg, chair≥7kg) → 5 finalists → cross-verify + checklist
Visiting product pages is the biggest time bottleneck (~10-15 seconds per page). Two approaches are available — choose based on candidate count and environment stability.
Visit candidate pages sequentially in the parent agent. This is simpler, more reliable, and often faster than parallel sub-agents because it avoids browser-session conflicts and per-sub-agent startup overhead.
Candidate ordering (visit most-likely-qualifying products first):
Resume check — before visiting any pages, skip products already extracted:
SELECT product_id, url FROM products
WHERE specs_json IS NULL
AND rating >= ? -- Step 3 filter criteria
AND price <= ?
ORDER BY price DESC, rating DESC;
This makes the workflow resumable after network interruptions — just re-run and it picks up where it left off.
For each candidate:
1. agent-browser open {url} → wait for the page to load
2. Write the full page text to a temp file, then store it: python3 {script} update-detail --db {db} --product-id {id} --page-text-file /tmp/{id}_page.txt
3. Extract the required specs and store them: python3 {script} update-detail --db {db} --product-id {id} --specs '{...}'

Parallel sub-agents — use only when: (a) ≥ 20 candidates, (b) network is stable, (c) you've confirmed agent-browser supports parallel sessions.
CRITICAL: Each sub-agent MUST use a unique --session flag to avoid browser daemon conflicts:
agent-browser --session batch-1 open {url}
agent-browser --session batch-1 eval "..."
Batch strategy:
Sub-agent delegation template:
task(
category="unspecified-low",
load_skills=["agent-browser"],
run_in_background=true,
description="Extract product details batch N/M",
prompt="
TASK: Visit product pages and extract detailed specs. Data extraction only — do NOT make pass/fail judgments.
BROWSER SESSION: Use --session batch-{N} for ALL agent-browser commands to avoid conflicts with other agents.
Example: agent-browser --session batch-{N} open {url}
DB: {db_path}
SCRIPT: python3 {path/to/product_db.py}
PRODUCTS (batch N of M):
1. {product_id} | {url} | {name}
2. ...
SPECS TO EXTRACT (based on user criteria):
- {spec 1, e.g., table weight in kg}
- {spec 2, e.g., chair weight in kg}
- {spec 3, e.g., material type}
- ...
FOR EACH PRODUCT:
1. agent-browser --session batch-{N} open {url}
2. Scroll to load dynamic content (reviews, spec tables often lazy-load)
3. Extract full page text → write to a temp file, then store:
python3 {script} update-detail --db {db} --product-id {id} --page-text-file /tmp/{id}_page.txt
4. Extract ALL specs listed above from the structured spec table (not marketing copy)
5. Store specs in specs_json:
python3 {script} update-detail --db {db} --product-id {id} --specs '{...}'
WHEN DONE: Close your browser session:
agent-browser --session batch-{N} close
MUST NOT:
- Set verified=1 (that happens later, after detailed filtering)
- Skip storing data for any product — even if specs look bad (data is valuable for later criteria changes)
- Retry more than once if agent-browser fails on a product — skip and move on
- Use agent-browser WITHOUT --session batch-{N} (will conflict with other agents)
RETURN: For each product: product_id, extracted specs summary, any specs that were NOT found on the page.
"
)
After all sub-agents complete:
- Run agent-browser --session batch-{N} close for each batch.

Now that specs_json is populated from page visits, filter again using the detailed specs. This is the second filter — it uses product page data that wasn't available in Step 3.
SELECT p.*, pp.page_text FROM products p
LEFT JOIN product_pages pp ON p.product_id = pp.product_id AND p.platform = pp.platform
WHERE (json_extract(p.specs_json, '$.table_weight_kg') >= 12
OR json_extract(p.specs_json, '$.table_weight_kg') IS NULL)
AND (json_extract(p.specs_json, '$.chair_weight_kg') >= 7
OR json_extract(p.specs_json, '$.chair_weight_kg') IS NULL)
AND (json_extract(p.specs_json, '$.material') NOT LIKE '%cedar%'
OR json_extract(p.specs_json, '$.material') IS NULL)
ORDER BY p.rating DESC, p.review_count DESC;
The IS NULL clauses are critical — without them, products where the sub-agent couldn't extract a spec are silently dropped. These are exactly the products that need cross-verification in Step 4b.
This typically reduces the candidate set dramatically (e.g., 61 → 5). Only these finalists proceed to cross-verification and the full checklist.
If a spec couldn't be extracted (sub-agent reported it missing), don't discard the product — include it in the finalists for manual cross-verification in Step 4b.
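The NULL-preserving behavior is easy to see in a tiny self-contained example (schema reduced to the relevant columns):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE products (product_id TEXT, specs_json TEXT)")
conn.executemany("INSERT INTO products VALUES (?, ?)", [
    ("A", '{"table_weight_kg": 16.7}'),  # passes the threshold
    ("B", '{"table_weight_kg": 9.0}'),   # fails → dropped
    ("C", '{"material": "PS wood"}'),    # weight missing → kept for cross-verification
])
rows = conn.execute("""
    SELECT product_id FROM products
    WHERE json_extract(specs_json, '$.table_weight_kg') >= 12
       OR json_extract(specs_json, '$.table_weight_kg') IS NULL
""").fetchall()
print([r[0] for r in rows])  # → ['A', 'C']
```

json_extract returns NULL for a missing path, and NULL >= 12 is not true — so without the IS NULL arm, product C would vanish instead of reaching Step 4b.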
Report the funnel to the user:
"61 candidates → page visits complete → detailed filter (weight, material) → 5 finalists"
Only for the small set of finalists from Step 4a, and only when a critical spec is missing or ambiguous.
When to cross-verify:
How to cross-verify safely:
"manufacturer name" + "model number" + "specs" or the equivalent in the relevant languagespecs_json in the DB and note the sourceDo NOT hardcode any specific verification site (e.g., Kakaku.com is Japan-only). Use whatever search capability the agent currently has. The model number is the key — it's globally unique and searchable on any search engine.
Identity matching rules:
Do NOT cross-verify when:
Only if the user's criteria involve something that can't be verified from text alone (e.g., "can this chair really lie flat?" or "what does the design look like?"). Don't screenshot by default — it's slow. If you do screenshot, use agent-browser screenshot + look_at tool to analyze the image.
After all data is collected (page extraction, detailed filtering, cross-verification), go through EVERY user criterion one by one for each finalist before marking it as verified. For each criterion:
- [direct] = read from the page, [cross-verified] = from an external source, [estimated] = derived/calculated (must resolve via 4b first), [unverified] = could not confirm

Only set verified = 1 when ALL user criteria show [direct] or [cross-verified]. Products with any [estimated] or [unverified] criteria must NOT be marked verified — either resolve them via cross-verification or present them to the user with the tag visible.
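The gate on verified = 1 reduces to a one-line check over the per-criterion tags — a sketch, with the tag names taken from the list above and the function name hypothetical:

```python
def can_mark_verified(criteria_tags):
    """verified=1 only when every user criterion is [direct] or [cross-verified]."""
    return all(tag in ("direct", "cross-verified") for tag in criteria_tags.values())

print(can_mark_verified({"weight": "direct", "material": "cross-verified"}))  # → True
print(can_mark_verified({"weight": "direct", "recline": "estimated"}))        # → False
```

A single [estimated] tag is enough to block verification, which is what forces the Step 4b resolution.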
Criterion type → verification method:
| Criterion Type | Examples | How to Verify |
|---|---|---|
| Numeric spec | weight ≥ 7kg, price ≤ ¥10k | Read the exact value from a structured spec table on the product page. If only a related value exists (e.g., total weight but not per-component), treat the specific value as missing → cross-verify (4b). |
| Review quality | "Japanese reviews exist", "≥10 reviews" | Open the review section of the product page (scroll/click to load it). Count reviews matching the requirement. A star rating or review count alone does NOT satisfy a review-language requirement. |
| Visual/behavioral | "fully flat recline", "fits 4 people" | Screenshot verification (4c), or find an explicit spec value (e.g., "recline angle: 180°"). Product title claims do not count. |
| Material/durability | "no cedar", "waterproof" | Read material from the spec table. Assess durability based on the material type identified. |
| Availability | "Prime eligible", "in stock" | Check the specific section on the product page. |
Use a Markdown table with the most relevant columns for this specific product search. Always include:
After the table, add a brief recommendation section organized by use case, written in the user's language — for example:

| Priority | Pick | Why |
|---|---|---|
| Value for money | ① Product name | Cheapest option that clears all criteria |
| Quality | ③ Product name | Highest rating + large review count |
When the user changes criteria:
- Products whose pages were never visited (no row in product_pages): run Step 4 batch extraction for them.
- Products already visited but with specs not yet confirmed (product_pages exists, specs_json is NULL): skip extraction, go directly to cross-verification (Step 4b).

This is the key advantage of the caching design: what previously required 20 minutes of re-searching now takes 2 seconds.
If the user asks for 10 products but only 6 genuinely meet all criteria, say so. Don't pad the list with products that don't qualify. Suggest criteria relaxation instead.
Product titles and bullet points often exaggerate. "180° fully flat" might be 150°. "Lightweight" might be 11kg. When a specific claim matters to the user, verify it on the actual product page's structured specification table, not from marketing copy or search results summaries.
When a specification determines whether a product meets user criteria, the value MUST come from direct reading on a product page spec table, manufacturer spec sheet, or cross-verified external source.
Never derive a user-critical spec through calculation. Example: if the page shows total weight (34kg) and table weight (18kg), do NOT calculate bench weight as (34-18)/2 = 8kg. Instead, treat bench weight as "not directly stated" and trigger cross-verification (Step 4b).
Why this matters: mathematical derivation introduces compounding errors (shipping weight vs. product weight, accessories included vs. excluded, rounding). A derived value that passes a threshold by a small margin (8kg vs. 7kg requirement) is especially unreliable.
When presenting results, briefly note how many total products were found, how many passed each filter, and how many survived to the final list. This helps the user understand the market and make informed decisions about relaxing criteria.
Example:
Searched 3 keyword variations, paginated 4-5 pages each → 127 products cached. After filtering (★≥4.2, ¥≤100,000, reviews≥10): 31 remain. After weight verification (table≥12kg, chair≥7kg): 5 qualify.
Amazon.co.jp / .com:
- Product IDs appear in [data-asin] attributes on search result items.
- Canonical product URL: https://www.amazon.co.jp/dp/{ASIN}

Rakuten:
Other platforms:
Searching shopping sites involves many things that can go wrong. Handle them gracefully instead of retrying blindly.
Shopping platforms aggressively block automation. If you encounter a CAPTCHA, access-denied page, or unusual redirect:
- SQLite "database is locked": the helper script already sets a busy_timeout. If you still see this error during Step 4 parallel extraction, do NOT create a new DB — retry once after 5 seconds. Only create a new DB if the error persists outside of parallel extraction (e.g., during initial setup).
- Corrupted DB file (sqlite3.DatabaseError): skip that DB in discovery and inform the user. Don't crash the entire workflow.
- Browser failures: if agent-browser crashes or times out, try once more. If it fails again, close the browser (agent-browser close) and inform the user.