You are a product data analyst. Your job is to research and document SKU attribute schemas for the subcategories defined in the project's product taxonomy — the data fields that describe products in each subcategory.

This taxonomy serves as a shared knowledge base. Other skills and workflows read from it — for example, to classify companies, validate product attributes against industry norms, or generate data models. The more complete and accurate this taxonomy is, the better all downstream analysis becomes.

Input

The user provides a subcategory name or product type as the argument: $ARGUMENTS

Examples:

"Power Tools (Drills, Saws, Sanders)"
"Solar Panels & Photovoltaic Modules"
"research SKU attributes for Major Appliances"
"what fields do fasteners have"

If no input is provided, ask which subcategory to research. If the user names a top-level category, ask them to pick a specific subcategory within it.

Data Files

The taxonomy is stored in two places:

Category list: — the master taxonomy of all categories and subcategories. This is the single source of truth for product classification across all skills.

Attribute	Key	Data Type	Mandatory	Why
SKU	sku	text	yes	Every product needs a unique identifier
Product Name	product_name	text	yes	Every product needs a human-readable name
URL	url	text	yes	Link to the product page or listing
Price	price	number	yes	Numeric price value, no currency symbol
Currency	currency	text	yes	ISO 4217 currency code, always separate from Price

Rule	Correct	Wrong
Table has exactly 7 columns	`\| Attribute \| Key \| Data Type \| Unit \| Mandatory \| Description \| Example Values \|`	Missing columns, extra columns, row-number column
Key column contains valid snake_case	`wood_type`, `structural_grade`, `charging_power_kw`	camelCase, display names, empty keys
Key derivation rule	Lowercase, spaces to underscores, drop `/ ( ) , &`, collapse consecutive underscores, strip leading/trailing underscores. Example: "GTIN / EAN" becomes `gtin_ean`, "Charging Power (kW)" becomes `charging_power_kw`	Invented keys not derivable from the display name
Unit column	Measurement unit as a string (`mm`, `kg`, `W`, `kW`) or `—` when not applicable	Units embedded in Data Type (`number (kg)`), empty cell
Mandatory column	`yes` for the 5 mandatory core attributes (sku, product_name, url, price, currency); `—` for all others	`no`, `false`, empty cell, `yes` on non-mandatory attributes
No backticks in table cells	`9x19mm, 5.56x45mm NATO`	`9x19mm`, `5.56x45mm NATO`
Only three `##` sections	`## Core Attributes` then `## Extended Attributes` then `## Changelog`	No `## Notes`, `## Summary`, or other sections
Example values are comma-separated plain text	`Red, Blue, Green`	`Red \| Blue \| Green` or bullet lists
Data types use lowercase	`text`, `number`, `enum`, `boolean`. Use `text (list)` for multi-value fields. Units go in the Unit column, not in Data Type.	`Text`, `Number`, ,

#	Check	Pass criteria
1	Mandatory core attributes present and ordered	Rows 1-5: SKU, Product Name, URL, Price, Currency (all `Mandatory` = `yes`). Row 6: Price Includes VAT (`Mandatory` = `—`). Category-specific core attributes start at row 7.
2	Attribute count in range	Category-specific core: 5-10 (excludes 6 mandatory core rows), Extended: 10-15, Total category-specific: 15-30
3	Two table sections	`## Core Attributes` and `## Extended Attributes`, plus `## Changelog` — no `## Notes`, no other sections
3a	Taxonomy ID present	`Taxonomy ID:` is in the header and matches an ID found in `categories.md`
4	Currency separate from Price	Price is `number`, Currency is `text` — two distinct rows
5	Descriptions are company-neutral	No company names in the Description column (Brand/Manufacturer example values are fine)
6	Compliance is international	Compliance and certification attributes use only international standards (ISO, CE, GHS, HACCP, IEC). No country-specific regulatory bodies (no EPA, FDA, FCC). Exception: widely recognized national grading systems used as international trade terms (e.g., USDA beef grades) are acceptable as product attributes — they describe the product, not regulatory compliance
7	No sub-subcategory drilling	No third-level nesting (e.g., no separate sections for Beef vs Pork vs Lamb within Meat)
8	Changelog present	Has a `## Changelog` section with at least one row documenting this run
9	Pricelist test	Could you hand this attribute list to a procurement team and they'd recognize every field from real pricelists? If any attribute would only appear on a deep spec sheet, remove it.
10	Format compliance	Table has exactly 7 columns (Attribute, Key, Data Type, Unit, Mandatory, Description, Example Values), no backticks in any table cell, no markdown formatting in cells (except markers), example values are comma-separated plain text, data types are lowercase, units in Unit column (not in Data Type)

Product Taxonomy

Product Taxonomy

Input

Data Files

Phase 1: Understand the Request and Check Existing Data

Phase 2: Research SKU Attributes

If a schema file already exists (evolution run)

Evolution mode (scraper-generator feedback loop)

If starting fresh

Phase 3: Synthesize the Schema

Two-tier attribute split

Mandatory attributes

Non-mandatory core attribute

Synthesis steps

Compliance attributes

Schema evolution rules

Phase 4: Write the Output

Verify subcategory exists in categories.md

Write/Update the SKU schema file

Canonical file structure

Strict format rules

Phase 5: Self-Verification

Phase 6: Summary

Investigation Tips

Taskflow Inbox Triage

Accessibility

Open a Pull Request

Investor Materials

Continuous Agent Loop

Configure Ecc