Category Inference | Skills Pool

Category Inference | Skills Pool

list_distinct_merchants

category (explicit) > inferred_category (LLM-assigned) > uncategorized

Label	Examples
`groceries`	Supermarkets, grocery delivery, wholesale clubs (Costco, Sam's Club)
`dining`	Restaurants, cafes, bars, food delivery (Uber Eats, DoorDash, Grubhub)
`transport`	Gas stations, ride-share (Uber, Lyft), parking, tolls, public transit, auto services
`subscriptions`	Streaming (Netflix, Spotify, Disney+), SaaS, news, cloud storage
`entertainment`	Movie theaters, concerts, events, gaming, amusement parks
`utilities`	Electricity, gas, water, internet, mobile phone, waste collection
`healthcare`	Pharmacies, doctors, dentists, labs, insurance premiums, gym (when wellness-focused)
`shopping`	Retail (Amazon, Target, department stores), online marketplaces, home goods
`travel`	Hotels, airlines, travel agencies, car rentals, vacation bookings
`education`	Tuition, course platforms (Coursera, Udemy), textbooks, tutoring
`personal`	Haircuts, beauty, spa, laundry, personal care not covered elsewhere
`other`	Anything that does not fit the above — use sparingly; prefer a specific label

result = list_distinct_merchants(min_count=1)

memory_search(
    query="merchant category inference",
    tags=["merchant-category"],
    limit=100,
)

memory_recall(topic="merchant_category:Netflix")

bulk_update_transactions(updates=[
    {
        "match": {"merchant_pattern": "Netflix%"},
        "set": {"inferred_category": "subscriptions"},
    },
    {
        "match": {"merchant_pattern": "Whole Foods%"},
        "set": {"inferred_category": "groceries"},
    },
    # ... one entry per known merchant→category mapping
])

bulk_update_transactions(updates=[
    {
        "match": {"merchant_pattern": "Trader Joe%"},
        "set": {"inferred_category": "groceries"},
    },
    {
        "match": {"merchant_pattern": "UBER%"},
        "set": {"inferred_category": "transport"},
    },
    # ... one entry per merchant
])

Category inference complete.

Distinct merchants processed:  54
  Auto-applied (known mappings):  8  (from memory)
  LLM-categorized:               46

Transactions updated: 892

Breakdown by category:
  groceries      187  (21%)
  dining         162  (18%)
  subscriptions   94  (11%)
  shopping       201  (23%)
  transport       89  (10%)
  utilities       47   (5%)
  healthcare      38   (4%)
  entertainment   31   (3%)
  other           43   (5%)

Processed in 2 batches (1,100 distinct merchants total).

# Resolve or create the merchant entity first (follow memory-classification skill).