Guide users through running the message routing sub-pattern discovery analysis using data_probe. Use when the user wants to analyze support ticket messages, discover sub-patterns, run the message routing pipeline, configure a new analysis run, or interpret analysis results. Covers config setup, running the script, and interpreting outputs.
**PHI warning:** the input CSV and the per-message output CSVs (`classifications.csv`, `glp1_classifications.csv`, `glp1_discovery_classifications.csv`) contain real member messages (PHI). The reports and aggregates (`sub_pattern_report.md`, `string_matching_report.md`, `discoveries.json`, `validation.json`, `glp1_summary.json`, `glp1_discovery_batches.json`, `glp1_raw_batches.json`) contain only aggregated data (no PHI).

The pipeline discovers sub-patterns within each ticket category (e.g., "35% of Account/Billing is cancellation requests").
The analysis needs `omada_llm_utils` access. Ensure `.envrc.private` exists in the `data_probe` project root:

```sh
export ENV=staging
export STAGING_LLM_GATEWAY_ACCESS_ID=your_access_id
export STAGING_LLM_GATEWAY_HMAC_KEY=your_hmac_key
```

Then run `direnv allow` or `source .envrc.private` before executing.
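Before launching a run, it can help to confirm the credentials are actually exported. A minimal preflight sketch (the function name is illustrative, not part of the project):

```python
import os

def missing_env_vars(environ=os.environ):
    """Return the names of required gateway variables that are absent or empty."""
    required = [
        "ENV",
        "STAGING_LLM_GATEWAY_ACCESS_ID",
        "STAGING_LLM_GATEWAY_HMAC_KEY",
    ]
    return [name for name in required if not environ.get(name)]
```

If this returns a non-empty list, source `.envrc.private` before retrying.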
The input CSV must have these columns (configured in the YAML):

| Column | Role |
|---|---|
| `member_message_id` | Row identifier |
| `ticket_subject` | Category grouping key |
| `ticket_category` | `support` or `safety` |
| `member_message_text` | Primary text for the LLM |
| `coach_note_text` | Secondary text (coach's context) |
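A quick way to verify a candidate CSV has the required header before configuring a run, without reading any message content (PHI stays untouched; only the header row is inspected). This is an illustrative helper, not part of the pipeline:

```python
import csv

REQUIRED_COLUMNS = {
    "member_message_id",
    "ticket_subject",
    "ticket_category",
    "member_message_text",
    "coach_note_text",
}

def check_input_columns(path):
    """Read only the header row and return any missing required columns."""
    with open(path, newline="") as f:
        header = next(csv.reader(f))
    return REQUIRED_COLUMNS - set(header)
```

An empty return set means the file is ready to point the config at.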
The user runs these commands themselves in their terminal. Do not execute these.
```sh
cd /Users/jonathan.wrobel/workspace/data_probe
source .envrc.private
python scripts/run_message_routing.py --name "my_run_name" --top-n 4
```
```sh
cd /Users/jonathan.wrobel/workspace/data_probe
source .envrc.private

# Step 1: Classify all messages as GLP-1 yes/no/possibly
python scripts/run_message_routing.py --mode classify-glp1 --name "glp1_v1"

# Step 2: Discover sub-issue types from "yes" messages (samples 300)
python scripts/run_message_routing.py --mode discover-glp1 --from-run "glp1_v1" --name "glp1_subs"

# Test run (limit to 100 messages)
python scripts/run_message_routing.py --mode classify-glp1 --name "test" --limit 100
```
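Step 2 samples down the "yes" classifications before discovery. In spirit, the default `--sample 300` behaves like the sketch below (the function name and seed are illustrative, not from the actual script):

```python
import random

def sample_for_discovery(message_ids, n=300, seed=42):
    """Down-sample 'yes' message IDs before sub-issue discovery.

    If there are fewer than n messages, all of them are used.
    """
    if len(message_ids) <= n:
        return list(message_ids)
    rng = random.Random(seed)
    return rng.sample(message_ids, n)
```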
| Flag | Purpose | Example |
|---|---|---|
| `--mode` | `discover`, `classify-glp1`, or `discover-glp1` | `--mode classify-glp1` |
| `--name` | Label the run (output subfolder name) | `--name "glp1_v1"` |
| `--top-n N` | Analyze only the top N categories by priority | `--top-n 4` |
| `--categories "Cat1" "Cat2"` | Analyze specific categories | `--categories "Scale - Coach Escalation"` |
| `--batch-size N` | Messages per LLM call (default: 25) | `--batch-size 15` |
| `--limit N` | Limit total messages processed (test runs) | `--limit 100` |
| `--from-run NAME` | For `discover-glp1`: source `classify-glp1` run | `--from-run "glp1_v1"` |
| `--sample N` | For `discover-glp1`: sample size (default: 300) | `--sample 300` |
| `--skip-validation` | Skip heuristic keyword testing (`discover` mode only) | `--skip-validation` |
| `--config PATH` | Use an alternate config file | `--config configs/custom.yaml` |
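The `--batch-size` flag controls how many messages go into each LLM call. Conceptually it is simple fixed-size chunking, as in this illustrative sketch (the 25 default mirrors the flag's default):

```python
def make_batches(messages, batch_size=25):
    """Chunk messages into groups of at most batch_size, one group per LLM call."""
    return [messages[i:i + batch_size] for i in range(0, len(messages), batch_size)]
```

Lowering `--batch-size` trades more LLM calls for fewer tokens per call, which is the usual fix for timeouts on large categories.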
Edit `configs/message_routing.yaml` to change:

- `input.base_path` and `input.files` — point to the actual data CSV
- `llm.target_categories` — reorder or filter categories
- `llm.batch_size` — adjust for token limits
- `output.directory` — base output path

Outputs for `--mode discover`:

```
output/message_routing/<run_name>/
├── sub_pattern_report.md      # SME-facing (safe to read)
├── string_matching_report.md  # Engineering-facing (safe to read)
├── classifications.csv        # Per-message detail (PHI — DO NOT READ)
├── discoveries.json           # Raw LLM discovery data (safe to read)
└── validation.json            # Keyword precision/recall data (safe to read)
```
Outputs for `--mode classify-glp1`:

```
output/message_routing/<run_name>/
├── glp1_classifications.csv          # Per-message with message text (PHI — DO NOT READ)
├── glp1_summary.json                 # Aggregate counts and percentages (safe to read)
├── glp1_raw_batches.json             # Raw LLM batch results (safe to read)
└── checkpoints/glp1_classification/  # Per-batch checkpoints for resume
```
Outputs for `--mode discover-glp1`:

```
output/message_routing/<run_name>/
├── glp1_discovery_classifications.csv  # Per-message with sub-category (PHI — DO NOT READ)
├── glp1_discovery_batches.json         # Raw LLM discovery results (safe to read)
└── checkpoints/glp1_discovery/         # Per-batch checkpoints for resume
```
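`glp1_summary.json` holds the PHI-free rollup of the classification run. The aggregation amounts to turning raw label counts into percentages, roughly like this sketch (field names are illustrative, not the file's actual schema):

```python
def summarize_counts(counts):
    """Convert raw yes/no/possibly counts into counts plus percentages."""
    total = sum(counts.values())
    return {
        label: {"count": n, "pct": round(100.0 * n / total, 1)}
        for label, n in counts.items()
    }
```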
The `sub_pattern_report.md` contains one table per category:
| # | Sub-Pattern | Count | % of Category | Description | SME Decision |
|---|---|---|---|---|---|
| 1 | Cancellation Request | 1,927 | 35.0% | Member wants to cancel | TBD |
The SME fills in the last column: **HOTL** (auto-route + auto-respond), **HITL** (auto-route, coach responds), or **Manual**.
The `string_matching_report.md` shows whether sub-patterns can be detected at runtime via keyword matching, without calling an LLM. A sub-pattern is "string-matchable" if its combined keywords achieve >= 90% precision and >= 80% recall.
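The string-matchability test is just precision/recall of substring matching scored against the LLM labels. A minimal sketch of the idea (names are illustrative; this is not the `heuristic_validator.py` implementation):

```python
def keyword_metrics(messages, labels, keywords):
    """Precision/recall of naive case-insensitive substring matching.

    labels[i] is True when the LLM assigned message i to the sub-pattern.
    """
    tp = fp = fn = 0
    for text, is_pattern in zip(messages, labels):
        hit = any(k.lower() in text.lower() for k in keywords)
        if hit and is_pattern:
            tp += 1
        elif hit and not is_pattern:
            fp += 1
        elif not hit and is_pattern:
            fn += 1
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall

def is_string_matchable(precision, recall):
    """The report's threshold: >= 90% precision and >= 80% recall."""
    return precision >= 0.90 and recall >= 0.80
```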
Module layout:

```
data_probe/analyses/message_routing/
├── __init__.py
├── formatters.py            # MessageRoutingFormatter — format_items() + format_items_with_category()
├── prompts.py               # Discovery, merge, and GLP-1 classification prompts
├── domain_context.py        # GLP-1 program context + signal phrases loader
├── glp1_signal_phrases.txt  # Signal phrases for GLP-1 detection (one per line)
├── heuristic_validator.py   # Keyword precision/recall testing
└── report_generator.py      # Reports: SME, string matching, classifications, GLP-1 summary
```
Key files:

- `scripts/run_message_routing.py` (three modes: `discover`, `classify-glp1`, `discover-glp1`)
- `configs/message_routing.yaml`
- `data_probe.core.llm_client.LLMClient` for LLM calls

Troubleshooting:

| Issue | Fix |
|---|---|
| `STAGING_LLM_GATEWAY_ACCESS_ID` missing | Create/source `.envrc.private` |
| LLM timeout on large categories | Reduce `--batch-size` or split with `--categories` |
| Output folder already exists | Contents are overwritten; use a new `--name` |
| 100% precision/recall everywhere | Dataset too small — sub-patterns need overlap to test false positives |