Scaling readiness: load test, perf budgets, caching, capacity planning. Use for /scale, "load test", "can this handle more users", "performance", "caching strategy", or any scaling question. Trigger liberally.
On first invocation, read references/orchestrator.md and follow its welcome protocol.
Assess and improve an app's ability to handle growth. Covers load testing,
performance budgets, caching strategy, query optimization, CDN config, and
capacity planning. Produces a scaling readiness report with a concrete upgrade
path from current capacity to target capacity.
/scale — Command Reference
SCALE OPS COMMANDS
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
ASSESS
/scale audit Full scaling readiness assessment
/scale budget Set or audit performance budgets
/scale bottleneck Identify the #1 scaling bottleneck
TEST
/scale loadtest Run load test against target URL
/scale benchmark Benchmark specific endpoints
OPTIMIZE
/scale cache Design or audit caching strategy
/scale queries Audit and optimize database queries
/scale cdn CDN and edge optimization
PLAN
/scale plan Capacity plan from current → target users
/scale cost Cost projection at target scale
UTILITIES
/checkpoint Show checkpoint status
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Type any command to begin. /scale to see this again.
Related skills
Scaling Readiness Assessment (/scale audit)
Comprehensive evaluation of how the app handles load. Produces a readiness
score and prioritized optimization list.
```typescript
// D1 has no built-in slow query log, but you can time queries in code.
// Wrap queries with timing:
async function timedQuery<T>(
  db: D1Database,
  sql: string,
  params: any[],
): Promise<{ result: T; ms: number }> {
  const start = performance.now();
  const result = await db.prepare(sql).bind(...params).all();
  const ms = performance.now() - start;
  if (ms > 50) console.warn(`Slow query (${ms.toFixed(0)}ms): ${sql.substring(0, 100)}`);
  return { result: result.results as T, ms };
}
```
Common Query Optimizations

| Problem | Detection | Fix |
|---|---|---|
| Missing index | Query on column without index, table > 1K rows | `CREATE INDEX idx_table_col ON table(col)` |
| N+1 queries | Loop with DB call inside (fetch list, then fetch detail per item) | Use JOIN or batch query |
| `SELECT *` | Fetching all columns when only 2-3 needed | List specific columns |
| Unbounded query | No LIMIT on list queries | Add pagination (LIMIT + OFFSET or cursor) |
| Repeated queries | Same query called multiple times per request | Cache result in request context |
| Missing compound index | WHERE on multiple columns, each indexed separately | `CREATE INDEX idx_tbl_a_b ON tbl(a, b)` |
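Of these, N+1 queries are the most common in practice. A minimal sketch of the fix, replacing a per-item loop with one batched `IN (...)` query (table and column names below are illustrative, not from a real schema):

```typescript
// Sketch: replace an N+1 loop with one batched query.
//
// N+1 (slow): one round trip per user
//   for (const id of userIds) {
//     await db.prepare("SELECT * FROM posts WHERE user_id = ?").bind(id).all();
//   }
//
// Batched (fast): one round trip covering all users
function batchedPostsQuery(userIds: number[]): { sql: string; params: number[] } {
  // One "?" placeholder per id, bound positionally
  const placeholders = userIds.map(() => "?").join(", ");
  return {
    sql: `SELECT user_id, id, title FROM posts WHERE user_id IN (${placeholders})`,
    params: userIds,
  };
}
```

After the single query returns, group rows by `user_id` in memory to rebuild the per-item shape the loop produced.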
EXPLAIN for D1

```sql
-- D1 supports EXPLAIN QUERY PLAN
EXPLAIN QUERY PLAN SELECT * FROM sessions WHERE user_id = ? AND created_at > ?;
-- Look for: SCAN TABLE (bad) vs SEARCH TABLE USING INDEX (good)
```
|  | Current deployment config → infrastructure baseline |
| integrations-engineer | API rate limits → external constraints |
| Read by | Why |
|---|---|
| deploy-ops | Readiness score → deploy confidence at scale |
| code-auditor | Performance budgets → /audit perf thresholds |
| app-architect | Cost projections → spec cost estimates |
Principles
Measure before optimizing. Load test first. The bottleneck is rarely where
you think it is.
Cache the read path, optimize the write path. Most apps are 90% reads.
Caching reads buys the most headroom with the least complexity.
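As a concrete illustration of caching the read path, here is a minimal in-memory TTL cache sketch. The class, method names, and TTL are assumptions for illustration, not an existing API; in a Workers deployment you would more likely reach for the Cache API or KV:

```typescript
// Minimal read-path cache sketch: serve repeat reads from memory until a TTL expires.
type Entry<T> = { value: T; expires: number };

class TTLCache<T> {
  private store = new Map<string, Entry<T>>();
  constructor(private ttlMs: number) {}

  async getOrFetch(key: string, fetcher: () => Promise<T>): Promise<T> {
    const hit = this.store.get(key);
    if (hit && hit.expires > Date.now()) return hit.value; // cache hit: skip origin
    const value = await fetcher(); // cache miss: read through to the origin/DB
    this.store.set(key, { value, expires: Date.now() + this.ttlMs });
    return value;
  }
}
```

The write path stays uncached: writes go straight to the database, and the worst case on a stale read is bounded by the TTL.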
D1 is enough until it isn't. For aidops-scale (< 100 users), D1's free
tier handles everything. Don't migrate to Postgres speculatively.
Performance budgets are guardrails, not goals. Set them once, enforce in
CI, forget about them until they break.
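A sketch of what "enforce in CI" can look like: compare measured metrics against fixed budgets and fail the build on any violation. The metric names and limits below are illustrative assumptions, not values this skill prescribes:

```typescript
// Sketch: fail CI when any measured metric exceeds its budget.
type Budget = { metric: string; limitMs: number };

// Illustrative budgets; set once, then leave alone until they break.
const budgets: Budget[] = [
  { metric: "p95_api_latency", limitMs: 300 },
  { metric: "ttfb", limitMs: 200 },
];

function budgetViolations(measured: Record<string, number>, budgets: Budget[]): string[] {
  return budgets
    .filter((b) => (measured[b.metric] ?? Infinity) > b.limitMs) // missing metric counts as a failure
    .map((b) => `${b.metric}: ${measured[b.metric]}ms exceeds budget ${b.limitMs}ms`);
}

// In a CI step: exit nonzero if budgetViolations(...).length > 0
```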
Scale the bottleneck, not the stack. If the bottleneck is a missing index,
adding a CDN won't help. Fix the actual constraint.
Cost scales with usage, not preparation. CF's pay-per-request model means
you don't pay for capacity you don't use. Build the optimization, deploy it,
and let usage determine cost.