Name: Cost Tracking
Author: JNZader

AI token usage and cost tracking for per-session monitoring, budget alerts, and optimization.

1. Core Principle

AI API costs are invisible by default. Without tracking: costs spiral, cache opportunities are missed, expensive models do cheap work, and no visibility into which projects consume the most budget.

Goal: Maximize value per dollar, not minimize cost.

2. Cost Optimization Strategies

Prompt Caching

Cache reads are 90% cheaper on Anthropic
Keep a stable system prompt (gets cached automatically)
Put frequently-referenced context at the start (prefix-based caching)
Breakeven: prompt reused 2+ times

Context Pruning

Use .claudeignore to exclude irrelevant files
Summarize long conversations instead of full history
Before: 50K tokens ($0.15/turn) → After: 15K tokens ($0.045/turn) =

Metric	Formula	Target
Daily spend	Sum cost_usd for today	< $10
Cost per session	Daily / sessions	< $0.50
Cache hit ratio	cache_read / (input + cache_read)	> 60%
Budget utilization	Monthly spend / budget	50-80%

Cost Tracking

1. Core Principle

2. Cost Optimization Strategies

Prompt Caching

Context Pruning

Cost Tracking

1. Core Principle

2. Cost Optimization Strategies

Prompt Caching

Context Pruning

Model Tiering

Batch Operations

3. Per-Session Tracking

4. Budget Alerts

5. Framework Integration

6. Anti-Patterns

Quick Reference

Things Mac

Trello

Production Scheduling

Jira Integration

Production Scheduling

Cost Aware Llm Pipeline