Estimate and compare LLM API costs across providers (OpenAI, Anthropic Claude, Google Gemini, DeepSeek, Mistral, Cohere). Includes per-token pricing, batch/caching discounts, workload templates for common scenarios (chatbot, RAG, summarization, code review), and a cross-provider comparison matrix with per-use-case recommendations. Use this skill when the user asks about API token costs, wants to compare model pricing, needs a monthly cost estimate, or asks which model is cheapest for their use case.
A reference skill that gives Claude accurate, up-to-date LLM API pricing data and calculation templates to estimate costs and compare providers instantly.
Pricing data is updated monthly. AI providers change prices frequently. For critical cost decisions, verify against the official pricing page linked at the top of each reference file.
Trigger on questions like:
Extract parameters from the user's question:
If the user specified a model: Read the matching references/pricing-*.md file and references/calculation-templates.md. Calculate the cost, showing each step.
If the user wants a comparison or didn't specify a model: Read references/comparison-matrix.md to find 2-3 options at different price/quality tiers. Then read the relevant pricing files for exact numbers.
Always show the math so the user can verify:
Input: 2,000 tokens × 1,000 requests/day = 2M tokens/day
Cost: 2M / 1M × $3.00 = $6.00/day input
Output: 500 tokens × 1,000 = 500K tokens/day
Cost: 0.5M / 1M × $15.00 = $7.50/day output
Total: $13.50/day → $405/month
Mention applicable optimizations:
For workload estimates, use the templates in references/calculation-templates.md as starting points.
Always note the "Last verified" date from the pricing file and suggest verifying on the official pricing page for critical decisions.
references/pricing-openai.md — OpenAI models: GPT-5.4, GPT-4.1, o-series, embeddings, audio, imagesreferences/pricing-anthropic.md — Claude models: Opus, Sonnet, Haiku + prompt caching + extended thinkingreferences/pricing-google.md — Gemini models: Pro, Flash, Flash-Lite + free tier detailsreferences/pricing-others.md — DeepSeek, Mistral, Cohere, xAI (Grok)references/comparison-matrix.md — Cross-provider comparison by tier and use casereferences/calculation-templates.md — Formulas, token estimation rules, workload templates