Name: Cost Estimate
Author: selcukyucel

Pipeline components:
  1. <component name> — <model> — <purpose> (e.g., "Classification — Claude Haiku — categorize ticket")
  2. <component name> — <model> — <purpose> (e.g., "Extraction — Claude Sonnet — extract fields")
  3. <component name> — <model/API> — <purpose> (e.g., "Embedding — text-embedding-3-small — embed query")

Token Breakdown:
  System prompt:        ~[N] tokens (fixed, cacheable)
  User input:           ~[N] tokens average (variable)
  RAG context:          ~[N] tokens ([K] chunks × [N] tokens/chunk)
  Tool definitions:     ~[N] tokens (fixed, cacheable)
  Conversation history: ~[N] tokens average (variable, multi-turn only)
  ─────────────────────────────
  Total input:          ~[N] tokens/request
  Expected output:      ~[N] tokens/request
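
The totals in the breakdown above are plain sums of the listed sources. A minimal sketch of that arithmetic follows; the `TokenBreakdown` class, its field names, and the sample token counts are illustrative assumptions, not part of the skill itself.

```python
from dataclasses import dataclass


@dataclass
class TokenBreakdown:
    """Hypothetical container for the token sources listed in the template."""
    system_prompt: int         # fixed, cacheable
    user_input: int            # average, variable
    rag_chunks: int            # K chunks retrieved per request
    tokens_per_chunk: int      # N tokens per chunk
    tool_definitions: int      # fixed, cacheable
    conversation_history: int = 0  # multi-turn only
    expected_output: int = 0

    @property
    def rag_context(self) -> int:
        # RAG context = chunks x tokens per chunk
        return self.rag_chunks * self.tokens_per_chunk

    @property
    def cacheable_input(self) -> int:
        # Only the fixed sources are marked cacheable in the template
        return self.system_prompt + self.tool_definitions

    @property
    def total_input(self) -> int:
        return (
            self.system_prompt
            + self.user_input
            + self.rag_context
            + self.tool_definitions
            + self.conversation_history
        )


# Sample numbers are made up for illustration.
example = TokenBreakdown(
    system_prompt=800,
    user_input=200,
    rag_chunks=4,
    tokens_per_chunk=300,
    tool_definitions=500,
    expected_output=250,
)
print(example.total_input, example.cacheable_input)
```

Keeping the cacheable portion separate from the total makes it easier to apply a cached-token price to only the fixed sources later.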

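| Model | Input $/1M | Output $/1M | Cache $/1M | Input Cost | Output Cost | Cache Savings | Total/Request |
|-------|-----------|-------------|------------|------------|-------------|---------------|---------------|
| Claude Opus 4.6 | $15.00 | $75.00 | $1.88 | $[N] | $[N] | -$[N] | $[N] |
| Claude Sonnet 4.6 | $3.00 | $15.00 | $0.38 | $[N] | $[N] | -$[N] | $[N] |
| Claude Haiku 4.5 | $0.80 | $4.00 | $0.08 | $[N] | $[N] | -$[N] | $[N] |
| GPT-4o | $2.50 | $10.00 | n/a | $[N] | $[N] | n/a | $[N] |
| GPT-4o-mini | $0.15 | $0.60 | n/a | $[N] | $[N] | n/a | $[N] |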
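
The per-request columns in the pricing table can be filled in with a small helper. This is a sketch under stated assumptions: the "Cache $/1M" column is read as the price of a cached input token, "Input Cost" is computed at the full input price, and "Cache Savings" is the discount on the cacheable tokens. Cache-write surcharges and cache misses are ignored. The function name and the sample token counts are hypothetical; the prices are the ones listed in the table.

```python
from typing import Optional


def cost_per_request(
    input_tokens: int,
    cacheable_tokens: int,
    output_tokens: int,
    input_price: float,                   # $ per 1M input tokens
    output_price: float,                  # $ per 1M output tokens
    cache_price: Optional[float] = None,  # $ per 1M cached tokens, None if not listed
) -> dict:
    """Return the per-request cost columns for one model (illustrative only)."""
    per_token = 1 / 1_000_000
    input_cost = input_tokens * input_price * per_token
    output_cost = output_tokens * output_price * per_token
    if cache_price is None:
        cache_savings = 0.0
    else:
        # Assumption: every cacheable token is served from cache.
        cache_savings = cacheable_tokens * (input_price - cache_price) * per_token
    return {
        "input_cost": input_cost,
        "output_cost": output_cost,
        "cache_savings": cache_savings,
        "total": input_cost + output_cost - cache_savings,
    }


# Sample request: 2,700 input tokens (1,300 cacheable) and 250 output tokens.
sonnet = cost_per_request(2700, 1300, 250, 3.00, 15.00, 0.38)
gpt4o_mini = cost_per_request(2700, 1300, 250, 0.15, 0.60)

for name, row in (("Claude Sonnet 4.6", sonnet), ("GPT-4o-mini", gpt4o_mini)):
    print(f"{name}: ${row['total']:.6f}/request")
```

A real estimate would likely also account for the cache hit rate and any one-time cache-write cost, both of which depend on the provider and traffic pattern.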

Cost Delta: <change description>
───────────────────────────────

                    Current         Proposed        Delta
Per request         $<N>            $<N>            +/- $<N> (+/- N%)
Monthly (1x)        $<N>            $<N>            +/- $<N> (+/- N%)
Monthly (10x)       $<N>            $<N>            +/- $<N> (+/- N%)

What changed:
  + Added: <new cost source> (+$<N>/request)
  - Removed: <removed cost source> (-$<N>/request)
  ~ Changed: <modified cost source> ($<old> → $<new>)
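
The delta rows in the template above follow from two per-request costs and a monthly volume. A rough sketch, assuming cost scales linearly with request count; `cost_delta` and the sample figures are hypothetical and only illustrate the arithmetic.

```python
def cost_delta(current: float, proposed: float, requests_per_month: int) -> dict:
    """Build the Current / Proposed / Delta rows of the template (illustrative)."""
    rows = {}
    scales = (
        ("Per request", 1),
        ("Monthly (1x)", requests_per_month),
        ("Monthly (10x)", requests_per_month * 10),
    )
    for label, volume in scales:
        cur = current * volume
        new = proposed * volume
        delta = new - cur
        # Percentage change is undefined when the current cost is zero.
        pct = (delta / cur * 100) if cur else float("nan")
        rows[label] = (cur, new, delta, pct)
    return rows


# Made-up example: $0.0100 -> $0.0125 per request at 50,000 requests/month.
rows = cost_delta(0.0100, 0.0125, 50_000)
for label, (cur, new, delta, pct) in rows.items():
    print(f"{label:<14} ${cur:>10.4f}  ${new:>10.4f}  {delta:+.4f} ({pct:+.1f}%)")
```

Because the scaling is linear, the percentage change is the same in every row; only the absolute delta grows with volume.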

## Cost Estimate: <automation description>

**Date:** <date>
**Model:** <recommended model>
**Volume assumption:** <requests/month>

### Token Breakdown

| Source | Tokens | Type | Cacheable |
|--------|--------|------|-----------|
| System prompt | [N] | fixed | yes |
| User input | [N] avg | variable | no |
| RAG context | [N] | variable | no |
| Tool definitions | [N] | fixed | yes |
| Output | [N] | variable | no |
| **Total** | **[N] in / [N] out** | | |

### Cost Per Request

| Model | $/request | Quality | Recommendation |
|-------|----------|---------|----------------|
| [model] | $[N] | [quality note] | [recommended / alternative / too expensive] |

### Monthly Projection

| Scale | Requests/mo | Cost/mo | With Caching |
|-------|------------|---------|-------------|
| 1x | [N] | $[N] | $[N] |
| 10x | [N] | $[N] | $[N] |
| 100x | [N] | $[N] | $[N] |

### Optimization Opportunities

1. [optimization]: saves ~[N]% ($[N]/mo at current volume)

### Recommendation

[Which model to use, whether the cost is within budget, what optimizations to apply]

Cost Estimate

Cost Estimate — Token Cost Projection

Purpose

Input

Workflow

Step 1: Identify Token Sources

Step 2: Estimate Token Counts

Step 3: Calculate Costs Per Model

Step 4: Project Monthly Costs

Step 4b: Calculate Cost Delta (for changes to existing pipelines)

Step 5: Evaluate Optimization Opportunities

Step 6: Present Output

Step 7: Persist (optional)

Notes

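| Scale | Requests/month | Cost/month (no cache) | Cost/month (with cache) |
|-------|---------------|-----------------------|-------------------------|
| 1x (current) | [N] | $[N] | $[N] |
| 10x (growth) | [N] | $[N] | $[N] |
| 100x (scale) | [N] | $[N] | $[N] |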
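
The projection table can be generated from the current request volume and two per-request costs. The sketch below assumes cost grows linearly with volume and that the cached and uncached per-request costs stay constant at every scale; `project_monthly` and the sample inputs are hypothetical.

```python
def project_monthly(
    requests_per_month: int,
    cost_no_cache: float,    # $ per request without caching
    cost_with_cache: float,  # $ per request with caching
    multipliers=(1, 10, 100),
) -> list:
    """Return one row per scale multiplier (illustrative only)."""
    rows = []
    for m in multipliers:
        requests = requests_per_month * m
        rows.append(
            {
                "scale": f"{m}x",
                "requests": requests,
                "no_cache": requests * cost_no_cache,
                "with_cache": requests * cost_with_cache,
            }
        )
    return rows


# Made-up inputs: 50,000 requests/month, $0.01185 uncached, $0.008444 cached.
projection = project_monthly(50_000, 0.01185, 0.008444)
for row in projection:
    print(
        f"{row['scale']:>4}  {row['requests']:>10,}  "
        f"${row['no_cache']:>10,.2f}  ${row['with_cache']:>10,.2f}"
    )
```

Volume discounts, rate-limit tiers, and changing cache hit rates would break the linear assumption, so the larger scales are best read as rough upper-level estimates.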