Measure and optimize the cost/quality curve — which model, prompt, and settings give the best quality per dollar. Covers Pareto analysis, break-even thresholds, and when to spend more vs less. Use this skill when optimizing LLM spend, picking a default model for a feature, or deciding whether a premium model is worth it. Activate when: cost vs quality, model selection, eval cost, Pareto frontier, cheaper model, premium model tradeoff.
Quality without cost context is half a decision. You need the Pareto frontier — for each quality bar, what's the cheapest config that hits it?
Plot each candidate config (model × prompt × settings) on quality (y-axis) vs cost per request (x-axis). The frontier is the set of configs where no other config is both cheaper AND better.
Any config NOT on the frontier is dominated — some other config is at least as cheap and at least as good, and strictly better on one axis. Drop it.
```
quality
 ↑
1 |                               *A (opus + thinking)
  |                    *B (opus)
  |         *D (sonnet + few-shot)
  |     *C (sonnet)
  |         *G
  |  *E (haiku)
0 |  *F
  +-------------------------------→ cost
```
Pareto: A, B, D, C, E. Dominated: F (worse than E at same cost), G (worse than D at same cost).
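A minimal sketch of the frontier computation — config names and numbers below are illustrative, not measurements:

```javascript
// A config is dominated when another is at least as cheap and at least as
// good, and strictly better on one axis. The frontier is everything else.
function paretoFrontier(configs) {
  return configs.filter((a) =>
    !configs.some((b) =>
      b !== a &&
      b.cost <= a.cost && b.quality >= a.quality &&
      (b.cost < a.cost || b.quality > a.quality)
    )
  );
}

// Illustrative numbers only — measure your own workload.
const configs = [
  { name: "E", cost: 0.001, quality: 0.70 }, // haiku
  { name: "C", cost: 0.006, quality: 0.84 }, // sonnet
  { name: "D", cost: 0.008, quality: 0.88 }, // sonnet + few-shot
  { name: "B", cost: 0.030, quality: 0.92 }, // opus
  { name: "A", cost: 0.060, quality: 0.95 }, // opus + thinking
  { name: "F", cost: 0.001, quality: 0.55 }, // dominated by E
  { name: "G", cost: 0.008, quality: 0.80 }, // dominated by D (and C)
];

const frontier = paretoFrontier(configs).map((c) => c.name).sort();
// → ["A", "B", "C", "D", "E"]
```

The O(n²) filter is fine here — you rarely have more than a few dozen candidate configs.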
For each candidate, measure:
| Metric | Example |
|---|---|
| Input tokens / request | 2,500 |
| Output tokens / request | 400 |
| $ / request | $0.012 |
| Quality score | 0.87 |
| p95 latency | 1.8s |
```javascript
// Rates are $ per million tokens (MTok) for the model being measured.
const costPerRequest =
  (usage.input_tokens / 1e6) * inputRate +
  (usage.output_tokens / 1e6) * outputRate +
  (usage.cache_creation_input_tokens / 1e6) * cacheWriteRate +
  (usage.cache_read_input_tokens / 1e6) * cacheReadRate;
```
Always include cache costs — on cache-heavy workloads they are often the largest line item.
For any feature, try at least:
- Haiku with a solid prompt
- Sonnet
- Sonnet + few-shot examples
- Opus
- Opus + extended thinking
One of these usually sits on the frontier for your workload. Don't assume — measure.
Before jumping to a bigger model, try prompt levers:
- Few-shot examples of good outputs
- Tighter task instructions and an explicit output format
- Extended thinking, where the model supports it
A better prompt on Haiku can beat a mediocre prompt on Sonnet — and cost 10× less.
When considering an upgrade, compute when it pays off:
- Cost increase per request: Δcost = cost_new − cost_old
- Quality increase: Δquality = quality_new − quality_old
- Value per quality point: V (estimated from business metrics)
- Worth it if: Δquality × V > Δcost
Example: If every 1% quality gain increases user retention revenue by $0.003/request, and upgrading Haiku→Sonnet costs +$0.002/request for +5% quality: 5 × $0.003 = $0.015 > $0.002 — the upgrade pays for itself roughly 7× over.
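The break-even check as code — the value per quality point is an estimate you must supply from your own business metrics; the numbers here just replay the example:

```javascript
// Worth upgrading when the quality gain, priced in dollars, exceeds the
// added per-request cost. valuePerPoint is an assumed business estimate.
function upgradeWorthIt(deltaQualityPoints, valuePerPoint, deltaCost) {
  return deltaQualityPoints * valuePerPoint > deltaCost;
}

// Haiku → Sonnet example: +5 points at $0.003/point vs +$0.002/request.
const worthIt = upgradeWorthIt(5, 0.003, 0.002); // 0.015 > 0.002 → true
```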
You don't have to pick one. Route by difficulty:
```javascript
const difficulty = await classifyDifficulty(query);
const model = difficulty === "simple" ? "claude-haiku-4-5"
            : difficulty === "medium" ? "claude-sonnet-4-6"
            : "claude-opus-4-6";
```
Classification is a cheap Haiku call. Most queries are simple; you save money. Hard queries get the premium treatment.
Measure: does tiered routing actually improve your cost/quality position? Sometimes classification errors wipe out the gains.
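One way to sanity-check routing before shipping it is to estimate the blended cost/quality point under an assumed traffic mix, then compare that point against your single-model configs. All numbers below are assumptions, not measurements:

```javascript
// Assumed traffic mix and per-tier measurements; replace with your own data.
const tiers = [
  { share: 0.70, cost: 0.001, quality: 0.80 }, // simple → haiku
  { share: 0.25, cost: 0.006, quality: 0.88 }, // medium → sonnet
  { share: 0.05, cost: 0.030, quality: 0.93 }, // hard   → opus
];
const classifierCost = 0.0002; // the cheap Haiku classification call

const blendedCost = classifierCost +
  tiers.reduce((sum, t) => sum + t.share * t.cost, 0);    // ≈ $0.0039/request
const blendedQuality =
  tiers.reduce((sum, t) => sum + t.share * t.quality, 0); // ≈ 0.8265
```

If the blended point lands above your single-model frontier, routing helps; if classification errors push real quality below the estimate, it may not.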
Cost-quality isn't enough; latency matters too. Examples where it dominates:
- Interactive chat and autocomplete, where users abandon slow responses
- Voice agents, where turn-taking breaks down past about a second
- Anything inside a request path with a latency SLO
Report 3-tuples: (quality, cost, p95 latency). The frontier in 3D is smaller; choose by which axis has a constraint.
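Dominance generalizes directly to three axes — a sketch, with latency as p95 seconds (numbers are illustrative):

```javascript
// a dominates b when a is no worse on all three axes and strictly better on
// at least one. Higher quality is better; lower cost and latency are better.
function dominates(a, b) {
  const noWorse = a.quality >= b.quality && a.cost <= b.cost && a.p95 <= b.p95;
  const strictlyBetter = a.quality > b.quality || a.cost < b.cost || a.p95 < b.p95;
  return noWorse && strictlyBetter;
}

// Illustrative: the cheaper config wins on cost and latency but not quality,
// so neither dominates — both stay on the 3D frontier.
const cheapFast = { quality: 0.87, cost: 0.012, p95: 1.8 };
const bigSlow   = { quality: 0.92, cost: 0.045, p95: 4.2 };
```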
If you can cache 90% of your input, and cache reads bill at roughly a tenth of the base input rate, effective input cost drops to about 0.9 × 0.1 + 0.1 = 0.19× the uncached cost — enough to move a config onto the frontier.
Decisions made without caching factored in are usually wrong. Re-measure with cache.
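A sketch of the effective-rate math, assuming cache reads bill at ~10% of the base input rate (check your provider's current pricing):

```javascript
// Blend cached and uncached input cost into one effective $/MTok rate.
function effectiveInputRate(baseRate, cachedFraction, cacheReadMultiplier = 0.1) {
  return baseRate * (cachedFraction * cacheReadMultiplier + (1 - cachedFraction));
}

// $3/MTok base with 90% of input cached:
const rate = effectiveInputRate(3.0, 0.9); // 3 × (0.09 + 0.1) ≈ 0.57 $/MTok
```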
Don't eval each config on 10,000 items. Start small:
- Screen every config on 100-300 items
- Drop clearly dominated configs immediately
- Run the full set only on frontier candidates
Saves 10-100× on eval cost.
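Back-of-envelope, with made-up but typical numbers — screen all configs on a small sample, and pay full price only for the survivors:

```javascript
// Assumed: 8 candidate configs, a 10,000-item eval set, 200-item screen.
const configsCount = 8, fullSet = 10000, screenSet = 200;

const naiveCalls  = configsCount * fullSet;   // 80,000 calls: everything at full size
const screenCalls = configsCount * screenSet; //  1,600 calls: screen everything first
const perConfigSavings = fullSet / screenSet; // 50× cheaper per screened config
```

Confirming the winner on the full set adds back one full run, so total savings depend on how many configs survive screening.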