A two-tier routing pattern for Claude API calls. Default for 80% of calls is Haiku — fast, cheap, and sufficient for structured work. Escalate to Sonnet only when the task needs real reasoning.

When to use each tier

FAST (Haiku) — use by default:

JSON extraction / structured output
Summarization of a paragraph into a sentence
Parameter generation with constraints
Classification (sentiment, category, intent)
Turning freeform text into a known schema
"Which of these is X" single-choice selection

DEEP (Sonnet) — use only when:

The task requires chains of reasoning over multiple facts
The output quality is user-facing and visibly matters (final report, strategy critique)
The decision has downstream cost if wrong (trade signal, architectural choice)
The input requires synthesizing information the model must weigh and trade off

When in doubt, start with FAST and only escalate if the output is visibly worse.

Cost math

A two-tier routing pattern for Claude API calls. Default for 80% of calls is Haiku — fast, cheap, and sufficient for structured work. Escalate to Sonnet only when the task needs real reasoning.

When to use each tier

FAST (Haiku) — use by default:

JSON extraction / structured output
Summarization of a paragraph into a sentence
Parameter generation with constraints
Classification (sentiment, category, intent)
Turning freeform text into a known schema
"Which of these is X" single-choice selection

DEEP (Sonnet) — use only when:

The task requires chains of reasoning over multiple facts
The output quality is user-facing and visibly matters (final report, strategy critique)
The decision has downstream cost if wrong (trade signal, architectural choice)
The input requires synthesizing information the model must weigh and trade off

When in doubt, start with FAST and only escalate if the output is visibly worse.

Model	$/1M input	$/1M output	Relative cost
Haiku 4.5	$0.80	$4.00	1x
Sonnet 4.6	$3.00	$15.00	~3.75x

Tier Router

When to use each tier

Cost math

Tier Router

When to use each tier

Cost math

Install

Usage

Anti-patterns

Xurl

Acp Router

Coding Standards

Api Design

Mcp Server Patterns

Backend Patterns