Smart LLM router that saves 67% on inference costs by routing each request to the cheapest model that can handle it. 55+ models across 9 providers (11 free NVIDIA models), all through one wallet.

Install

openclaw plugins install @blockrun/clawrouter

Setup

# Enable smart routing (auto-picks cheapest model per request)
openclaw models set blockrun/auto

# Or pin a specific model
openclaw models set openai/gpt-4o

How Routing Works

ClawRouter classifies each request into one of four tiers:

SIMPLE (40% of traffic) — factual lookups, greetings, translations → Gemini Flash ($0.60/M, 99% savings)
MEDIUM (30%) — summaries, explanations, data extraction → DeepSeek Chat ($0.42/M, 99% savings)
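As a rough illustration of the tiering idea above, the following Python sketch maps a prompt to one of the two tiers listed and estimates its cost. The keyword cues, the `classify` and `estimate_cost` helpers, and the reading of "/M" as "per million tokens" are all assumptions made for this example; ClawRouter's actual classifier is not described here and may work quite differently.

```python
from typing import Optional

# Tier table copied from the list above; only the two tiers shown there appear.
# Assumption: "$X/M" means US dollars per million tokens.
TIERS = {
    "SIMPLE": {"model": "Gemini Flash", "usd_per_million_tokens": 0.60},
    "MEDIUM": {"model": "DeepSeek Chat", "usd_per_million_tokens": 0.42},
}

# Hypothetical keyword cues, for illustration only (not ClawRouter's real rules).
SIMPLE_CUES = ("hello", "translate", "what is", "who is")
MEDIUM_CUES = ("summarize", "summary", "explain", "extract")


def classify(prompt: str) -> Optional[str]:
    """Return "SIMPLE" or "MEDIUM", or None when no cue matches."""
    text = prompt.lower()
    # Check MEDIUM first so "explain what is X" is not under-classified.
    if any(cue in text for cue in MEDIUM_CUES):
        return "MEDIUM"
    if any(cue in text for cue in SIMPLE_CUES):
        return "SIMPLE"
    # Anything else would fall through to a tier not covered by this sketch.
    return None


def estimate_cost(tier: str, tokens: int) -> float:
    """Estimated cost in USD for `tokens` tokens at the tier's listed price."""
    return TIERS[tier]["usd_per_million_tokens"] * tokens / 1_000_000


if __name__ == "__main__":
    for prompt in ("Translate 'good morning' to French", "Summarize this report"):
        tier = classify(prompt)
        print(prompt, "->", tier, TIERS[tier]["model"])
```

Checking the more demanding tier first is a deliberate choice in this sketch: sending a request to a model that is too weak costs more in retries than sending a simple request one tier up.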
