Name: Stingy Route
Author: capitalthought

搜索技能.../

Stingy Route | Skills Pool

Task	Best option	Why
Simple questions, lookups	Gemini Flash or GPT-4o-mini	Near-free, fast, good enough
Summarizing a doc or article	Gemini Flash (1M context)	Handles huge inputs cheaply
Quick code formatting	Local model (Ollama/LM Studio)	Zero cost, instant
Grep/search codebase	Don't use AI at all	`rg`, `grep`, `find` are free and instant
Reading docs	Don't use AI at all	Just read the docs yourself
Running tests/builds	Don't use AI at all	Just run the command

Task	Best option	Cost estimate	Why
Standard code generation	Claude Sonnet or GPT-4o	~$0.10-0.50	Both excellent at code, 5x cheaper than Opus
Bug fixes with context	Claude Sonnet	~$0.20-0.80	Good tool use, understands codebases
Writing tests	Claude Sonnet	~$0.10-0.30	Mechanical task, doesn't need Opus
Code review (non-security)	Claude Sonnet	~$0.20-0.50	Catches most issues
Documentation	GPT-4o or Claude Sonnet	~$0.10-0.30	Either works well
Data transformation	Gemini Pro	~$0.10-0.40	Great at structured data
Explaining code	GPT-4o-mini or Haiku	~$0.02-0.10	Simple comprehension task

Task	Best option	Cost estimate	Why
Complex architecture decisions	Claude Opus	~$2-8	Best reasoning, worth the cost
Security review	Claude Opus	~$2-5	Accuracy critical, can't miss vulnerabilities
Multi-file refactors	Claude Opus or Sonnet	~$1-5	Needs to hold large context coherently
Debugging subtle race conditions	Claude Opus	~$2-8	Needs deep reasoning
Novel algorithm design	Claude Opus or o3	~$3-10	Frontier reasoning required

Platform	Best at	Worst at	Pricing model
Claude Code (Anthropic)	Tool use, code, long context, agents	Simple Q&A (overkill)	Subscription ($20-200/mo) or API
ChatGPT (OpenAI)	General knowledge, DALL-E, browsing, plugins	Complex tool orchestration	$20/mo Pro or API
Gemini (Google)	Huge context (1M+), Google integration, multimodal	Tool use, agentic workflows	Free tier generous, API cheap
Grok (xAI)	Real-time info (X/Twitter), fast, uncensored	Code quality, tool use	$8/mo Premium or API
Local models (Ollama)	Privacy, zero cost, offline	Quality ceiling, no tool use	Free (your hardware)

TASK: [user's task]

RECOMMENDED: [Model/Platform]
COST: ~$X.XX (estimated)
WHY: [one sentence]

ALTERNATIVES:
  Cheaper: [option] — [tradeoff]
  Better:  [option] — [cost difference and what you gain]

AVOID: [what NOT to use and why]

💡 TIP: [one actionable tip to reduce cost further]

Model	Input	Output	Notes
Claude Opus 4	$15	$75	Best quality, most expensive
Claude Sonnet 4	$3	$15	Best value for code
Claude Haiku 3.5	$0.80	$4	Great for simple tasks
GPT-4o	$2.50	$10	Strong general purpose
GPT-4o-mini	$0.15	$0.60	Extremely cheap, good quality
Gemini 2.5 Pro	$1.25	$10	Huge context window
Gemini 2.5 Flash	$0.15	$0.60	Cheapest capable model
Grok 3	$3	$15	Fast, real-time knowledge
o3	$10	$40	Best reasoning, very expensive
o4-mini	$1.10	$4.40	Good reasoning, cheaper

Stingy Route

/route — Smart Task Router

Step 1: Understand the Task

Step 2: Classify the Task

Stingy Route

/route — Smart Task Router

Step 1: Understand the Task

Step 2: Classify the Task

Step 3: Route to Best Option

Tier 1: Free / Near-Free Options

Tier 2: Budget Options ($0.10-$1.00 per task)

Tier 3: Premium Options ($1-$10 per task)

Platform-Specific Strengths

Step 4: Present the Recommendation

Step 5: Claude Code Specific Advice

Pricing Reference (as of April 2026 — verify before making decisions)

Rules

Continuous Learning V2

Continuous Learning V2

Continuous Learning V2

Continuous Learning

Continuous Learning

Pytorch Patterns