Name: Ai Subscription Tracker
Author: kivo360

Skills suchen.../

Ai Subscription Tracker | Skills Pool

Scenario	Monthly Cost	Annual Cost	Notes
CURRENT	~$278	~$3,336	Claude Max 20x ($200) + Fireworks + others
After Sonnet 5	~$130-150	~$1,560-1,800	Drop to Sonnet tier, simplify stack
Savings	~$130-150/mo	~$1,500-1,800/yr	When Sonnet 5 drops

accounts/fireworks/routers/kimi-k2p5-turbo

Model	Provider	Speed	Coding	Reasoning	Vision	Cost Level	Best For
Kimi K2.5 Turbo	Fireworks	⚡⚡⚡⚡⚡ (200t/s)	76.8% SWE-Bench	High	✅	Very Low	Orchestration, daily coding
Kimi K2.5	OpenCode Go/Moonshot	⚡⚡⚡ (60t/s)	76.8% SWE-Bench	High	✅	Very Low	Claude alternative
GPT-5.4	OpenAI	⚡⚡⚡ (40t/s)	80%+ SWE-Bench	Very High	✅	Medium	Deep reasoning, architecture
GPT-5.4 Mini	OpenAI	⚡⚡⚡⚡ (100t/s)	75% SWE-Bench	High	❌	Very Low	Quick tasks
Claude Opus 4.6	Anthropic	⚡⚡ (25t/s)	80.8% SWE-Bench	Very High	✅	High	Complex debugging
Claude Sonnet 4.6	Anthropic	⚡⚡⚡ (50t/s)	75% SWE-Bench	High	✅	Medium	General tasks
Gemini 3.1 Pro	Google	⚡⚡⚡ (60t/s)	75% SWE-Bench	High	✅	Low	Frontend, visual tasks
Gemini 3 Flash	Google	⚡⚡⚡⚡ (150t/s)	70% SWE-Bench	Medium	✅	Very Low	Documentation, fast tasks
MiniMax M2.7	MiniMax/OpenCode	⚡⚡⚡⚡ (120t/s)	80.2% SWE-Bench	High	❌	Very Low	Utility, code search
MiniMax M2.7 Highspeed	MiniMax	⚡⚡⚡⚡⚡ (200t/s)	80.2% SWE-Bench	High	❌	Low	Fast utility
GLM 5	Z.ai/OpenCode	⚡⚡⚡ (50t/s)	77.8% SWE-Bench	High	❌	Low	Claude-like orchestration
Grok Code Fast 1	xAI/GitHub	⚡⚡⚡⚡⚡ (250t/s)	70% SWE-Bench	Medium	❌	Very Low	Code grep, search

Model	Input	Output	Cache Read
Kimi K2.5 (regular)	$0.60	$2.00	-
Kimi K2.5 Turbo	$0.99	$4.94	$0.16
GPT-5.4	$2.50	$15.00	-
GPT-5.4 Mini	$0.25	$2.00	-
GPT-5-Nano	$0.05	$0.40	-
Claude Opus 4.6	$5.00	$25.00	$0.50
Claude Sonnet 4.6	$3.00	$15.00	$0.30
Claude Haiku 4.5	$1.00	$5.00	$0.10
Gemini 3.1 Pro	$2.00	$12.00	-
Gemini 3 Flash	$0.10	$0.40	-
MiniMax M2.7	$0.30	$1.20	$0.06
MiniMax M2.7 Highspeed	$0.60	$2.40	$0.06
GLM 5	$1.00	$3.20	$0.11

{
  "model": "fireworks/accounts/fireworks/routers/kimi-k2p5-turbo",
  "baseURL": "https://api.fireworks.ai/inference/v1"
}

Feature	Details
Codename	"Fennec"
Performance	Outperforms Claude Opus 4.5 internally
Price	~50% cheaper than Opus 4.5
Context Window	1 million tokens (vs 200K now)
Killer Feature	"Dev Team" / "Agent Swarm" mode — spawns multiple specialized agents (architect, backend, frontend, QA) that work in parallel
SWE-Bench	80.9% (far surpassing existing models)
Release	Imminent (likely Q2 2026)

## Month: [Month Year]

### Daily Average Usage
- Hours/day: ___
- Requests/day: ___
- Primary model: ___

### Costs This Month
| Provider | Planned | Actual | Notes |
|----------|---------|--------|-------|
| Claude Max 20x | $200 | $___ | Waiting for Sonnet 5 |
| Fireworks Fire Pass | $28 | $___ | Backup orchestrator |
| GitHub Copilot | $10 | $___ | |
| OpenAI | $20 | $___ | API usage: $___ |
| Google Gemini | $20 | $___ | |
| MiniMax (OpenCode Go) | $10 | $___ | If needed |
| **TOTAL** | **~$288** | **$___** | **Current spend** |

### Sonnet 5 Watch
- [ ] Release announced
- [ ] Benchmarks verified
- [ ] Pricing confirmed
- [ ] Agent Swarm tested
- [ ] Decision: Drop Max 20x?

### If Sonnet 5 is Good - New Stack:
| Provider | Expected Cost |
|----------|---------------|
| Claude Sonnet 5 | ~$50-75? |
| Fireworks | $28 |
| Copilot | $10 |
| OpenAI | $20 |
| Gemini | $20 |
| **TOTAL** | **~$128-153** |

### Model Usage Breakdown
| Model | % Usage | Tokens Used | Cost |
|-------|---------|-------------|------|
| Claude Opus 4.6 (Max 20x) | ___% | ___M | $200 |
| Kimi K2.5 Turbo | ___% | Unlimited | $28 |
| GPT-5.4 | ___% | ___M | $___ |
| Gemini Pro | ___% | ___M | $___ |
| MiniMax M2.7 | ___% | ___M | $___ |

### Optimizations Made / Planned
- [x] **CANCELLED Z.ai GLM** — Infrastructure trash, quality degraded
- [x] **CANCELLED MiniMax subscription** — Use OpenCode Go or pay-as-you-go
- [x] **ADDED Fireworks Fire Pass** — $28/mo unlimited Kimi 200t/s
- [ ] **PENDING: Drop Claude Max 20x** — Waiting for Sonnet 5 "Fennec"
  - If Sonnet 5 has Agent Swarm → Could replace OMO orchestration
  - If 1M context works → Could simplify entire stack
  - If ~$50-75/mo → Save $125-150/month
- [ ] **Evaluate Gemini need** — Do you actually use it beyond Google ecosystem?

### Notes
___

Agent	Primary Model	Fallback	Monthly Cost
Sisyphus (Orchestrator)	🔥 Claude Opus 4.6 (Max 20x)	Fireworks Kimi Turbo	$200
Prometheus (Planner)	Claude Opus 4.6	Fireworks Kimi	Included
Hephaestus (Deep work)	OpenAI GPT-5.4	Gemini Pro	$20 + API
Oracle (Consultant)	OpenAI GPT-5.4	Gemini Pro	API usage
Explore/Librarian	Fireworks Kimi / MiniMax	Copilot Grok	$28
Frontend	Gemini 3.1 Pro	Fireworks Kimi	$20

Agent	Primary Model	Why
Everything	🚀 Claude Sonnet 5 "Fennec"	Agent Swarm built-in? 1M context? 50% cheaper?
Backup	Fireworks Kimi Turbo	$28/mo unlimited
IDE	GitHub Copilot Pro	$10
Reasoning	GPT-5.4 (if needed)	$20
Google	Gemini Advanced	$20 (if still needed)

Provider	Billing	Support	Docs
Fireworks	Billing	[email protected]	Docs
GitHub Copilot	Settings	GitHub Support	Docs
OpenAI	Billing	OpenAI Help	API Docs
Google Gemini	Google One	Google Support	Gemini Docs
Anthropic Claude	Account	Anthropic Support	Claude Docs
Z.ai	Z.ai Billing	Z.ai Support	GLM Docs
MiniMax	MiniMax Platform	MiniMax Support	API Docs
OpenCode	OpenCode	Discord/Discord	Docs

Use Case	Primary Model	Provider	Monthly Cost	Why
Orchestration	🔥 Claude Opus 4.6 (Max 20x)	Anthropic	$200	You hit Pro limits, need 220K tokens/5hr
Deep Reasoning	GPT-5.4	OpenAI	$20 + API	Architecture, Oracle, Hephaestus
Frontend/UI	Gemini 3.1 Pro	Google	$20	Google ecosystem user
Backup/Fast	Kimi K2.5 Turbo	Fireworks	$28	Unlimited, 200 tok/s, testing as backup
IDE	Copilot	GitHub	$10	Completions
Utility	MiniMax M2.7	OpenCode Go	$10	If needed

Use Case	Primary Model	Expected Cost
Everything	🚀 Claude Sonnet 5 "Fennec"	~$50-75?
Backup	Fireworks Kimi Turbo	$28
IDE	Copilot Pro	$10
Specialized	GPT-5.4 (if needed)	$20
Google	Gemini Advanced (if still needed)	$20

Type	Price
Input	$0.99/1M tokens
Cached Input	$0.16/1M tokens
Output	$4.94/1M tokens

Ai Subscription Tracker

AI Subscription Tracker & Cost Optimizer

📊 Current Subscriptions

Ai Subscription Tracker

AI Subscription Tracker & Cost Optimizer

📊 Current Subscriptions

💰 Cost Summary (REAL CURRENT vs FUTURE)

🎯 Why Keep Claude Max 20x For Now

🔥 Fireworks AI Fire Pass

Pricing

What's Included

Model ID for Configuration

When to Use

Direct API Pricing (without Fire Pass)

🤖 Model Capabilities Matrix

Cost per 1M Tokens (API)

💵 Subscription Details

1. Fireworks AI Fire Pass ⭐⭐⭐⭐⭐

2. GitHub Copilot Pro ⭐⭐⭐⭐⭐

3. OpenAI ChatGPT Plus ⭐⭐⭐⭐⭐

4. Google Gemini Advanced ⭐⭐⭐⭐ (KEPT)

5. Claude Code Pro ⭐⭐⭐ (REDUCED FROM MAX)

6. Z.ai GLM Coding Plan ⭐ (CANCEL IMMEDIATELY - Infrastructure Trash)

Why Z.ai Infrastructure is Trash:

7. Claude (Anthropic) Max 20x ⭐⭐⭐⭐⭐ (KEEP UNTIL Sonnet 5)

Claude Sonnet 5 "Fennec" Leaks (March 2026)

📈 Usage Tracking Template

Monthly Usage Log

🎯 Current vs Future Configuration

CURRENT STACK (Pre-Sonnet 5)

FUTURE STACK (If Sonnet 5 Delivers)

The Sonnet 5 Bet

🔗 Provider Links

📋 Action Items Checklist (UPDATED - March 28, 2026)

Immediate Actions (This Week)

Short-term (This Month)

The Sonnet 5 Watch (Q2 2026)

Long-term (Post-Sonnet 5)

Long-term (Quarterly)

🎯 Current vs Future Primary Models

CURRENT REALITY (You're Paying $278/month)

IF SONNET 5 DELIVERS (Potential $130-150/month)

💡 Cost-Saving Tips (Your ACTUAL Strategy)

Things Mac

Trello

Production Scheduling

Jira Integration

Production Scheduling

Cost Aware Llm Pipeline