Guides caching strategy selection and implementation across the full stack including HTTP caching, application-level caching (Redis, in-memory), frontend data caching (SWR, TanStack Query), LLM response caching (prompt caching, semantic caching), database query caching, cache invalidation patterns, and distributed cache architectures. Covers cache-aside, read-through, write-through, write-behind patterns, eviction policies (LRU/LFU), and agentic workflow caching considerations. Use when adding caching to an application, choosing a caching strategy, debugging stale data, optimizing API response times, reducing LLM costs, or designing distributed cache topologies. Triggers: cache, caching, Redis, CDN, TTL, cache invalidation, stale data, Cache-Control, ETag, SWR, TanStack Query, prompt caching, semantic cache, LRU, write-through, cache-aside, materialized view.
Expert guidance for caching strategy selection, implementation, and invalidation across HTTP, application, frontend, database, and LLM layers.
| Request type | Load reference |
|---|---|
| HTTP headers, CDN, Cache-Control, ETags, browser caching | references/http-and-cdn-caching.md |
| Redis, in-memory, LRU/LFU, eviction, two-tier cache | references/application-level-caching.md |
| SWR, TanStack Query, service workers, browser storage | references/frontend-data-caching.md |
| Prompt caching, semantic caching, LLM cost reduction, agentic caching | references/llm-and-agentic-caching.md |
| Materialized views, query caching, database read optimization | references/database-query-caching.md |
| TTL, event-driven, tag-based, versioned invalidation strategies | references/cache-invalidation.md |
Not everything should be cached. Cache data that is read frequently, expensive to compute, tolerant of brief staleness, and unlikely to change between reads. Caching mutable, low-read data adds complexity without benefit.
Decision test: If the data is read 10x more than it is written and a few seconds of staleness is acceptable, it is a strong caching candidate.
| Pattern | How it works | Best for |
|---|---|---|
| Cache-aside | App checks cache, on miss reads DB, writes to cache | General purpose; simple, widely understood |
| Read-through | Cache itself loads from DB on miss | Frameworks with cache-provider abstraction |
| Write-through | Writes go to cache and DB synchronously | Strong consistency requirements |
| Write-behind | Writes go to cache, async flush to DB | Write-heavy workloads; eventual consistency OK |
| Refresh-ahead | Cache proactively refreshes before expiry | Predictable access patterns; low-latency reads |
Default recommendation: Start with cache-aside. It is the simplest and most portable pattern, and it gives the application full control over caching behavior.
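For contrast with cache-aside, here is a minimal write-through sketch. Plain `Map`s stand in for the cache and the backing store; all names are illustrative, not from the source.

```typescript
// Write-through sketch: Maps stand in for the cache and the backing store.
const store = new Map<string, string>();
const cache = new Map<string, string>();

// Every write lands in both places synchronously, so a later read
// from the cache can never observe a value older than the store's.
function writeThrough(key: string, value: string): void {
  store.set(key, value);
  cache.set(key, value);
}

function read(key: string): string | undefined {
  return cache.get(key) ?? store.get(key);
}
```

The synchronous double write is what buys the consistency listed in the table; write-behind would instead queue the `store.set` for an async flush.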
| Policy | Evicts | Best when |
|---|---|---|
| LRU (Least Recently Used) | Least recently accessed entries | Recency predicts future access |
| LFU (Least Frequently Used) | Least frequently accessed entries | Popularity predicts future access (skewed workloads) |
| TTL (Time-To-Live) | Expired entries | Data has a known freshness window |
| Random | Random entries | Uniform access distribution |
Default recommendation: Use allkeys-lru for Redis. It handles the common case well, where a small hot set of keys receives most reads (a Pareto-like distribution). Add a TTL as a safety net on all entries.
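On the Redis side, this recommendation comes down to two settings; the values below are illustrative, and `maxmemory` should be sized to the instance:

```conf
# redis.conf -- illustrative values
maxmemory 2gb
maxmemory-policy allkeys-lru
```

`allkeys-lru` lets Redis evict any key under memory pressure, not just those with a TTL; the per-entry TTL then acts as the staleness backstop rather than the eviction mechanism.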
Apply caching at multiple layers, each serving a different purpose:
Browser Cache -> CDN Edge -> API Gateway -> App In-Memory -> Redis -> Database
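The application-side portion of that read path (in-process memory, then a shared cache, then the database) can be sketched as a single lookup function. `redisGet` and `dbGet` are hypothetical async stand-ins for real clients:

```typescript
// L1: process-local cache; fastest, but per-instance and lost on restart.
const local = new Map<string, string>();

async function layeredGet(
  key: string,
  redisGet: (k: string) => Promise<string | null>, // L2: shared cache
  dbGet: (k: string) => Promise<string | null>,    // L3: source of truth
): Promise<string | null> {
  const l1 = local.get(key);
  if (l1 !== undefined) return l1;

  const l2 = await redisGet(key);
  if (l2 !== null) {
    local.set(key, l2); // backfill L1 on an L2 hit
    return l2;
  }

  const value = await dbGet(key);
  if (value !== null) {
    local.set(key, value); // a fuller version would also populate L2 here
    return value;
  }
  return null;
}
```

Each layer backfills the one above it, so repeated reads get progressively cheaper.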
"There are only two hard things in Computer Science: cache invalidation and naming things." Start with TTL-based expiration and layer on event-driven invalidation for critical data paths. Never rely solely on manual invalidation.
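A minimal sketch of layering event-driven invalidation on TTL expiry, using Node's `EventEmitter` as a stand-in for a real pub/sub channel (Redis pub/sub, a message queue, etc.); all names here are illustrative:

```typescript
import { EventEmitter } from "node:events";

// Stand-ins: `bus` for a pub/sub channel, `cache` for Redis.
const bus = new EventEmitter();
const cache = new Map<string, { value: string; expiresAt: number }>();

function cacheSet(key: string, value: string, ttlMs: number): void {
  // TTL is the safety net: even a missed event cannot leave the entry stale forever.
  cache.set(key, { value, expiresAt: Date.now() + ttlMs });
}

function cacheGet(key: string): string | undefined {
  const entry = cache.get(key);
  if (!entry) return undefined;
  if (Date.now() > entry.expiresAt) {
    cache.delete(key); // lazy TTL expiry on read
    return undefined;
  }
  return entry.value;
}

// Event-driven layer: the write path announces changes, the cache evicts eagerly.
bus.on("user.updated", (userId: string) => cache.delete(`user:${userId}`));

cacheSet("user:42", '{"name":"Ada"}', 300_000);
bus.emit("user.updated", "42"); // emitted by whatever code updates the user
```

The event gives fast invalidation on the critical path; the TTL bounds staleness if an event is ever dropped.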
When an AI agent is implementing or reasoning about caching:
async function getUser(userId: string): Promise<User> {
// 1. Check cache
const cached = await redis.get(`user:${userId}`);
if (cached) return JSON.parse(cached);
// 2. Cache miss -- read from database
const user = await db.users.findById(userId);
if (!user) throw new NotFoundError('User not found');
// 3. Populate cache with TTL
await redis.set(`user:${userId}`, JSON.stringify(user), 'EX', 300); // 5 min TTL
return user;
}
async function updateUser(userId: string, data: Partial<User>): Promise<User> {
// 1. Update database (source of truth)
const user = await db.users.update(userId, data);
// 2. Invalidate cache
await redis.del(`user:${userId}`);
return user;
}
# Static assets (fingerprinted filenames)
Cache-Control: public, max-age=31536000, immutable
# HTML pages
Cache-Control: no-cache
ETag: "abc123"
# API responses (short-lived, revalidatable)
Cache-Control: private, max-age=0, must-revalidate
ETag: "v2-abc123"
# API responses (CDN-friendly)
Cache-Control: public, max-age=60, s-maxage=300, stale-while-revalidate=60
# Sensitive data (never cache)
Cache-Control: no-store
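Rather than scattering these directives across handlers, they can be centralized in one helper; this mapping is a sketch that simply mirrors the header values above (the type and function names are assumptions):

```typescript
// Response classes from the header recipes above.
type ResponseClass =
  | "static-asset" // fingerprinted files
  | "html"         // revalidate via ETag
  | "api-private"  // per-user, always revalidate
  | "api-cdn"      // shared, short-lived, SWR-friendly
  | "sensitive";   // never cache

function cacheControlFor(kind: ResponseClass): string {
  switch (kind) {
    case "static-asset":
      return "public, max-age=31536000, immutable";
    case "html":
      return "no-cache";
    case "api-private":
      return "private, max-age=0, must-revalidate";
    case "api-cdn":
      return "public, max-age=60, s-maxage=300, stale-while-revalidate=60";
    case "sensitive":
      return "no-store";
  }
}
```

In a framework handler this would be applied as, e.g., `res.setHeader("Cache-Control", cacheControlFor("api-cdn"))`.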
| Data type | Suggested TTL | Rationale |
|---|---|---|
| Static assets (versioned) | 1 year (immutable) | Filename changes on content change |
| User profile | 5-15 minutes | Moderate change frequency |
| Product catalog | 1-5 minutes | Balances freshness and performance |
| Session data | Match session timeout | Must not outlive session |
| API rate limit counters | Window duration | Must be exact |
| LLM prompt cache | 5 min (Anthropic default) | Cost vs. freshness tradeoff |
| Search results | 30-60 seconds | High change frequency |
| Configuration/feature flags | 30-60 seconds | Must propagate quickly |
# Pattern: {entity}:{identifier}:{optional-variant}
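For illustration, a tiny helper that follows this pattern (the function name is an assumption, not from the source):

```typescript
// Builds keys of the form {entity}:{identifier}:{optional-variant}.
function cacheKey(entity: string, id: string, variant?: string): string {
  return variant ? `${entity}:${id}:${variant}` : `${entity}:${id}`;
}

// cacheKey("user", "42")            -> "user:42"
// cacheKey("user", "42", "profile") -> "user:42:profile"
```

Consistent key construction also makes tag- or prefix-based invalidation (e.g. deleting every `user:42:*` key) straightforward.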