Reference patterns for distributed systems (DDIA), scalable architecture (Alex Xu), and pragmatic SRE. Loaded by backend-specialist and devops-engineer when working on system design, scaling, data architecture, or reliability concerns.
Reference knowledge for distributed systems, scalable architecture, and site reliability.
READ COMMITTED does NOT prevent Write Skew. Understand what your isolation level actually guarantees.UPDATE val = val + 1SELECT ... FOR UPDATEMonitor these and alert only when human action is needed:
| Signal | What to measure |
|---|---|
| Latency | p99 response time |
| Traffic | QPS / bytes per second |
| Errors | HTTP 5xx rate |
| Saturation | CPU, RAM, I/O — alert BEFORE 100% |
| Metric | What it measures |
|---|---|
| Deployment Frequency | How often you ship to prod |
| Lead Time for Changes | Commit → prod time |
| MTTR | Mean time to restore after failure |
| Change Failure Rate | % of deployments causing incidents |
main daily.