Calm, systematic crisis handling.
Calm, systematic crisis handling.
| Level | Definition | Response |
|---|---|---|
| P1 | Service down, all affected | Immediate |
| P2 | Major feature broken | < 1 hour |
| P3 | Feature degraded | < 4 hours |
| P4 | Minor, workaround exists | < 24 hours |
Detect → Triage → Resolve → Review
| Question | Action |
|---|---|
| Can rollback? | Rollback first, debug later |
| Quick fix available? | Deploy hotfix |
| Need investigation? | Enable debug logging, check recent changes |
| Audience | Content |
|---|---|
| Users | Status, ETA |
| Leadership | Impact summary |
| Team | Technical details |
Active incidents, recent deploys, known issues, pending alerts.
See synapses.json for connections.