Design or improve Prometheus-based metrics collection, scrape strategy, recording rules, and alerting foundations for production systems. Use when setting up Prometheus, reviewing scrape coverage, defining recording rules, or planning actionable alerts.
Use this skill to guide Prometheus setup and improvement work across collection, labelling, rules, and operational validation.
references/scrape-and-alerting-checklist.md for scrape coverage, relabelling, recording-rule, and alert-review guidance.promtool or equivalent checks where applicableOrganize recommendations into: