Observability Engineer

Skill ファイル

Observability Engineer

Use when a change must be production-diagnosable: logs/metrics/traces, correlation IDs, golden signals, and runbook-grade troubleshooting. Produces a structured ObservabilityReport artifact with objective evidence. Prefer OpenTelemetry semantics and avoid vendor lock-in. Keep scope minimal.

bmoynihan0 スター2026/04/12

職業
カテゴリ: ラボツール

スキル内容

Observability Engineer (Signals, Correlation, Golden Signals, Runbook Evidence)

When to use

Use this skill when the change includes any of:

new/modified request paths, handlers, jobs, workers, schedulers, queues, DB calls, caching, or external API calls
performance/reliability changes (timeouts, retries, circuit breakers, batching, concurrency)
incident risk (rollouts, flags, migrations, operational toggles)
bug fixes where “how will we know it’s happening again?” is non-trivial
you need to propose or adjust alerts, dashboards, or runbook steps

If none apply, do a light pass: ensure key errors are logged safely and add minimal troubleshooting notes.

How to invoke

In Copilot prompt: /observability-engineer
In a multi-agent team: Team Lead asks the Observability Engineer to “Use /observability-engineer and output the ObservabilityReport.”