Skill Profile

Prometheus Monitoring

Name: Prometheus Monitoring
Author: automateyournetwork

Prometheus monitoring through 6 MCP tools: PromQL instant and range queries, metric discovery, metric metadata, scrape target health, and server health checks. Use when querying Prometheus metrics, checking scrape targets, investigating alert thresholds, or analyzing network device utilization trends.

automateyournetwork · 432 stars · March 16, 2026

Profession
Category: Monitoring

Skill Content

MCP Server

Property	Value
Source	pab1it0/prometheus-mcp-server
Transport	stdio (default), SSE, or HTTP
Language	Python 3.10+
Tools	6 (query, range query, list metrics, metadata, targets, health check)
Auth	Basic auth (username/password), bearer token, or unauthenticated
Install	`pip3 install prometheus-mcp-server` (PyPI)
Run	`prometheus-mcp-server` (stdio)

How to Run

# stdio mode (default — used by NetClaw)
PROMETHEUS_URL=http://prometheus:9090 prometheus-mcp-server

# HTTP transport mode
PROMETHEUS_MCP_SERVER_TRANSPORT=http PROMETHEUS_URL=http://prometheus:9090 prometheus-mcp-server

# With basic auth
PROMETHEUS_URL=http://prometheus:9090 PROMETHEUS_USERNAME=admin PROMETHEUS_PASSWORD=secret prometheus-mcp-server

# With bearer token (Grafana Cloud, Thanos, etc.)
PROMETHEUS_URL=https://prom.example.com PROMETHEUS_TOKEN=your_bearer_token prometheus-mcp-server

Environment Variables

Variable	Required	Example	Description
`PROMETHEUS_URL`	Yes	`http://prometheus:9090`	Prometheus server endpoint
`PROMETHEUS_USERNAME`	No	`admin`	Basic auth username
`PROMETHEUS_PASSWORD`	No	`changeme`	Basic auth password
`PROMETHEUS_TOKEN`	No	`eyJhbG...`	Bearer token (Grafana Cloud, Thanos, Cortex)
`PROMETHEUS_URL_SSL_VERIFY`	No	`false`	Set to `false` to disable SSL certificate verification
`PROMETHEUS_REQUEST_TIMEOUT`	No	`30`	Request timeout in seconds (default: 30)
`PROMETHEUS_DISABLE_LINKS`	No	`true`	Disable Prometheus UI links in responses (saves context)
`ORG_ID`	No	`1`	Multi-tenant organization ID (Cortex/Mimir)
`PROMETHEUS_CUSTOM_HEADERS`	No	`{"X-Custom":"val"}`	Additional HTTP headers as JSON
`PROMETHEUS_MCP_SERVER_TRANSPORT`	No	`stdio`	Transport: stdio (default), http, or sse
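
The variables above can be assembled programmatically before launching the server. The following is a minimal sketch only: the helper name `build_server_env` is hypothetical, and the validation rules (for example, rejecting basic auth and a bearer token together) are assumptions made for illustration, not documented behavior of prometheus-mcp-server.

```python
import json


def build_server_env(url, username=None, password=None, token=None,
                     custom_headers=None, transport="stdio"):
    """Return environment variables for prometheus-mcp-server as a dict.

    Hypothetical helper: variable names come from the table above;
    the validation rules are assumptions for this sketch.
    """
    if not url:
        raise ValueError("PROMETHEUS_URL is required")
    if transport not in ("stdio", "http", "sse"):
        raise ValueError(f"unknown transport: {transport}")
    if token and (username or password):
        # Assumption: choose one auth method rather than sending both.
        raise ValueError("use either basic auth or a bearer token, not both")
    if bool(username) != bool(password):
        raise ValueError("basic auth needs both username and password")

    env = {
        "PROMETHEUS_URL": url,
        "PROMETHEUS_MCP_SERVER_TRANSPORT": transport,
    }
    if username:
        env["PROMETHEUS_USERNAME"] = username
        env["PROMETHEUS_PASSWORD"] = password
    if token:
        env["PROMETHEUS_TOKEN"] = token
    if custom_headers:
        # The table shows custom headers passed as a JSON object string.
        env["PROMETHEUS_CUSTOM_HEADERS"] = json.dumps(custom_headers)
    return env
```

The resulting dict could then be merged with `os.environ` and passed as the `env` argument of `subprocess.Popen(["prometheus-mcp-server"], ...)`.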

Tools

Tool	Parameters	What It Does
`execute_query`	`query`, `timeout?`	Execute instant PromQL query at current time
`execute_range_query`	`query`, `start`, `end`, `step`, `timeout?`	Execute PromQL range query over time interval
`list_metrics`	`page?`, `page_size?`	Browse available metric names with pagination
`get_metric_metadata`	`metric?`, `limit?`	Retrieve metric type, help text, and unit info
`get_targets`	none	View scrape target details (up/down, labels, last scrape)
`health_check`	none	Check Prometheus server availability and readiness
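
For orientation, the two query tools correspond conceptually to the Prometheus HTTP API endpoints `/api/v1/query` and `/api/v1/query_range`. The sketch below only builds request URLs and sends nothing. The function names are hypothetical, and how prometheus-mcp-server issues its requests internally (including whether its `timeout` parameter maps to the API's `timeout` parameter) is an assumption here.

```python
from urllib.parse import urlencode


def instant_query_url(base_url, query, timeout=None):
    """Build a URL for an instant query (hypothetical helper)."""
    params = {"query": query}
    if timeout is not None:
        # The Prometheus API takes a duration string such as "30s".
        params["timeout"] = timeout
    return f"{base_url.rstrip('/')}/api/v1/query?{urlencode(params)}"


def range_query_url(base_url, query, start, end, step):
    """Build a URL for a range query (hypothetical helper)."""
    params = {"query": query, "start": start, "end": end, "step": step}
    return f"{base_url.rstrip('/')}/api/v1/query_range?{urlencode(params)}"
```

`urlencode` percent-encodes the PromQL expression, which matters because selectors such as `{device='core-rtr-01'}` contain characters that are not URL-safe.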

health_check()
list_metrics(page=1, page_size=50)
execute_query(query="rate(ifHCInOctets{device='core-rtr-01'}[5m]) * 8")
execute_range_query(query="rate(ifHCOutOctets{device='core-rtr-01'}[5m]) * 8", start="2024-01-01T00:00:00Z", end="2024-01-01T01:00:00Z", step="60s")
get_targets()
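
The `* 8` in the queries above converts a per-second octet rate into bits per second. The sketch below works through that arithmetic by hand and builds the RFC 3339 `start`/`end` strings used by `execute_range_query`. The helper names and the 1 Gbps link speed in the usage note are assumptions for illustration. Unlike PromQL's `rate()`, this simple version does not account for counter resets.

```python
from datetime import datetime, timedelta, timezone


def utilization_percent(octets_start, octets_end, seconds, link_bps):
    """Percent utilization from two octet counter samples (hypothetical helper)."""
    bits_per_second = (octets_end - octets_start) * 8 / seconds
    return 100.0 * bits_per_second / link_bps


def range_window(end, hours=1):
    """Return (start, end) as RFC 3339 strings; `end` must be a UTC datetime."""
    start = end - timedelta(hours=hours)
    fmt = "%Y-%m-%dT%H:%M:%SZ"
    return start.strftime(fmt), end.strftime(fmt)
```

For example, 3,750,000,000 octets transferred in 300 seconds on a 1 Gbps link gives `utilization_percent(0, 3_750_000_000, 300, 1_000_000_000)`, which is 10.0 percent, and `range_window(datetime(2024, 1, 1, 1, tzinfo=timezone.utc))` reproduces the one-hour window used in the range query above.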

Integration with Other Skills

Skill	Integration
grafana-observability	Grafana dashboards visualize Prometheus data; use Prometheus skill for direct PromQL when Grafana isn't available or for ad-hoc queries
pyats-health-check	Cross-reference pyATS device health with Prometheus time-series metrics
pyats-routing	Correlate OSPF/BGP state changes with Prometheus metric timelines
gait-session-tracking	Record all Prometheus queries and findings in GAIT audit trail
te-network-monitoring	Pair ThousandEyes path data with Prometheus infrastructure metrics
sdwan-ops	Correlate SD-WAN vManage alarms with Prometheus device metrics
servicenow-change-workflow	Reference Prometheus metrics as evidence in change requests
