스킬 파일

Kibana Alerting Rules

Name: Kibana Alerting Rules
Author: elastic

Create and manage Kibana alerting rules via REST API or Terraform. Use when creating, updating, or managing rule lifecycle (enable, disable, mute, snooze) or rules-as-code workflows.

elastic363 스타2026. 3. 13.

직업
카테고리: 클라우드

스킬 내용

Core Concepts

A rule has three parts: conditions (what to detect), schedule (how often to check), and actions (what happens when conditions are met). When conditions are met, the rule creates alerts, which trigger actions via connectors.

Authentication

All alerting API calls require either API key auth or Basic auth. Every mutating request must include the kbn-xsrf header.

kbn-xsrf: true

Required Privileges

all privileges for the appropriate Kibana feature (e.g., Stack Rules, Observability, Security)
read privileges for Actions and Connectors (to attach actions to rules)

API Reference

관련 스킬

Kibana Alerting Rules | Skills Pool

Operation	Method	Endpoint
Create rule	POST	`/api/alerting/rule/{id}`
Update rule	PUT	`/api/alerting/rule/{id}`
Get rule	GET	`/api/alerting/rule/{id}`
Delete rule	DELETE	`/api/alerting/rule/{id}`
Find rules	GET	`/api/alerting/rules/_find`
List rule types	GET	`/api/alerting/rule_types`
Enable rule	POST	`/api/alerting/rule/{id}/_enable`
Disable rule	POST	`/api/alerting/rule/{id}/_disable`
Mute all alerts	POST	`/api/alerting/rule/{id}/_mute_all`
Unmute all alerts	POST	`/api/alerting/rule/{id}/_unmute_all`
Mute alert	POST	`/api/alerting/rule/{rule_id}/alert/{alert_id}/_mute`
Unmute alert	POST	`/api/alerting/rule/{rule_id}/alert/{alert_id}/_unmute`
Update API key	POST	`/api/alerting/rule/{id}/_update_api_key`
Create snooze	POST	`/api/alerting/rule/{id}/snooze_schedule`
Delete snooze	DELETE	`/api/alerting/rule/{ruleId}/snooze_schedule/{scheduleId}`
Health check	GET	`/api/alerting/_health`

Field	Type	Description
`name`	string	Display name (does not need to be unique)
`rule_type_id`	string	The rule type (e.g., `.es-query`, `.index-threshold`)
`consumer`	string	Owning app: `alerts`, `apm`, `discover`, `infrastructure`, `logs`, `metrics`, `ml`, `monitoring`, `securitySolution`, `siem`, `stackAlerts`, `uptime`
`params`	object	Rule-type-specific parameters
`schedule`	object	Check interval, e.g., `{"interval": "5m"}`

Field	Type	Description
`actions`	array	Actions to run when conditions are met (each references a connector)
`tags`	array	Tags for organizing rules
`enabled`	boolean	Whether the rule runs immediately (default: true)
`notify_when`	string	`onActionGroupChange`, `onActiveAlert`, or `onThrottleInterval` (prefer setting per-action instead)
`alert_delay`	object	Alert only after N consecutive matches, e.g., `{"active": 3}`
`flapping`	object/null	Override flapping detection settings

curl -X POST "https://my-kibana:5601/api/alerting/rule/my-rule-id" \
  -H "kbn-xsrf: true" \
  -H "Content-Type: application/json" \
  -H "Authorization: ApiKey <your-api-key>" \
  -d '{
    "name": "High error rate",
    "rule_type_id": ".es-query",
    "consumer": "stackAlerts",
    "schedule": { "interval": "5m" },
    "params": {
      "index": ["logs-*"],
      "timeField": "@timestamp",
      "esQuery": "{\"query\":{\"match\":{\"log.level\":\"error\"}}}",
      "threshold": [100],
      "thresholdComparator": ">",
      "timeWindowSize": 5,
      "timeWindowUnit": "m",
      "size": 100
    },
    "actions": [
      {
        "id": "my-slack-connector-id",
        "group": "query matched",
        "params": {
          "message": "Alert: {{rule.name}} - {{context.hits}} hits detected"
        },
        "frequency": {
          "summary": false,
          "notify_when": "onActionGroupChange"
        }
      }
    ],
    "tags": ["production", "errors"]
  }'

curl -X GET "https://my-kibana:5601/api/alerting/rules/_find?per_page=20&page=1&search=cpu&sort_field=name&sort_order=asc" \
  -H "Authorization: ApiKey <your-api-key>"

filter=alert.attributes.tags:"production"

# Enable
curl -X POST ".../api/alerting/rule/{id}/_enable" -H "kbn-xsrf: true"

# Disable
curl -X POST ".../api/alerting/rule/{id}/_disable" -H "kbn-xsrf: true"

# Mute all alerts
curl -X POST ".../api/alerting/rule/{id}/_mute_all" -H "kbn-xsrf: true"

# Mute specific alert
curl -X POST ".../api/alerting/rule/{rule_id}/alert/{alert_id}/_mute" -H "kbn-xsrf: true"

# Delete
curl -X DELETE ".../api/alerting/rule/{id}" -H "kbn-xsrf: true"

terraform {
  required_providers {
    elasticstack = {
      source  = "elastic/elasticstack"
    }
  }
}

provider "elasticstack" {
  kibana {
    endpoints = ["https://my-kibana:5601"]
    api_key   = var.kibana_api_key
  }
}

resource "elasticstack_kibana_alerting_rule" "cpu_alert" {
  name         = "CPU usage critical"
  consumer     = "stackAlerts"
  rule_type_id = ".index-threshold"
  interval     = "1m"
  enabled      = true

  params = jsonencode({
    index              = ["metrics-*"]
    timeField          = "@timestamp"
    aggType            = "avg"
    aggField           = "system.cpu.total.pct"
    groupBy            = "top"
    termField          = "host.name"
    termSize           = 10
    threshold          = [0.9]
    thresholdComparator = ">"
    timeWindowSize     = 5
    timeWindowUnit     = "m"
  })

  tags = ["infrastructure", "production"]
}

curl -X PUT "https://my-kibana:5601/api/alerting/rule/my-rule-id" \
  -H "kbn-xsrf: true" \
  -H "Content-Type: application/json" \
  -H "Authorization: ApiKey <your-api-key>" \
  -d '{
    "name": "High error rate",
    "schedule": { "interval": "5m" },
    "params": { ... },
    "actions": [
      {
        "id": "<workflow-id>",
        "group": "query matched",
        "params": {},
        "frequency": { "summary": false, "notify_when": "onActionGroupChange" }
      }
    ]
  }'

Set action frequency per action, not per rule. The notify_when field at the rule level is deprecated in favor of per-action frequency objects. If you set it at the rule level and later edit the rule in the Kibana UI, it is automatically converted to action-level values.
Use alert summaries to reduce notification noise. Instead of sending one notification per alert, configure actions to send periodic summaries at a custom interval. Use "summary": true and set a throttle interval. This is especially valuable for rules that monitor many hosts or documents.
Choose the right action frequency for each channel. Use onActionGroupChange for paging/ticketing systems (fire once, resolve once). Use onActiveAlert for audit logging to an Index connector. Use onThrottleInterval with a throttle like "30m" for dashboards or lower-priority notifications.
Always add a recovery action. Rules without a recovery action leave incidents open in PagerDuty, Jira, and ServiceNow indefinitely. Use the connector's native close/resolve event action (e.g., eventAction: "resolve" for PagerDuty) in the Recovered action group.
Set a reasonable check interval. The minimum recommended interval is 1m. Very short intervals across many rules clog Task Manager throughput and increase schedule drift. The server setting xpack.alerting.rules.minimumScheduleInterval.value enforces this.
Use alert_delay to suppress transient spikes. Setting {"active": 3} means the alert only fires after 3 consecutive runs match the condition, filtering out brief anomalies.
Enable flapping detection. Alerts that rapidly switch between active and recovered are marked as "flapping" and notifications are suppressed. This is on by default but can be tuned per-rule with the flapping object.
Use server.publicBaseUrl for deep links. Set server.publicBaseUrl in kibana.yml so that {{rule.url}} and {{kibanaBaseUrl}} variables resolve to valid URLs in notifications.
Tag rules consistently. Use tags like production, staging, team-platform for filtering and organization in the Find API and UI.
Use Kibana Spaces to isolate rules by team or environment. Prefix API paths with /s/<space_id>/ for non-default spaces. Connectors are also space-scoped, so create matching connectors in each space.

Kibana Alerting Rules

Core Concepts

Authentication

Required Privileges

API Reference

Kibana Alerting Rules

Core Concepts

Authentication

Required Privileges

API Reference

Creating a Rule

Required Fields

Optional Fields

Example: Create an Elasticsearch Query Rule

Updating a Rule

Finding Rules

Lifecycle Operations

Terraform Provider

Triggering Kibana Workflows from Rules

Connectors and Actions in Rules

Best Practices

Common Pitfalls

Examples

Guidelines

Additional Resources

Feishu Drive

Nanoclaw Repl

Crosspost

Cloudflare

Mcp Integration

Setup Deploy