What is the Prometheus Alert Lifecycle?

The Prometheus alert lifecycle describes the path an alert follows from creation to resolution. Prometheus generates alerts from predefined alerting rules and hands them to Alertmanager, which groups, routes, and delivers the notifications.

Stages of the Alert Lifecycle

Define Alert Rules
Alerts are defined as alerting rules in rule files, which prometheus.yml loads through its rule_files setting. Each rule uses a PromQL expression to specify the condition that triggers the alert.
Example:
```yaml
groups:
  - name: instance_alerts   # group name chosen for illustration
    rules:
      - alert: InstanceDown
        expr: up == 0
        for: 5m
        labels:
          severity: critical
        annotations:
          summary: "Instance {{ $labels.instance }} is down"
          description: "No response from {{ $labels.instance }} for 5 minutes"
```
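For a rule to take effect, Prometheus has to be told where the rule file lives. A minimal sketch of the relevant part of prometheus.yml, assuming the InstanceDown rule is saved as alert_rules.yml next to the main configuration (both file names below are placeholders):

```yaml
# prometheus.yml (fragment)
rule_files:
  # Hypothetical file name; relative paths are resolved against
  # the directory that contains prometheus.yml
  - "alert_rules.yml"
  # Glob patterns are accepted as well
  - "rules/*.yml"
```

Running promtool check rules alert_rules.yml before reloading Prometheus is a cheap way to catch syntax errors in the rule file.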
Alert Evaluation
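Prometheus evaluates alerting rules at the configured evaluation interval (evaluation_interval, 1 minute by default), which is set separately from the scrape interval. When a rule's condition first becomes true and the rule has a for duration, the alert enters the pending state, and no notification is sent during this phase. A rule without a for clause fires on the first evaluation where the condition is true.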
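How often rules are evaluated is controlled in the global section of prometheus.yml. The values below are illustrative assumptions, not recommendations:

```yaml
# prometheus.yml (fragment)
global:
  scrape_interval: 15s      # how often targets are scraped
  evaluation_interval: 30s  # how often alerting and recording rules are evaluated
```

With settings like these, a rule with for: 5m can only move from pending to firing on an evaluation that happens at least five minutes after the condition was first seen as true.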
Firing Alerts
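When the condition has held for the full for duration, the alert transitions to the firing state. Prometheus then sends the alert to Alertmanager, along with its labels and annotations. If the condition stops being true before the duration elapses, the alert returns to inactive and the timer starts over.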
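Prometheus can only deliver firing alerts if it knows where Alertmanager is. A sketch of that wiring in prometheus.yml, assuming a single Alertmanager instance listening on its default port 9093 on the same host (the address is an assumption for illustration):

```yaml
# prometheus.yml (fragment)
alerting:
  alertmanagers:
    - static_configs:
        - targets:
            - "localhost:9093"  # assumed address of the Alertmanager instance
```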
Routing and Notification
Alertmanager matches each alert's labels against its routing tree to select a receiver, which determines the recipients and the notification channel (e.g., email, Slack, PagerDuty).
Example configuration:
```yaml
route:
  group_by: ['alertname', 'severity']
  receiver: 'slack'
receivers:
  - name: 'slack'
    slack_configs:
      - channel: '#alerts'
        text: "{{ .CommonAnnotations.summary }}"
```
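Routing becomes more useful once different alerts go to different places. The following is a hedged sketch of a slightly larger alertmanager.yml, assuming Alertmanager 0.22 or later (for the matchers syntax) and a hypothetical second receiver named pagerduty. The webhook URL, integration key, and timing values are placeholders chosen for illustration:

```yaml
global:
  # Placeholder; replace with your own Slack incoming-webhook URL
  slack_api_url: 'https://hooks.slack.com/services/REPLACE/WITH/YOURS'

route:
  receiver: 'slack'                    # default receiver when no child route matches
  group_by: ['alertname', 'severity']
  group_wait: 30s                      # wait before sending the first notification for a new group
  group_interval: 5m                   # wait before notifying about changes to an existing group
  repeat_interval: 4h                  # wait before re-sending a notification that is still firing
  routes:
    - matchers:
        - severity="critical"
      receiver: 'pagerduty'            # critical alerts page the on-call engineer

receivers:
  - name: 'slack'
    slack_configs:
      - channel: '#alerts'
        text: "{{ .CommonAnnotations.summary }}"
  - name: 'pagerduty'
    pagerduty_configs:
      - routing_key: 'REPLACE_WITH_INTEGRATION_KEY'  # placeholder
```

In this sketch the InstanceDown alert, which carries severity: critical, would match the child route and go to PagerDuty, while alerts with any other severity fall through to Slack.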
Resolution
Once the alert condition is no longer true, Prometheus marks the alert as resolved and informs Alertmanager. Alertmanager then notifies users about the resolution if the receiver has send_resolved enabled.
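Whether a resolution message is actually delivered depends on the receiver. Slack receivers do not send one by default, so a sketch of enabling it might look like this (the title template is an assumption added for illustration, so that firing and resolved messages can be told apart):

```yaml
receivers:
  - name: 'slack'
    slack_configs:
      - channel: '#alerts'
        send_resolved: true   # also notify when the alert stops firing
        title: '[{{ .Status | toUpper }}] {{ .CommonLabels.alertname }}'
        text: "{{ .CommonAnnotations.summary }}"
```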

Lifecycle Summary

Define Rule →
Evaluate Condition →
Pending Alert →
Firing Alert →
Alertmanager Processing →
Notification →
Resolution

Key Details

Pending State: Alerts stay in this state until the for duration is met. No notifications are sent during this phase.
Firing State: The alert is actively sent to Alertmanager for processing and notification.
Resolved State: Alerts are marked resolved when the condition becomes false, and updates are sent to Alertmanager.
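Expiration: Prometheus keeps a resolved alert for a short retention period (15 minutes in the current implementation) so that the resolution can be re-sent to Alertmanager, then drops it from its active alert list.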
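The pending and firing behavior described above can be checked without waiting for a real outage by writing a rule unit test for promtool. This is a sketch under assumptions: the InstanceDown rule is stored in a file named alert_rules.yml, and the job and instance label values are made up for the test.

```yaml
# alert_rules_test.yml (hypothetical file name)
rule_files:
  - alert_rules.yml

evaluation_interval: 1m

tests:
  - interval: 1m
    input_series:
      # The target is up for the first two samples, then down from minute 2 onward
      - series: 'up{job="node", instance="host1:9100"}'
        values: '1 1 0 0 0 0 0 0 0 0'
    alert_rule_test:
      # Minute 4: the condition has held for only 2 minutes, so the alert
      # is still pending and no firing alert is expected
      - eval_time: 4m
        alertname: InstanceDown
      # Minute 8: the condition has held for more than 5 minutes, so the
      # alert is expected to be firing
      - eval_time: 8m
        alertname: InstanceDown
        exp_alerts:
          - exp_labels:
              severity: critical
              job: node
              instance: host1:9100
            exp_annotations:
              summary: "Instance host1:9100 is down"
              description: "No response from host1:9100 for 5 minutes"
```

The test is run with promtool test rules alert_rules_test.yml.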

Understanding and managing the alert lifecycle helps keep Prometheus monitoring both reliable and actionable.

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.