Uptime & Heartbeat Monitors
Integrations

Prometheus Integration

Create incidents in iLert from Prometheus Alertmanager alerts

Prometheus is an open-source systems monitoring and alerting toolkit that uses a pull-based approach to collecting metrics in a timeseries database.

With iLert's Prometheus integration, you can automatically create incidents in iLert using the Prometheus' Alertmanager. That way, you will never miss a critical alert and always alert the right person using iLert's on-call schedules, automatic escalation, and multiple alerting channels. When the Alertmanager triggers an alert, iLert will alert the on-call person through their preferred channel, including SMS, phone calls, push notifications and Slack. iLert will automatically escalate to the next person, if the alert is not acknowledged. iLert also lets you define alerting rules based on support hours and delay alerts until your support hours start.

System Requirements

In iLert: Create Prometheus alert source

1. Go to Services --> Alert sources and click on Create new alert source

2. Give it a name and chose an escalation policy

3. Select Prometheus as the Integration type

4. A webhook URL will be generated on the next page. You will need this URL later in Prometheus.

In Prometheus Alertmanager: add webhook receiver

1. Add a Webhook configuration from the alert manager in the configuration file. Use the URL generated in iLert as the Webhook URL:

receivers:
- name: 'ilert.web.hook'
webhook_configs:
- url: 'https://api.ilert.com/api/v1/events/prometheus/e6bcfcbf-a38f-462a-af9d-1687809b7594'

2. You can now configure any route in the Alert Manager. In the following example, all alerts that do not match another route are sent to iLert:

route:
group_by: ['alertname']
group_wait: 10s
group_interval: 10s
repeat_interval: 1h
receiver: 'ilert.web.hook'

3. Restart the alert manager

4. Optional: Send a test alert through the Alert Manager API.

curl -d '[{"labels":{"Alertname":"iLert Test"},"annotations":{"summary":"iLert Test"}}]' http://localhost:9093/api/v1/alerts

FAQ

Will incidents in iLert be resolved automatically?

Yes, Prometheus also sends resolved events by default, as long as the send_resolved: false option is NOT set in the Webhook configuration of the alert manager. Furthermore, resolved events - just like firing events - are not sent until the next group_interval configuration in the alert manager.

Can I link Prometheus to multiple alert sources in iLert?

Yes, create several Webhook receivers in Prometheus and enter the URL of the alert source from iLert in the Webhook URL.

What if my internet connection is interrupted? Are the alerts generated in Prometheus lost?

No, alerts are not lost. The alert manager has a retry mechanism. In addition, we recommend that you monitor your Internet connection with an external monitoring service (e.g. using iLert's heartbeat feature or uptime monitoring). See here for a Prometheus Heartbeat Example.

Not all Prometheus Alerts incidents are created in iLert. Why?

The alerts from Prometheus are grouped and sent to iLert and bundled in an incident. Grouping is affected by the group_by configuration in the Alert Manager route.

Example:

route:
# The labels by which incoming alerts are grouped together. For example,
# multiple alerts coming in for cluster=A and alertname=LatencyHigh would
# be batched into a single group.
#
# To aggregate by all possible labels use '...' as the sole label name.
# This effectively disables aggregation entirely, passing through all
# alerts as-is. This is unlikely to be what you want, unless you have
# a very low alert volume or your upstream notification system performs
# its own grouping. Example: group_by: [...]
group_by: ['alertname', 'cluster', 'service']

The integration does not work. How do I find the issue?

First, look in the log file of the alert manager. If you can not find the issue, please contact our support at [email protected].