Good morning,
I want to create a condition for an alert, and since my monitorization
implies lots of up and downs when something is wrong, I am receiving lots
of mails, because if something fails, tries to reboot, goes on and then of
again for a while.
My current alerts looks like this:
- alert: smarttools_unavailable
expr: probe_http_status_code{job="smarttools_urls"}<= 199 OR
probe_http_status_code{job="smarttools_urls"} >= 300
for: 5m
annotations:
title: SmartTools unavailable
summary: HTTP failure {{ $labels.platform }}
description: "HTTP status = {{ $value }}"
Is there any way to indicates that the alarm must be off for 60 minutes or
so before it triggers again ?
Thanks!
--
You received this message because you are subscribed to the Google Groups
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To view this discussion on the web visit
https://groups.google.com/d/msgid/prometheus-users/508bb31a-9323-4720-9eca-d2baba60a553n%40googlegroups.com.