Hi again. Well, thinking about it, that makes sense. I guess I'll revisit our setup in light of this aspect. Thanks a ton!
---
Federico Buti


On Wed, 5 Aug 2020 at 11:02, Brian Brazil wrote:

> On Wed, 5 Aug 2020 at 09:56, Federico Buti wrote:
>
>> Hi.
>>
>> Thanks for the reply Brian.
>> So one should not alert on absence of a metric? Never ever? Just on the
>> upness of the targets?
>>
>
> Generally you should alert on the absence of up, as that indicates
> something has either gone wrong with service discovery or the service has
> disappeared and no longer exists.
>
> Alerting on absence of scraped metrics only really applies in niche cases
> where a target has a bug that'd result in it sometimes not exposing
> metrics. The blackbox exporter doesn't have such a bug that I'm aware of.
>
> Brian
>
>>
>> ---
>> Federico Buti
>>
>>
>> On Wed, 5 Aug 2020 at 10:46, Brian Brazil wrote:
>>
>>> On Wed, 5 Aug 2020 at 09:32, Federico Buti wrote:
>>>
>>>> Hi all.
>>>>
>>>> A few months ago we introduced target down rules to keep track of
>>>> targets that were missing. The rules are relatively simple, being
>>>> something like e.g.
>>>>
>>>> alert: target_down_slower_scraping_jobs
>>>> expr: up{job=~"monitoring-scripts-5m|monitoring-scripts-hourly"} == 0
>>>> for: 13m
>>>> labels:
>>>>   severity: average
>>>> annotations:
>>>>   # annotations here
>>>>
>>>
>>>> A few days ago we wanted to introduce absence rules and we added them
>>>> for both targets and metrics. That is all ok but with a side effect that
>>>> we didn't consider, i.e. a metric absent alert would of course fire if
>>>> the corresponding target is down.
>>>>
>>>
>>> That doesn't sound right, make sure your absent rules were on "up".
>>>
>>> Brian
>>>
>>>
>>>> Looking into it I've found this blog post
>>>> <https://www.robustperception.io/absent-alerting-for-scraped-metrics>
>>>> proposing to use the unless binary operator, but I'm not sure I've
>>>> understood the usage and its implications.
>>>>
>>>> Unless returns the first metric unless we have some match for the
>>>> second. If I write something like
>>>>
>>>> expr: up{job="node"} == 1 unless absent(check_success{check="xxxxx",
>>>> stack="yyy",environment="zzz"})
>>>>
>>>> I'm just going to return the upness if everything is fine with the
>>>> node. Isn't that wrong? I mean, that would result in an alert because the
>>>> node is up, which is not what we want. Even changing that to 0 would not
>>>> solve the problem since we would still return the absence. Maybe changing
>>>> to zero and inverting the two? But then wouldn't I have duplicated alerts
>>>> for the upness?
>>>>
>>>> Is there a way to make sure absent rules take into account targets that
>>>> are down? Or should I approach the issue in some other way which I'm not
>>>> considering now?
>>>>
>>>> Thanks in advance,
>>>> F.
>>>>
>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "Prometheus Users" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>> an email to [email protected].
>>>> To view this discussion on the web visit
>>>> https://groups.google.com/d/msgid/prometheus-users/7758b7cc-79e9-4b0a-b39f-bff6bcb62d4co%40googlegroups.com
>>>>
>>>
>>> --
>>> Brian Brazil
>>> www.robustperception.io
>>>
>>
>
> --
> Brian Brazil
> www.robustperception.io
>

--
You received this message because you are subscribed to the Google Groups "Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to [email protected].
To view this discussion on the web visit https://groups.google.com/d/msgid/prometheus-users/CA%2B%2Bt_8MKz1sn-RwqJLFo%3D1W-mYnWr1jv%2BytyKwnNBXEc3%2BaKdA%40mail.gmail.com.
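
For readers landing on this thread, here is a rough sketch of the two kinds of rule discussed above. It is only an illustration under stated assumptions, not the poster's actual configuration: the group name, alert names and `for` durations are made up, while the `node` job and the `check_success` selector are copied from the question. The first rule follows the advice to alert on the absence of `up`. The second follows the general idea of the linked blog post as I understand it: the right-hand side of `unless` is the metric itself (not `absent(...)`), matched on `job` and `instance`, so the rule should only return targets that are up but do not expose the metric, and a target that is down should not trigger it.

```yaml
groups:
  - name: absence-example            # hypothetical group name
    rules:
      # Fires when no "up" series exists at all for the job, e.g. service
      # discovery returns nothing or the target no longer exists.
      - alert: job_absent            # hypothetical alert name
        expr: absent(up{job="node"})
        for: 10m                     # assumed duration
      # Fires only for targets that are scraped successfully (up == 1) but
      # have no matching check_success series. "unless" drops every
      # left-hand series that has a match on the right-hand side.
      - alert: check_metric_missing  # hypothetical alert name
        expr: |
          up{job="node"} == 1
            unless on (job, instance)
          check_success{check="xxxxx", stack="yyy", environment="zzz"}
        for: 10m                     # assumed duration
```

Whether the second rule is needed at all depends on the exporter. As noted in the thread, it only really applies when a target can be up and still sometimes fail to expose the metric.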

