[prometheus-users] Sending inhibited or muted Alertmanager alerts when the mute expires?

2024-03-18 Thread Chris Siebenmann
Alertmanager has supported inhibiting alert notifications for a long time, and somewhat more recently it's added support for muting them during specific time ranges (or only sending them in specific time ranges). As far as I know, the behaviour (for both inhibited alerts and muted alerts) is that

Re: [prometheus-users] Re: better way to get notified about (true) single scrape failures?

2024-03-18 Thread Ben Kochie
I usually recommend throwing out any "But this is how Icinga does it". thinking. The way we do things in Prometheus for this kind of thing is to simply think about "availability". For any scrape failures: avg_over_time(up[5m]) < 1 For more than one scrape failure (assuming 15s intervals)

Re: [prometheus-users] snmp_exporter: mibs, indexes, and tables, oh my

2024-03-18 Thread Ben Kochie
Ok, I think I see what is going on. Liebert uses an OBJECT-IDENTITY OID tree `lgpEnvTemperature` to label various metrics, rather than store them as strings in the target device. This style of MIB identity lookup table is not supported by the snmp_exporter. No reason we don't support it, but it's