Scenario: I'm monitoring my APC UPS via SNMP (1.3.6.1.4.1.318.1.1.1.4.1.1.0). If I get a response of 2, all is well. However the other possible responses might need different actions. For example response 3 means On Battery, but that may be because of the weekly self test, so I'd want the alert to mention that. Response 10 means HardwareFailureBypass, which would suggest a more serious problem straight away and require a different alert to different people.
Scenario 2: I'm monitoring temperature in the Data Centre. If the temp is greater than 25C, I'd want to email the help desk. If the temp is greater than 30C however, I'd want to send an SMS to a techie, and if the temp reached 40C I'd want to text everyone I could every 10 minutes!
I guess what I'm wondering about is some more flexibility within the check response and alerting framework - maybe some if...then type scripting. I can do it with multiple checks (sometimes nested, eg Scenario 2 above) but this seems a bit crude. Any ideas?
Cheers,
Ian
_________________________________
Ian K Gray
OEL IS - European Infrastructure Support
Tel: +44 1236 502661
Mob: +44 7881 518854
To unsubscribe send a message with UNSUBSCRIBE as subject to [email protected]
