Hi Andrey,

I don't know if these answer your question specifically, but I found them in the User Guide.

Packet-based Test Procedure
Whenever InterMapper tests a packet-based device, it uses the following procedure:


1.. InterMapper sends the appropriate probe packet (ping, SNMP get-request, DNS query, etc.) 2.. InterMapper waits the timeout interval specified for the particular device. 3.. If a response arrives, InterMapper examines its contents and sets the device status based on that response 4.. However, if no response arrives, InterMapper sends another probe packet 5.. The above procedure is repeated until a response arrives or the specified number of probes has been sent 6.. If no response has arrived after the final timeout, InterMapper sets the device status to Down. 7.. In any event, the device is scheduled to be tested again at a time set by the map's (or the device's) poll interval. The default timeout is three seconds, with a default probe count of three seconds. Consequently, InterMapper will take nine seconds to declare a device is down (three probes, waiting three seconds each). Both the timeout and the number of probes can be set for each device.

This often gives rise to 21 second or 51 second outages. What's happening here is that the device fails to respond to one set of probes (for example, after nine seconds), but responds immediately at the next poll 30 or 60 seconds later. This gives an outage duration to be (30-9=21) seconds or (60-9=51) seconds.



TCP-based probes
Probes like "HTTP", "SMTP", and "LDAP" and others test the ability of a server to accept a TCP connection on a specific listening port, and to respond to a scripted interchange.

a.. For more information on TCP-based probes see Server Probes - Standard .
What Happens in a TCP-based interchange
1.. InterMapper first attempts to connect to the specified port at the device's address. 2.. If this connection attempt fails, InterMapper shows the device in the DOWN state.

If InterMapper successfully connects to the listening port, InterMapper sends protocol-specific commands through the TCP connection to test the server's responses and compare them to expected values. 3.. InterMapper changes the status of the device (e.g. ALARM, WARNING, OKAY, DOWN) if an error condition is detected, or if the execution of InterMapper's probe is interrupted for any reason. 4.. If InterMapper doesn't receive a proper response for 60 seconds, or if the TCP connection is lost while waiting for a response, the InterMapper probe will set status of the device to the proper condition.
Bob Merrill (Documentation Guy)

----- Original Message ----- From: "Andrey Gordon" <[email protected]>
To: "InterMapper Discussion" <[email protected]>
Sent: Tuesday, September 01, 2009 3:16 PM
Subject: [IM-Talk] False positives protection


You can probably tell I got back to tweaking IM again.....

Anyway, I was curious if there is a mechanism that is built in into IM server to double check the failures. For example, if a probe detects a condition that normally triggers an alert (let's say Alarm). Does InterMapper double check that the condition in fact exists or it was just a blip, glitch, etc before alerting?

I'd be interested to see a feature that allows control over that kind of behavior. Let's say, just like under Notifiers one can control the number of repeating alerts, it would be nice to control there number of times the condition has to be triggered before alerts goes out.

The reason this is important is so we could make sure that the condition is there before sending an alert to a pager at 3am. I'd like to be able to setup alerts going to email after one occurrence and to pager after, let's say 3. Many times, but the time I crawl out of bed to see what it is, it already cleared.

Setting it up with delayed notification kind of does the job, but not really. All it does is delays the notification for a minimum of a minute, which with default probe timers only gives it an extra one time to probe it.

I'd like to see Intermapper after detecting a condition being able to verify it a controllable number of times before sending an alert to a give notifier.

Consider this a feature request, but I'd still want to understand what the logic of this behavior is today as implemented. I'm suspecting many on this list would be interested to hear how IM behaves on condition detection....

Thank you,
Andrey
__________________________________________________________
Andrey Gordon | Integrity Interactive | Network Engineer | +1.781.398.3518

____________________________________________________________________
List archives: http://www.mail-archive.com/intermapper-talk%40list.dartware.com/
To unsubscribe: send email to: [email protected]




____________________________________________________________________
List archives: http://www.mail-archive.com/intermapper-talk%40list.dartware.com/
To unsubscribe: send email to: [email protected]

Reply via email to