Hi Andrey,
I don't know if these answer your question specifically, but I found them in
the User Guide.
Packet-based Test Procedure
Whenever InterMapper tests a packet-based device, it uses the following
procedure:
1.. InterMapper sends the appropriate probe packet (ping, SNMP
get-request, DNS query, etc.)
2.. InterMapper waits the timeout interval specified for the particular
device.
3.. If a response arrives, InterMapper examines its contents and sets the
device status based on that response
4.. However, if no response arrives, InterMapper sends another probe
packet
5.. The above procedure is repeated until a response arrives or the
specified number of probes has been sent
6.. If no response has arrived after the final timeout, InterMapper sets
the device status to Down.
7.. In any event, the device is scheduled to be tested again at a time set
by the map's (or the device's) poll interval.
The default timeout is three seconds, with a default probe count of three
seconds. Consequently, InterMapper will take nine seconds to declare a
device is down (three probes, waiting three seconds each). Both the timeout
and the number of probes can be set for each device.
This often gives rise to 21 second or 51 second outages. What's happening
here is that the device fails to respond to one set of probes (for example,
after nine seconds), but responds immediately at the next poll 30 or 60
seconds later. This gives an outage duration to be (30-9=21) seconds or
(60-9=51) seconds.
TCP-based probes
Probes like "HTTP", "SMTP", and "LDAP" and others test the ability of a
server to accept a TCP connection on a specific listening port, and to
respond to a scripted interchange.
a.. For more information on TCP-based probes see Server Probes - Standard
.
What Happens in a TCP-based interchange
1.. InterMapper first attempts to connect to the specified port at the
device's address.
2.. If this connection attempt fails, InterMapper shows the device in the
DOWN state.
If InterMapper successfully connects to the listening port, InterMapper
sends protocol-specific commands through the TCP connection to test the
server's responses and compare them to expected values.
3.. InterMapper changes the status of the device (e.g. ALARM, WARNING,
OKAY, DOWN) if an error condition is detected, or if the execution of
InterMapper's probe is interrupted for any reason.
4.. If InterMapper doesn't receive a proper response for 60 seconds, or if
the TCP connection is lost while waiting for a response, the InterMapper
probe will set status of the device to the proper condition.
Bob Merrill (Documentation Guy)
----- Original Message -----
From: "Andrey Gordon" <[email protected]>
To: "InterMapper Discussion" <[email protected]>
Sent: Tuesday, September 01, 2009 3:16 PM
Subject: [IM-Talk] False positives protection
You can probably tell I got back to tweaking IM again.....
Anyway, I was curious if there is a mechanism that is built in into IM
server to double check the failures. For example, if a probe detects a
condition that normally triggers an alert (let's say Alarm). Does
InterMapper double check that the condition in fact exists or it was just
a blip, glitch, etc before alerting?
I'd be interested to see a feature that allows control over that kind of
behavior. Let's say, just like under Notifiers one can control the number
of repeating alerts, it would be nice to control there number of times
the condition has to be triggered before alerts goes out.
The reason this is important is so we could make sure that the condition
is there before sending an alert to a pager at 3am. I'd like to be able
to setup alerts going to email after one occurrence and to pager after,
let's say 3. Many times, but the time I crawl out of bed to see what it
is, it already cleared.
Setting it up with delayed notification kind of does the job, but not
really. All it does is delays the notification for a minimum of a minute,
which with default probe timers only gives it an extra one time to probe
it.
I'd like to see Intermapper after detecting a condition being able to
verify it a controllable number of times before sending an alert to a
give notifier.
Consider this a feature request, but I'd still want to understand what
the logic of this behavior is today as implemented. I'm suspecting many
on this list would be interested to hear how IM behaves on condition
detection....
Thank you,
Andrey
__________________________________________________________
Andrey Gordon | Integrity Interactive | Network Engineer |
+1.781.398.3518
____________________________________________________________________
List archives:
http://www.mail-archive.com/intermapper-talk%40list.dartware.com/
To unsubscribe: send email to: [email protected]
____________________________________________________________________
List archives:
http://www.mail-archive.com/intermapper-talk%40list.dartware.com/
To unsubscribe: send email to: [email protected]