Hi all!
Here's my Mon 0.99.2 configuration:
# Very simple mon.cf file
#
# $Id: very-simple.cf 1.1 Sat, 26 Aug 2000 15:22:34 -0400 trockij $
#
alertdir = /usr/lib/mon/alert.d
mondir = /usr/lib/mon/mon.d
logdir = /usr/lib/mon/log.d
maxprocs = 20
histlength = 100
randstart = 60s
authtype = getpwnam
userfile = /usr/lib/mon/etc/auth.cf
dtlogging = yes
dtlogfile = /usr/lib/mon/log.d/downtime.log
hostgroup Ext nsext.mydomain.com
watch Ext
    service ping
        description Ping to Ext DNS every 4min.
        interval 4m
        monitor fping.monitor
        allow_empty_group
        period wd {Sat-Sund}
            alert smsrouter.alert
            alertafter 2m
            alertevery 2h
            upalert smsrouterup.alert
################################################
Yesterday the router service went down, so the 'Ext' watch began to fail.
The problem is that the failure lasted 1h 39 minutes and never
became an alert, so no alert was sent during that time.
This is a serious problem for us, as the service we are monitoring is our
router connectivity.
Any ideas why this could happen?
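One thing I can check myself is whether the failure fell inside the period at all. Below is a rough Python sketch of that check. It assumes a weekday range such as wd {Sat-Sund} is inclusive and wraps around the end of the week, and that "Sund" is read as Sunday; I have not verified either against Mon itself. The function name and the dates are only for illustration.

```python
from datetime import datetime

# Week ordered Sunday..Saturday (assumption: this matches the period syntax).
DAYS = ["sun", "mon", "tue", "wed", "thu", "fri", "sat"]

def in_weekday_range(when, start, end):
    """Return True if `when` falls in the inclusive weekday range start-end."""
    s = DAYS.index(start[:3].lower())
    e = DAYS.index(end[:3].lower())
    d = (when.weekday() + 1) % 7  # Python has Monday=0; shift so Sunday=0
    if s <= e:
        return s <= d <= e
    return d >= s or d <= e  # range wraps past Saturday

# Arbitrary example dates: 2024-01-03 is a Wednesday, 2024-01-06 a Saturday.
print(in_weekday_range(datetime(2024, 1, 3), "Sat", "Sund"))
print(in_weekday_range(datetime(2024, 1, 6), "Sat", "Sund"))
```

If those assumptions hold, a range of Sat-Sund covers only the weekend, while Sun-Sat would cover the whole week.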
We also need to monitor the response time of the service.
I mean the service should 'fail' only if fping did not get a response within xx minutes.
I've read how to do this before, but I can't find it right now.
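To make clear what I mean by a response-time check, here is a minimal Python sketch. It assumes the output of "fping -e", which prints lines like "host is alive (1.23 ms)", and treats any host that is unreachable or slower than a threshold as failed. The function name slow_or_dead, the threshold and the sample output are made up for illustration; this is not how fping.monitor itself works.

```python
import re

# Matches lines such as "nsext.mydomain.com is alive (1.23 ms)".
ALIVE = re.compile(r"^(\S+) is alive \(([\d.]+) ms\)")

def slow_or_dead(fping_output, max_ms):
    """Return hosts that did not answer, or answered slower than max_ms."""
    failed = []
    for line in fping_output.splitlines():
        line = line.strip()
        if not line:
            continue
        m = ALIVE.match(line)
        if m:
            if float(m.group(2)) > max_ms:
                failed.append(m.group(1))  # alive, but too slow
        else:
            failed.append(line.split()[0])  # e.g. "host is unreachable"
    return failed

# Hand-written sample output, not taken from a real run.
sample = """nsext.mydomain.com is alive (850 ms)
other.mydomain.com is alive (12.5 ms)
dead.mydomain.com is unreachable"""

print(slow_or_dead(sample, max_ms=500))
```

A wrapper monitor built on this could print the failed hosts and exit non-zero when the list is not empty, which as far as I understand is how Mon decides that a monitor failed.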
Any help would be appreciated.
Thank you very much,
GB
_______________________________________________
mon mailing list
mon@linux.kernel.org
http://linux.kernel.org/mailman/listinfo/mon