I've run into a case where a monitor failed and the detailed output
shows that it received the first four of it's eight hostgroup arguments
from the wrong hostgroup. The wrong arguments seem to replace the
expected arguments and are from a four member hostgroup defined right
before the hostgroup that was supposed to be used.  The last 4
arguments are correct. The mail.alert plugin which informed me of the
failure, shows the correct parameters in "Members:".

Is there a known problem like this?

In case anyone wants some details...

Here is the mail alert (The summary output came from wrapper script,
the guts of the monitor created the table which shows the mixed up
arguments.):


Summary output                 : rm: allout: A file or directory in the path
name
does not exist.

Group                 : TPLSIPDUPL
Service               : LSIP
Time noticed          : Wed Jan 26 07:31:59 2005
Secs until next alert : 
Members                        : lsip_.250.h1.drq lsip_.250.h1.drq 
lsip_.250.h2.drq
                                                 lsip_.250.h2.drq 
lsip_.867.h1.drq lsip_.867.h1.drq
                                                 lsip_.867.h2.drq 
lsip_.867.h2.drq

Detailed text (if any) follows:
-------------------------------
DASTX->2, Some (3/4) txs Failed.   Err tx#: 1,2,4
tx#|____req
file____|____status____|_resp_|len_|st|sv|__qual____|ql|_server_|r#|sq|____sent____|_req
path
  1 lsip1.001.h1.drq Err Das No NPA     10   50 00 1b 0101000000 16
0011891b 00 00   1153545578 
  2 lsip1.001.h2.drq Err No Path        10   50 02 1b 0102000000 16
00000000 01 01   1153545578 
  3 lsip2.002.h1.drq OK                100   46 00 1b 010161cb78 16
0011891b 03 02   1153545588 
  4 lsip2.002.h2.drq Err No Path         0   50 02 1b 0102000000 16
00000000 02 03   1153545588 

  5 lsip_.867.h1.drq Err Das No NPA     10   50 00 1b 0101000000 16
0011891b 04 04   1153545618 
  6 lsip_.867.h1.drq Err Das No NPA      0   50 00 1b 0101000000 16
0011891b 05 05   1153545628 
  7 lsip_.867.h2.drq Err No Path         0   50 02 1b 0102000000 16
00000000 06 06   1153545628 
  8 lsip_.867.h2.drq Err No Path         0   50 02 1b 0102000000 16
00000000 07 07   1153545628 






Here is a snip of the .cf (Note the ^M from a PC editor.  They have not
seemed to be a problem so far.):

...

hostgroup TPLSIPUNIQ lsip1.001.h1.drq lsip1.001.h2.drq lsip2.002.h1.drq
lsip2.002.h2.drq ^M
^M
hostgroup TPLSIPDUPL lsip_.250.h1.drq lsip_.250.h1.drq lsip_.250.h2.drq
lsip_.250.h2.drq lsip_.867.h1.drq lsip_.867.h1.drq lsip_.867.h2.drq
lsip_
.867.h2.drq ^M
^M

...

watch TPLSIPDUPL^M
    service LSIP ^M
        interval 10s^M
        monitor lsip.monitor  ^M
        period wd {Mon-Sun}^M
            alert mail.alert [EMAIL PROTECTED]
            alert mail.alert [EMAIL PROTECTED]
            upalert mail.alert [EMAIL PROTECTED]
            alertevery 3m^M

...



Thanks in advance,

Michael Vogt




                
__________________________________ 
Do you Yahoo!? 
Yahoo! Mail - Find what you need with new enhanced search.
http://info.mail.yahoo.com/mail_250

_______________________________________________
mon mailing list
mon@linux.kernel.org
http://linux.kernel.org/mailman/listinfo/mon

Reply via email to