Package: nagios-statd-server
Version: 3.12-1
Severity: normal

I am running nagios-statd on a number of machines, and periodically for reasons 
I cannot explain, a set of those machines suddenly stop responding to 
nagios-statd requests and I get a lot of nagios alerts "(Service Check Timed 
Out)". When i go to the system and investigate, I find that there are *two* 
nagios-statd processes running, not one:

nobody    5431  0.0  0.0   6368  2672 ?        Ss   Feb21   0:13 
/usr/bin/python /usr/sbin/nagios-statd --pid=/var/run/nagios-statd.pid 
nobody   13520  0.0  0.0   6368  2340 ?        S    Mar01   0:00 
/usr/bin/python /usr/sbin/nagios-statd --pid=/var/run/nagios-statd.pid 

The most recent one was one that was just started. This is puzzling because 
nagios-statd normally cannot start if something else is bound on its port 
(1040), but for some reason this second one has... and while it is running, the 
first one is unable to process requests. I have to kill both of these processes 
and then start up nagios-statd again for things to work like normal.

This is quite frustrating when it happens, because it requires killing two 
processes on a number of machines. I'm interested in any suggestions for 
troubleshooting/debugging as I am somewhat at a loss.

thanks,
micah

Note: this is different than #562645

-- System Information:
Debian Release: squeeze/sid
  APT prefers unstable
  APT policy: (500, 'unstable'), (1, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.32-trunk-vserver-686 (SMP w/1 CPU core)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages nagios-statd-server depends on:
ii  python                        2.5.4-9    An interactive high-level object-o

nagios-statd-server recommends no packages.

nagios-statd-server suggests no packages.



-- 
To UNSUBSCRIBE, email to [email protected]
with a subject of "unsubscribe". Trouble? Contact [email protected]

Reply via email to