On 12/20/10 8:16 AM, [email protected] wrote:
> Alternatively, could you recommend a good system/resource monitoring tool 
> that would be able to let me know if nagios is down and restart it 
> automatically?
>
Add a cronjob on a five (or whatever you're comfortable with) minute 
interval, similar to:
#!/bin/bash

PATH=/bin:/usr/bin:/usr/local/bin
PID=`cat /home/nagios/nagios/var/nagios.lock`
PIDTEST=`kill -0 ${PID} 2>&1 >/dev/null`

if [ "${PIDTEST}" -eq "1" ]
then
     rm /home/nagios/nagios/var/nagios.lock
     # INSERT RESTART COMMAND HERE
     echo "Killed Lockfile and restarted Nagios" | mail -s "Nagios 
restart `hostname`" [email protected]
fi
 >>>

Just be aware that it'll also trigger that if block, if nagios is 
running under a different username.  You can check for that by doing 
some tests in the script with ps and grep.

> _____________________________________________
> From:   Berg, Eric: IT (NYK)
> Sent:   Monday, December 20, 2010 11:03 AM
> To:     '[email protected]'
> Subject:        Nagios kept from restarting after reboot by lock file
>
> Gee, this seems like an annoying newbie problem, but if Nagios crashes or is 
> killed (as on system reboot), it leaves a lock file around that prevents it 
> from starting again until the lock file is manually removed.
>
> I see this on Monday mornings after weekend reboots on a Red Hat Linux box:
>
> nagios: Lockfile '/home/nagios/nagios/var/nagios.lock' looks like its already 
> held by another instance of Nagios (PID 0).  Bailing out...
Sounds like something in the shutdown process is throwing a 0 into the 
pid file, or the startup in the rc script is.

Either way, you should never have a 0 in there, either the rc script is 
putting the wrong data in there, or it's reporting incorrectly.
> Does anyone know if there's a config option or something else that obviates 
> the need to write a wrapper scropt to check to see if Nagios is really 
> running and remove the lock file (look slike Nagios already knows it's not 
> running by virtue of the value of the PID inthis very message!) so that it 
> can cleanly start up again?

-- 
Mike Lindsey


------------------------------------------------------------------------------
Lotusphere 2011
Register now for Lotusphere 2011 and learn how
to connect the dots, take your collaborative environment
to the next level, and enter the era of Social Business.
http://p.sf.net/sfu/lotusphere-d2d
_______________________________________________
Nagios-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Reply via email to