Re: [Nagios-users] lazy nagios und cygwin...
That's surprising, but good news. I hope you don't rely on it Oh, we do, after running it for half a year without problems the hard part was to get the standard check scripts running as they won't compile under cygwin. The main nagios executable compiles and runs nicely (hem... almost..) with --enable-cygwin . Interesting definition of "quite successfully". Well, as said, the error did not appear always. It happens now and then, sometimes twice a day and then the system runs for weeks. I've never seen anything like it. I haven't and we are running other nagios installations too, on several different linux installations. Cygwin != unix. At the core, windows deals with files and sockets in a very different way to linux and solaris (and probably other unixes) It wouldn't surprise me if the something changed with the status file. If you can't install a linux distro onto the box, run up a virtualbox environment to host nagios in. It wouldn't be perfect, but will probably be better than cygwin. For reasons that are quite complicated in this case it is not an option to run nagios in a virtual box. I posed this question here mainly to see if this happens with unices also - if not, it is probably something for the cygwin mailing list. I just observed that after ending one of the two existing nagios processes with the windows task manager, things continued to run normally without a restart. So now I suspect it is either a blocked file access or some fork() problem- these sometimes happen with cygwin. I enabled the large installation quirks that should reduce the number of forks, let's see if that helps. Regards, Jakob -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] lazy nagios und cygwin...
This is gonna be hard to debug with out any debugging output. I concur, it is by no means "normal" for nagios to "just stop doing anything" without user intervention. For comparison, I'm running Nagios 3.0.6 from Debian with 34 hosts and 107 services. It has been running continually for 3 months, which is when I first brought it online. -lee On Tue, Feb 24, 2009 at 9:42 AM, Jakob Curdes wrote: > Hello, > > we are running nagios 3.0.5 under cygwin quite successfully in a > relatively simple setup (most check results come via NRPE from NSCA++ > clients on windows machines). Everything is well, but sometimes a > awkward thing happens: nagios just stops doing anything. > > - It continues to run : the processes are visible in the Windows process > table > - It ceases to execute checks: checks just stop bein executed > - It ceases to write anything to a logfile > - It also ceases to write anything to a debug file with Debug=16 (last > entry is nothing suspicous) > > After killing the nagios processes and restarting the windows service > (nagios is registered as a windows service using the cygrunsrv facility) > everything is back to normal. > What I find most annoying is that even in the CGI output everything > seems normal - just that the last checks are several hours overdue, but > no alarm or other signal is shown. > > We are not sure if this is cygwin/windows related or if it is a problem > which also happens in the "normal" unix environment. > Comments would be very welcome. I will now turn up debugging further in > the hope that then we see something interesting. > > Regards, > Jakob Curdes > > -- > Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA > -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise > -Strategies to boost innovation and cut costs with open source participation > -Receive a $600 discount off the registration fee with the source code: SFAD > http://p.sf.net/sfu/XcvMzF8H > ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] lazy nagios und cygwin...
> Hello, > > we are running nagios 3.0.5 under cygwin quite successfully > in a relatively simple setup That's surprising, but good news. I hope you don't rely on it > (most check results come via > NRPE from NSCA++ clients on windows machines). Everything is > well, but sometimes a awkward thing happens: nagios just > stops doing anything. > > - It continues to run : the processes are visible in the > Windows process table > - It ceases to execute checks: checks just stop bein executed > - It ceases to write anything to a logfile > - It also ceases to write anything to a debug file with > Debug=16 (last entry is nothing suspicous) Interesting definition of "quite successfully". > After killing the nagios processes and restarting the windows > service (nagios is registered as a windows service using the > cygrunsrv facility) everything is back to normal. > What I find most annoying is that even in the CGI output > everything seems normal - just that the last checks are > several hours overdue, but no alarm or other signal is shown. > > We are not sure if this is cygwin/windows related or if it is > a problem which also happens in the "normal" unix environment I've never seen anything like it. Our main nagios installation has failed in the past -- the disk has filled up, preventing status updates, and the power has failed -- not the box UPS power, but the Aircon GS power, so we had to shut down as much as possible. If it happens again we wont be shutting down nagios, too essential. The Aircon should be powered by diesel generators, they didn't kick in (not nagios monitored you see!) > Comments would be very welcome. I will now turn up debugging > further in the hope that then we see something interesting. Cygwin != unix. At the core, windows deals with files and sockets in a very different way to linux and solaris (and probably other unixes) It wouldn't surprise me if the something changed with the status file. If you can't install a linux distro onto the box, run up a virtualbox environment to host nagios in. It wouldn't be perfect, but will probably be better than cygwin. -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] lazy nagios und cygwin...
On Feb 24, 2009, at 8:42 AM, Jakob Curdes wrote: > > We are not sure if this is cygwin/windows related or if it is a > problem > which also happens in the "normal" unix environment. It is not normal under unix. -- Marc -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] lazy nagios und cygwin...
Hello, we are running nagios 3.0.5 under cygwin quite successfully in a relatively simple setup (most check results come via NRPE from NSCA++ clients on windows machines). Everything is well, but sometimes a awkward thing happens: nagios just stops doing anything. - It continues to run : the processes are visible in the Windows process table - It ceases to execute checks: checks just stop bein executed - It ceases to write anything to a logfile - It also ceases to write anything to a debug file with Debug=16 (last entry is nothing suspicous) After killing the nagios processes and restarting the windows service (nagios is registered as a windows service using the cygrunsrv facility) everything is back to normal. What I find most annoying is that even in the CGI output everything seems normal - just that the last checks are several hours overdue, but no alarm or other signal is shown. We are not sure if this is cygwin/windows related or if it is a problem which also happens in the "normal" unix environment. Comments would be very welcome. I will now turn up debugging further in the hope that then we see something interesting. Regards, Jakob Curdes -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null