Re: [Nagios-users] lazy nagios und cygwin...

2009-02-24 Thread Jakob Curdes




That's surprising, but good news. I hope you don't rely on it
  
Oh, we do, after running it for half a year without problems the 
hard part was to get the standard check scripts running as they won't 
compile under cygwin.
The main nagios executable compiles and runs nicely (hem... almost..) 
with --enable-cygwin .


Interesting definition of "quite successfully". 
  
Well, as said, the error did not appear always. It happens now and then, 
sometimes twice a day and then the system runs for weeks.
I've never seen anything like it. 
I haven't and we are running other nagios installations too, on several 
different linux installations.
Cygwin != unix. At the core, windows deals with files and sockets 
in a very different way to linux and solaris (and probably other 
unixes)


It wouldn't surprise me if the something changed with the status file. 
If you can't install a linux distro onto the box, run up a virtualbox 
environment to host nagios in. It wouldn't be perfect, but will probably


be better than cygwin.
  


For reasons that are quite complicated in this case it is not an option 
to run nagios in a virtual box.
I posed this question here mainly to see if this happens with unices 
also - if not, it is probably something for the cygwin mailing list.


I just observed that after ending one of the two existing nagios 
processes with the windows task manager, things continued to run 
normally without a restart.
So now I suspect it is either a blocked file access or some fork() 
problem- these sometimes happen with cygwin.
I enabled the large installation quirks that should reduce the number of 
forks, let's see if that helps.



Regards,
Jakob

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] lazy nagios und cygwin...

2009-02-24 Thread Lee Azzarello
This is gonna be hard to debug with out any debugging output.

I concur, it is by no means "normal" for nagios to "just stop doing
anything" without user intervention. For comparison, I'm running
Nagios 3.0.6 from Debian with 34 hosts and 107 services. It has been
running continually for 3 months, which is when I first brought it
online.

-lee

On Tue, Feb 24, 2009 at 9:42 AM, Jakob Curdes  wrote:
> Hello,
>
> we are running nagios 3.0.5 under cygwin quite successfully in a
> relatively simple setup (most check results come via NRPE from NSCA++
> clients on windows machines). Everything is well, but sometimes a
> awkward thing happens: nagios just stops doing anything.
>
> - It continues to run : the processes are visible in the Windows process
> table
> - It ceases to execute checks: checks just stop bein executed
> - It ceases to write anything to a logfile
> - It also ceases to write anything to a debug file with Debug=16 (last
> entry is nothing suspicous)
>
> After killing the nagios processes and restarting the windows service
> (nagios is registered as a windows service using the cygrunsrv facility)
> everything is back to normal.
> What I find most annoying is that even in the CGI output everything
> seems normal - just that the last checks are several hours overdue, but
> no alarm or other signal is shown.
>
> We are not sure if this is cygwin/windows related or if it is a problem
> which also happens in the "normal" unix environment.
> Comments would be very welcome. I will now turn up debugging further in
> the hope that then we see something interesting.
>
> Regards,
> Jakob Curdes
>
> --
> Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
> -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
> -Strategies to boost innovation and cut costs with open source participation
> -Receive a $600 discount off the registration fee with the source code: SFAD
> http://p.sf.net/sfu/XcvMzF8H
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting 
> any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] lazy nagios und cygwin...

2009-02-24 Thread Paul Weaver
> Hello,
> 
> we are running nagios 3.0.5 under cygwin quite successfully 
> in a relatively simple setup 

That's surprising, but good news. I hope you don't rely on it

> (most check results come via 
> NRPE from NSCA++ clients on windows machines). Everything is 
> well, but sometimes a awkward thing happens: nagios just 
> stops doing anything.
> 
> - It continues to run : the processes are visible in the 
> Windows process table
> - It ceases to execute checks: checks just stop bein executed
> - It ceases to write anything to a logfile
> - It also ceases to write anything to a debug file with 
> Debug=16 (last entry is nothing suspicous)

Interesting definition of "quite successfully". 

> After killing the nagios processes and restarting the windows 
> service (nagios is registered as a windows service using the 
> cygrunsrv facility) everything is back to normal.
> What I find most annoying is that even in the CGI output 
> everything seems normal - just that the last checks are 
> several hours overdue, but no alarm or other signal is shown.
> 
> We are not sure if this is cygwin/windows related or if it is 
> a problem which also happens in the "normal" unix environment

I've never seen anything like it. Our main nagios installation has 
failed in the past -- the disk has filled up, preventing status
updates, and the power has failed -- not the box UPS power, but 
the Aircon GS power, so we had to shut down as much as possible. If it 
happens again we wont be shutting down nagios, too essential.

The Aircon should be powered by diesel generators, they didn't kick in 
(not nagios monitored you see!)

> Comments would be very welcome. I will now turn up debugging 
> further in the hope that then we see something interesting.

Cygwin != unix. At the core, windows deals with files and sockets 
in a very different way to linux and solaris (and probably other 
unixes)

It wouldn't surprise me if the something changed with the status file. 
If you can't install a linux distro onto the box, run up a virtualbox 
environment to host nagios in. It wouldn't be perfect, but will probably

be better than cygwin.

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] lazy nagios und cygwin...

2009-02-24 Thread Marc Powell

On Feb 24, 2009, at 8:42 AM, Jakob Curdes wrote:

>
> We are not sure if this is cygwin/windows related or if it is a  
> problem
> which also happens in the "normal" unix environment.

It is not normal under unix.

--
Marc


--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] lazy nagios und cygwin...

2009-02-24 Thread Jakob Curdes
Hello,

we are running nagios 3.0.5 under cygwin quite successfully in a 
relatively simple setup (most check results come via NRPE from NSCA++ 
clients on windows machines). Everything is well, but sometimes a 
awkward thing happens: nagios just stops doing anything.

- It continues to run : the processes are visible in the Windows process 
table
- It ceases to execute checks: checks just stop bein executed
- It ceases to write anything to a logfile
- It also ceases to write anything to a debug file with Debug=16 (last 
entry is nothing suspicous)

After killing the nagios processes and restarting the windows service 
(nagios is registered as a windows service using the cygrunsrv facility)
everything is back to normal.
What I find most annoying is that even in the CGI output everything 
seems normal - just that the last checks are several hours overdue, but 
no alarm or other signal is shown.

We are not sure if this is cygwin/windows related or if it is a problem 
which also happens in the "normal" unix environment.
Comments would be very welcome. I will now turn up debugging further in 
the hope that then we see something interesting.

Regards,
Jakob Curdes

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null