Re: [Nagios-users] Issues with NEB modules breaking after restart

2013-06-14 Thread Andrew Widdersheim
divTo answer my own question... I'm pretty sure two nagios instances were 
spawned at once. The nagios init script that comes with nagios-core is the best 
at handling this situation./div
  
--
This SF.net email is sponsored by Windows:

Build for Windows Store.

http://p.sf.net/sfu/windows-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Issues with NEB modules breaking after restart

2013-06-13 Thread Andrew Widdersheim
I recently just upgraded to the latest 3.5.0 release of nagios-core and just 
added livestatus into my environment. We are trying to replace NDO but 
currently have the two running at the same time along with NCPD for perfdata 
which as far as I know there shouldn't be an issues .

The first issue I had was where Nagios would segfault every night during it's 
routine log rotation so I applied the 0007-fix_downtime_struct.dif patch which 
seems to have fixed that issue. I experienced a new issue this morning where 
when restarting Nagios none of the NEB modules uninitialized properly. Nagios 
was able to start and initialized all of the NEB modules but a few seconds 
later Nagios uninitialized them again. This isn't like anything I've seen 
before and none of the NEB modules worked after this occurred. Here is what the 
logs looked like.


[Thu Jun 13 09:30:29 2013] Caught SIGTERM, shutting down...
[Thu Jun 13 09:30:30 2013] Successfully shutdown... (PID=14098)
[Thu Jun 13 09:30:31 2013] livestatus: Socket thread has terminated
[Thu Jun 13 09:30:41 2013] Nagios 3.5.0 starting... (PID=481)
[Thu Jun 13 09:30:41 2013] Local time is Thu Jun 13 09:30:41 EDT 2013
[Thu Jun 13 09:30:41 2013] LOG VERSION: 2.0
[Thu Jun 13 09:30:41 2013] livestatus: Livestatus 1.2.2p2 by Mathias Kettner. 
Socket: '/usr/local/nagios/var/rw/livestatus.sock'
[Thu Jun 13 09:30:41 2013] livestatus: Please visit us at 
http://mathias-kettner.de/
[Thu Jun 13 09:30:41 2013] livestatus: Hint: please try out OMD - the Open 
Monitoring Distribution
[Thu Jun 13 09:30:41 2013] livestatus: Please visit OMD at http://omdistro.org
[Thu Jun 13 09:30:41 2013] livestatus: Removed old left over socket file 
/usr/local/nagios/var/rw/livestatus.sock
[Thu Jun 13 09:30:41 2013] livestatus: archive path /drbd/r1/nagios/archives
[Thu Jun 13 09:30:41 2013] livestatus: Finished initialization. Further log 
messages go to /drbd/r1/nagios/livestatus.log
[Thu Jun 13 09:30:41 2013] Event broker module 
'/usr/local/mk-livestatus/livestatus.o' initialized successfully.
[Thu Jun 13 09:30:41 2013] npcdmod: Copyright (c) 2008-2009 Hendrik Baecker 
(andu...@process-zero.de) - http://www.pnp4nagios.org
[Thu Jun 13 09:30:41 2013] npcdmod: /usr/local/pnp4nagios/etc/npcd.cfg 
initialized
[Thu Jun 13 09:30:41 2013] npcdmod: spool_dir = 
'/dev/shm/pnp4nagios/var/spool/'.
[Thu Jun 13 09:30:41 2013] npcdmod: perfdata file 
'/dev/shm/pnp4nagios/var/perfdata.dump'.
[Thu Jun 13 09:30:41 2013] npcdmod: Ready to run to have some fun!
[Thu Jun 13 09:30:41 2013] livestatus: Timeperiod cache not updated, there are 
no timeperiods (yet)
[Thu Jun 13 09:30:41 2013] Event broker module 
'/usr/local/pnp4nagios/lib64/npcdmod.o' initialized successfully.
[Thu Jun 13 09:30:41 2013] ndomod: NDOMOD 1.5.2 (06-08-2012) Copyright (c) 2009 
Nagios Core Development Team and Community Contributors
[Thu Jun 13 09:30:41 2013] ndomod: Successfully connected to data sink.  0 
queued items to flush.
[Thu Jun 13 09:30:41 2013] Event broker module '/usr/local/nagios/bin/ndomod.o' 
initialized successfully.
[Thu Jun 13 09:30:43 2013] Finished daemonizing... (New PID=482)
[Thu Jun 13 09:30:44 2013] TIMEPERIOD TRANSITION: 24x7;-1;1
[Thu Jun 13 09:30:47 2013] Event broker module 
'/usr/local/mk-livestatus/livestatus.o' deinitialized successfully.
[Thu Jun 13 09:30:47 2013] npcdmod: If you don't like me, I will go out! Bye.
[Thu Jun 13 09:30:47 2013] Event broker module 
'/usr/local/pnp4nagios/lib64/npcdmod.o' deinitialized successfully.
[Thu Jun 13 09:30:47 2013] ndomod: Shutdown complete.
[Thu Jun 13 09:30:47 2013] Event broker module '/usr/local/nagios/bin/ndomod.o' 
deinitialized successfully.

Here is the next restart after this where things happened as I would expect:

[Thu Jun 13 09:52:25 2013] Successfully shutdown... (PID=482)
[Thu Jun 13 09:52:26 2013] livestatus: Socket thread has terminated
[Thu Jun 13 09:52:26 2013] Event broker module 
'/usr/local/mk-livestatus/livestatus.o' deinitialized successfully.
[Thu Jun 13 09:52:26 2013] npcdmod: If you don't like me, I will go out! Bye.
[Thu Jun 13 09:52:26 2013] Event broker module 
'/usr/local/pnp4nagios/lib64/npcdmod.o' deinitialized successfully.
[Thu Jun 13 09:52:26 2013] ndomod: Shutdown complete.
[Thu Jun 13 09:52:26 2013] Event broker module '/usr/local/nagios/bin/ndomod.o' 
deinitialized successfully.
[Thu Jun 13 09:52:29 2013] Nagios 3.5.0 starting... (PID=20081)
[Thu Jun 13 09:52:29 2013] Local time is Thu Jun 13 09:52:29 EDT 2013
[Thu Jun 13 09:52:29 2013] LOG VERSION: 2.0
[Thu Jun 13 09:52:29 2013] livestatus: Livestatus 1.2.2p2 by Mathias Kettner. 
Socket: '/usr/local/nagios/var/rw/livestatus.sock'
[Thu Jun 13 09:52:29 2013] livestatus: Please visit us at 
http://mathias-kettner.de/
[Thu Jun 13 09:52:29 2013] livestatus: Hint: please try out OMD - the Open 
Monitoring Distribution
[Thu Jun 13 09:52:29 2013] livestatus: Please visit OMD at http://omdistro.org
[Thu Jun 13 09:52:29 2013] livestatus: archive path /drbd/r1/nagios/archives
[Thu Jun 

Re: [Nagios-users] High Availabilty with Nagios

2013-05-09 Thread Andrew Widdersheim
I did a talk at last years conference that touches on HA Nagios setup which 
uses DRBD and pacemaker. There were also talks about mod_gearman and Merlin 
that might also be helpful. The slides (and maybe video?) are available on 
nagios.org. Here is a link to my slides:

http://www.slideshare.net/nagiosinc/andrew-widdersheim-nagiosisdownbosswantstosee-you
 
--
Learn Graph Databases - Download FREE O'Reilly Book
Graph Databases is the definitive new guide to graph databases and 
their applications. This 200-page book is written by three acclaimed 
leaders in the field. The early access version is available now. 
Download your free book today! http://p.sf.net/sfu/neotech_d2d_may
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] solutions for off-server PNP4Nagios perfdata processing?‏

2012-10-03 Thread Andrew Widdersheim
There are two options that I know of. The first is there is an undocumented NEB 
module that comes with PNP4Nagios located in src/module called modpnpsender.c 
that looks like it send data to a remote server to get processed. If anything 
you can use it as a starting point. 

This blog article talks about using it and if you look at the source you can 
find the original author and possibly more details: 
 
--
Don't let slow site performance ruin your business. Deploy New Relic APM
Deploy New Relic app performance management and know exactly
what is happening inside your Ruby, Python, PHP, Java, and .NET app
Try New Relic at no cost today and get our sweet Data Nerd shirt too!
http://p.sf.net/sfu/newrelic-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] solutions for off-server PNP4Nagios perfdata processing?

2012-10-03 Thread Andrew Widdersheim
There are two options that I know of. The first is there is an undocumented NEB 
module that comes with PNP4Nagios located in src/module called modpnpsender.c 
that looks like it send data to a remote server to get processed. If anything 
you can use it as a starting point. 

This blog article talks about using it and if you look at the source you can 
find the original author and possibly more details: 
http://www.semintelligent.com/blog/articles/38/nagios-performance-tuning-early-lessons-learned-lessons-shared-part-4-scalable-performance-data-graphing
The second option is setting up mod_gearman which can put perfdata into it's 
own queue and configure PNP4Nagios as a gearman worker to pick up the data 
remotely and process it. All of this is described here:
http://docs.pnp4nagios.org/pnp-0.6/modes#gearman_mode
-Andrew W.
--
Don't let slow site performance ruin your business. Deploy New Relic APM
Deploy New Relic app performance management and know exactly
what is happening inside your Ruby, Python, PHP, Java, and .NET app
Try New Relic at no cost today and get our sweet Data Nerd shirt too!
http://p.sf.net/sfu/newrelic-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null