Re: [Nagios-users] Issues with NEB modules breaking after restart
divTo answer my own question... I'm pretty sure two nagios instances were spawned at once. The nagios init script that comes with nagios-core is the best at handling this situation./div -- This SF.net email is sponsored by Windows: Build for Windows Store. http://p.sf.net/sfu/windows-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Issues with NEB modules breaking after restart
I recently just upgraded to the latest 3.5.0 release of nagios-core and just added livestatus into my environment. We are trying to replace NDO but currently have the two running at the same time along with NCPD for perfdata which as far as I know there shouldn't be an issues . The first issue I had was where Nagios would segfault every night during it's routine log rotation so I applied the 0007-fix_downtime_struct.dif patch which seems to have fixed that issue. I experienced a new issue this morning where when restarting Nagios none of the NEB modules uninitialized properly. Nagios was able to start and initialized all of the NEB modules but a few seconds later Nagios uninitialized them again. This isn't like anything I've seen before and none of the NEB modules worked after this occurred. Here is what the logs looked like. [Thu Jun 13 09:30:29 2013] Caught SIGTERM, shutting down... [Thu Jun 13 09:30:30 2013] Successfully shutdown... (PID=14098) [Thu Jun 13 09:30:31 2013] livestatus: Socket thread has terminated [Thu Jun 13 09:30:41 2013] Nagios 3.5.0 starting... (PID=481) [Thu Jun 13 09:30:41 2013] Local time is Thu Jun 13 09:30:41 EDT 2013 [Thu Jun 13 09:30:41 2013] LOG VERSION: 2.0 [Thu Jun 13 09:30:41 2013] livestatus: Livestatus 1.2.2p2 by Mathias Kettner. Socket: '/usr/local/nagios/var/rw/livestatus.sock' [Thu Jun 13 09:30:41 2013] livestatus: Please visit us at http://mathias-kettner.de/ [Thu Jun 13 09:30:41 2013] livestatus: Hint: please try out OMD - the Open Monitoring Distribution [Thu Jun 13 09:30:41 2013] livestatus: Please visit OMD at http://omdistro.org [Thu Jun 13 09:30:41 2013] livestatus: Removed old left over socket file /usr/local/nagios/var/rw/livestatus.sock [Thu Jun 13 09:30:41 2013] livestatus: archive path /drbd/r1/nagios/archives [Thu Jun 13 09:30:41 2013] livestatus: Finished initialization. Further log messages go to /drbd/r1/nagios/livestatus.log [Thu Jun 13 09:30:41 2013] Event broker module '/usr/local/mk-livestatus/livestatus.o' initialized successfully. [Thu Jun 13 09:30:41 2013] npcdmod: Copyright (c) 2008-2009 Hendrik Baecker (andu...@process-zero.de) - http://www.pnp4nagios.org [Thu Jun 13 09:30:41 2013] npcdmod: /usr/local/pnp4nagios/etc/npcd.cfg initialized [Thu Jun 13 09:30:41 2013] npcdmod: spool_dir = '/dev/shm/pnp4nagios/var/spool/'. [Thu Jun 13 09:30:41 2013] npcdmod: perfdata file '/dev/shm/pnp4nagios/var/perfdata.dump'. [Thu Jun 13 09:30:41 2013] npcdmod: Ready to run to have some fun! [Thu Jun 13 09:30:41 2013] livestatus: Timeperiod cache not updated, there are no timeperiods (yet) [Thu Jun 13 09:30:41 2013] Event broker module '/usr/local/pnp4nagios/lib64/npcdmod.o' initialized successfully. [Thu Jun 13 09:30:41 2013] ndomod: NDOMOD 1.5.2 (06-08-2012) Copyright (c) 2009 Nagios Core Development Team and Community Contributors [Thu Jun 13 09:30:41 2013] ndomod: Successfully connected to data sink. 0 queued items to flush. [Thu Jun 13 09:30:41 2013] Event broker module '/usr/local/nagios/bin/ndomod.o' initialized successfully. [Thu Jun 13 09:30:43 2013] Finished daemonizing... (New PID=482) [Thu Jun 13 09:30:44 2013] TIMEPERIOD TRANSITION: 24x7;-1;1 [Thu Jun 13 09:30:47 2013] Event broker module '/usr/local/mk-livestatus/livestatus.o' deinitialized successfully. [Thu Jun 13 09:30:47 2013] npcdmod: If you don't like me, I will go out! Bye. [Thu Jun 13 09:30:47 2013] Event broker module '/usr/local/pnp4nagios/lib64/npcdmod.o' deinitialized successfully. [Thu Jun 13 09:30:47 2013] ndomod: Shutdown complete. [Thu Jun 13 09:30:47 2013] Event broker module '/usr/local/nagios/bin/ndomod.o' deinitialized successfully. Here is the next restart after this where things happened as I would expect: [Thu Jun 13 09:52:25 2013] Successfully shutdown... (PID=482) [Thu Jun 13 09:52:26 2013] livestatus: Socket thread has terminated [Thu Jun 13 09:52:26 2013] Event broker module '/usr/local/mk-livestatus/livestatus.o' deinitialized successfully. [Thu Jun 13 09:52:26 2013] npcdmod: If you don't like me, I will go out! Bye. [Thu Jun 13 09:52:26 2013] Event broker module '/usr/local/pnp4nagios/lib64/npcdmod.o' deinitialized successfully. [Thu Jun 13 09:52:26 2013] ndomod: Shutdown complete. [Thu Jun 13 09:52:26 2013] Event broker module '/usr/local/nagios/bin/ndomod.o' deinitialized successfully. [Thu Jun 13 09:52:29 2013] Nagios 3.5.0 starting... (PID=20081) [Thu Jun 13 09:52:29 2013] Local time is Thu Jun 13 09:52:29 EDT 2013 [Thu Jun 13 09:52:29 2013] LOG VERSION: 2.0 [Thu Jun 13 09:52:29 2013] livestatus: Livestatus 1.2.2p2 by Mathias Kettner. Socket: '/usr/local/nagios/var/rw/livestatus.sock' [Thu Jun 13 09:52:29 2013] livestatus: Please visit us at http://mathias-kettner.de/ [Thu Jun 13 09:52:29 2013] livestatus: Hint: please try out OMD - the Open Monitoring Distribution [Thu Jun 13 09:52:29 2013] livestatus: Please visit OMD at http://omdistro.org [Thu Jun 13 09:52:29 2013] livestatus: archive path /drbd/r1/nagios/archives [Thu Jun
Re: [Nagios-users] High Availabilty with Nagios
I did a talk at last years conference that touches on HA Nagios setup which uses DRBD and pacemaker. There were also talks about mod_gearman and Merlin that might also be helpful. The slides (and maybe video?) are available on nagios.org. Here is a link to my slides: http://www.slideshare.net/nagiosinc/andrew-widdersheim-nagiosisdownbosswantstosee-you -- Learn Graph Databases - Download FREE O'Reilly Book Graph Databases is the definitive new guide to graph databases and their applications. This 200-page book is written by three acclaimed leaders in the field. The early access version is available now. Download your free book today! http://p.sf.net/sfu/neotech_d2d_may ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] solutions for off-server PNP4Nagios perfdata processing?
There are two options that I know of. The first is there is an undocumented NEB module that comes with PNP4Nagios located in src/module called modpnpsender.c that looks like it send data to a remote server to get processed. If anything you can use it as a starting point. This blog article talks about using it and if you look at the source you can find the original author and possibly more details: -- Don't let slow site performance ruin your business. Deploy New Relic APM Deploy New Relic app performance management and know exactly what is happening inside your Ruby, Python, PHP, Java, and .NET app Try New Relic at no cost today and get our sweet Data Nerd shirt too! http://p.sf.net/sfu/newrelic-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] solutions for off-server PNP4Nagios perfdata processing?
There are two options that I know of. The first is there is an undocumented NEB module that comes with PNP4Nagios located in src/module called modpnpsender.c that looks like it send data to a remote server to get processed. If anything you can use it as a starting point. This blog article talks about using it and if you look at the source you can find the original author and possibly more details: http://www.semintelligent.com/blog/articles/38/nagios-performance-tuning-early-lessons-learned-lessons-shared-part-4-scalable-performance-data-graphing The second option is setting up mod_gearman which can put perfdata into it's own queue and configure PNP4Nagios as a gearman worker to pick up the data remotely and process it. All of this is described here: http://docs.pnp4nagios.org/pnp-0.6/modes#gearman_mode -Andrew W. -- Don't let slow site performance ruin your business. Deploy New Relic APM Deploy New Relic app performance management and know exactly what is happening inside your Ruby, Python, PHP, Java, and .NET app Try New Relic at no cost today and get our sweet Data Nerd shirt too! http://p.sf.net/sfu/newrelic-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null