[Nagios-users] Application monitoring using Nagios!!!
Hi,

I am using Nagios version 3.0.6 on a Linux server. Please let me know whether there is an option to configure credentials so that Nagios will automatically log in to an application and send an alert on success or failure.

Thanks,
Ravi G

--
Forrester recently released a report on the Return on Investment (ROI) of Google Apps. They found a 300% ROI, 38%-56% cost savings, and break-even within 7 months. Over 3 million businesses have gone Google with Google Apps: an online email calendar, and document program that's accessible from your browser. Read the Forrester report: http://p.sf.net/sfu/googleapps-sfnew
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile
eric.b...@barclayscapital.com wrote:
> It's weird: when I run nagios and kill it with -9, it leaves the pid
> file intact, but when I restart it, it zeroes out the pid file and starts
> just fine. When I just kill it with the default kill signal, it removes the
> pid file.

This isn't weird. That's how it should work. kill -9 sends SIGKILL, which cannot be caught or ignored, so the process gets no chance to clean up before exiting. The default kill signal is SIGTERM, which the process can catch and handle (or ignore).

Restarting Nagios from the web interface doesn't terminate and restart the process (the PID doesn't change); it only re-initializes it.
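The difference can be reproduced with a small shell sketch. The "daemon" here is a stand-in loop, not Nagios itself, and the function and file names are made up for illustration. It only shows that a SIGTERM handler can remove a PID file, while SIGKILL leaves the file behind.

```shell
#!/bin/sh
# Sketch only: a stand-in "daemon" (not Nagios) that removes its PID file
# when it receives SIGTERM. SIGKILL cannot be trapped, so after kill -9
# the PID file is left behind. All names here are illustrative.

workdir=$(mktemp -d)

# start_daemon PIDFILE: run a background loop that installs a SIGTERM
# handler deleting PIDFILE, then records its own PID in PIDFILE.
start_daemon() {
    sh -c '
        pidfile=$1
        trap "rm -f \"$pidfile\"; exit 0" TERM
        echo "$$" > "$pidfile"
        # Sleep in the background and wait, so the trap runs promptly.
        while :; do sleep 1 & wait "$!"; done
    ' daemon "$1" >/dev/null 2>&1 &
}

# stop_with SIGNAL PIDFILE: start the daemon, send SIGNAL, and store
# "removed" or "left behind" in $result.
stop_with() {
    start_daemon "$2"
    child=$!
    until [ -s "$2" ]; do sleep 1; done   # wait until the PID is written
    kill -s "$1" "$child"
    wait "$child" 2>/dev/null
    if [ -e "$2" ]; then result="left behind"; else result="removed"; fi
}

stop_with TERM "$workdir/term.pid"; term_result=$result
stop_with KILL "$workdir/kill.pid"; kill_result=$result

echo "SIGTERM: PID file $term_result"
echo "SIGKILL: PID file $kill_result"
rm -rf "$workdir"
```

On a typical Linux system this reports the file as removed after SIGTERM and left behind after SIGKILL, which matches the behaviour described above.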
Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile
Good stuff, Dan. I was not aware of the differences between how the reboot and shutdown commands handle the reboot process. It turns out that we're doing a reboot -f, which explains why I have orphaned PID files lying around. I'm going to make the call right now that the fight to have 'reboot -f' changed to the plays-more-nicely-with-others "shutdown -r" is already lost, and I'm going to work around that in code.

Thanks for helping clarify this.

It's weird: when I run nagios and kill it with -9, it leaves the pid file intact, but when I restart it, it zeroes out the pid file and starts just fine. When I just kill it with the default kill signal, it removes the pid file.

In any case, I now know what the issues are and how to address this.

Thanks again very much for your help, guys. You are a feature of Nagios.

Eric

> -Original Message-
> From: Daniel Wittenberg [mailto:daniel.wittenberg.r...@statefarm.com]
> Sent: Tuesday, December 21, 2010 9:23 AM
> To: Nagios Users List
> Subject: Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile
>
> So are you using the actual "reboot" command, not "shutdown -r now", which
> is a little friendlier? The standard nagios shutdown script should take
> care of cleaning those up for you. Otherwise putting something like:
> rm -f ; service nagios start
> in your rc.local would take care of it. But when you mention pid file,
> are you saying the PID file is still there, or the lock file? They are
> different things. Again though, if nagios is shut down properly
> you shouldn't be seeing that.
>
> Dan
>
> -Original Message-
> From: eric.b...@barclayscapital.com [mailto:eric.b...@barclayscapital.com]
> Sent: Monday, December 20, 2010 6:59 PM
> To: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile
>
> We reboot all of our hosts on a weekly basis.
Re: [Nagios-users] nagios mail alerts to exchange
> Does anyone know where I can get a guide on how to use postfix to
> mail a local exchange server - just now I have a local installation of
> postfix no config, the mails just bounce off the exchange server.
> I did it years ago and cannot for the life of me remember how - I think
> I was using postfix and fetchmail,

I'm not using postfix, but I have a few boxes running exim and some running sendmail. I have exim and sendmail both configured to use my Exchange server as a smarthost for all mail. Exchange is set to accept all inbound mail connections from 192.168.0.0/16 and to route and relay it accordingly.

I didn't do my own sendmail config, and I'm not too familiar with sendmail; but in exim I set it up for no local mail, and all mail is sent through a relay or smarthost. Postfix should have similar settings.

Mark

From: Andrew Fay [mailto:andrew@ajl.co.uk]
Sent: Tuesday, December 21, 2010 12:31 PM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] nagios mail alerts to exchange

Cheers,
Andy

o--o
This Email has been scanned for viruses by Aberdeen Journals' Outbound Email Security Systems.
o--o

Mark A. Lappin, CCNA, MCITP: Enterprise Administrator | Lee Michaels Fine Jewelry
Director of Information Technology
11314 Cloverland Ave | Baton Rouge, LA 70809
Ph: 225.291.9094 ext 245 | Fax: 225.368.3675 | Mobile: 225-362-2770
www.lmfj.com

This communication is privileged and confidential. If you are not the intended recipient, please notify the sender by reply e-mail and destroy all copies of this communication.
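For Postfix, the equivalent of the smarthost setup Mark describes would look roughly like the fragment below. This is a sketch, not a tested configuration: exchange.example.local is a placeholder host name, and it assumes the Exchange server already accepts relayed mail from the Nagios host's address, as described above.

```
# /etc/postfix/main.cf (fragment)

# Send all outgoing mail to the Exchange server. The square brackets
# tell Postfix to use the host directly instead of looking up MX records.
# exchange.example.local is a placeholder.
relayhost = [exchange.example.local]:25

# Only accept mail submitted on this machine (e.g. by Nagios).
inet_interfaces = loopback-only

# Deliver nothing locally; everything goes through the relay.
mydestination =
```

Postfix needs to be restarted after changing inet_interfaces. If mail still bounces, the Exchange side's relay restrictions are the first thing to check.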
Re: [Nagios-users] Nagios Windows Check_NT
On 21 December 2010 17:08, Andrew Fay wrote:
> command_line /usr/lib/nagios/plugins/check_nt -H $HOSTADDRESS$ -v $ARG1$ -v $ARG2$

check_nt shouldn't need two "-v" arguments. Run:

    /usr/local/nagios/libexec/check_nt --help

from the command line to see what arguments are expected (on a package install such as yours, the plugin is at /usr/lib/nagios/plugins/check_nt instead). Then run the check_nt command at the command line and make sure it is returning the information you expect before configuring it in Nagios.

You'll find some example Nagios command definitions at:
http://nsclient.org/nscp/wiki/NSClientListener
Re: [Nagios-users] Nagios Windows Check_NT
On 21.12.10 18:08, Andrew Fay wrote:
> Check_NT is not playing ball, when it is scanning services it is coming up with :
[snipped description]

The following works for me in the commands.cfg...

# 'check_nt' command definition
define command{
        command_name    check_nt
        command_line    $USER1$/check_nt -H $HOSTADDRESS$ -p 12489 -v $ARG1$ $ARG2$
        }

...and the xp machine's config:

define service{
        use                     hourly-service
        host_name               winxp-05, winxp-06
        service_description     C:\ Drive Space
        check_command           check_nt!USEDDISKSPACE!-l C -w 80 -c 90
        }

The NSC.ini file on the xp boxes looks as follows (comments stripped):

connand_time
[modules]
CheckSystem.dll
CheckDisk.dll
NSClientListener.dll
NRPEListener.dll
SysTray.dll
CheckEventLog.dll
CheckHelpers.dll
CheckWMI.dll

[Settings]
allowed_hosts=/32
use_file=1

[log]
debug=0
file=C:\Programme\NSClient++\NSC.log

[NSClient]

[NRPE]
command_timeout=300
allow_arguments=1
allow_nasty_meta_chars=1

[Check System]

[External Script]
command_timeout=300

[External Scripts]

[External Alias]

[NSCA Agent]

[NSCA Commands]

[NRPE Handlers]
command[check_win_updates]=c:\\windows\\system32\\cscript.exe //NoLogo //T:300 C:\\Programme\\NSClient++\\plugins\\check_windows_updates.wsf /w:0 /c:1

[NRPE Client Handlers]
check_other=-H 192.168.0.1 -p 5666 -c remote_command -a arguments

Hth, Albrecht.
[Nagios-users] nagios mail alerts to exchange
Does anyone know where I can get a guide on how to use postfix to mail a local Exchange server? Just now I have a local installation of postfix with no config, and the mails just bounce off the Exchange server.

I did it years ago and cannot for the life of me remember how. I think I was using postfix and fetchmail.

Cheers,
Andy
[Nagios-users] Nagios Windows Check_NT
Hello,

Hopefully someone can help me out. I am setting up Nagios on an Ubuntu box to monitor Windows servers, and Check_NT is not playing ball. When it is scanning services it is coming up with:

NSClient++ Version    UNKNOWN    2010-12-21 17:04:06    0d 0h 0m 4s    4/4    Usage:check_nt -H host -v variable [-p port] [-w warning] [-c critical]
C:\ Drive Space       UNKNOWN    2010-12-21 17:05:31    0d 0h 0m 9s    4/4    check_nt: Could not parse arguments

I am using the package install, which is version 3.2.1. The test machine is just an XP desktop that I am trying to get Nagios to pick up. I have installed NSClient++-0.3.8-Win32 on the client machine with pretty much default settings, except that the only machine allowed to access it is the Ubuntu machine (by name); taking this restriction out doesn't make a difference.

Commands I am using (which are just copied straight from templates):

define command {
        command_name    check_nt
        command_line    /usr/lib/nagios/plugins/check_nt -H $HOSTADDRESS$ -v $ARG1$ -v $ARG2$
}

define service{
        use                     generic-service
        host_name               computer1
        service_description     NSClient++ Version
        check_command           check_nt!CLIENTVERSION
}

Can anyone help?

Cheers,
Andy
[Nagios-users] Nagios Looking Glass Filters
Has anyone had success with NLG version 1.1.0b? I've been working with it and see some odd behavior while trying to set up host filters. I've read through the FILTERS_HOW_TO.txt file and was successful in adding one filter to the default set, but since then I have not been able to update the filters, no matter how I edit the server version of s3_filter_inc. The odd thing is that I can successfully change the names of the filters in the list, but can't seem to affect the actual content of the filters.

Thoughts? Thanks.

Chris Caswell
MCNC (North Carolina Research and Education Network)
Re: [Nagios-users] Email Notifications
Ok, no problem - create a new contact/contact group, then add them to whatever service or host definition(s) you want the notifications for using "contact_groups"... like this:

define service {
        service_description     MyService
        check_period            24x7
        max_check_attempts      5
        contact_groups          linuxadmins,operations
        ...
}

cheers,
James

-Original Message-
From: Luc MAIGNAN [mailto:luc.maig...@winxpert.com]
Sent: Tuesday, December 21, 2010 11:02 AM
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Email Notifications

I wasn't clear enough, I think. Today I have all notifications going to one user. I don't want to change this. But for a specified list of services only, I want to notify another user.

On 21/12/10 16:56, James Pratt wrote:
>
> -Original Message-
> From: Luc MAIGNAN [mailto:luc.maig...@winxpert.com]
> Sent: Tuesday, December 21, 2010 10:36 AM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] Email Notifications
>
> Hi,
>
> I wonder if it is possible to have email notifications but JUST for a
> specified list of monitored services ?
>
> Thanks for any help
>
> --
>
> Hi -
>
> I'm pretty sure you can use :
>
> notifications_enabled 0
>
> in the service or host definition for the ones you don't want
> notifications on ... then again, there are probably other ways as well..
>
> cheers,
> James
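Spelled out for Luc's case (keep the existing recipient and add a second one for a few services only), the suggestion above might look like the fragment below. It is a sketch under assumptions: appowner, app-owners, admins and apphost1 are hypothetical names, and generic-contact and generic-service are the templates from the sample configuration, which may not exist in every install.

```
define contact {
        contact_name            appowner            ; hypothetical second recipient
        use                     generic-contact     ; assumes the sample template exists
        alias                   Application Owner
        email                   appowner@example.com
}

define contactgroup {
        contactgroup_name       app-owners          ; hypothetical group name
        alias                   Application Owners
        members                 appowner
}

define service {
        use                     generic-service     ; assumes the sample template exists
        host_name               apphost1            ; hypothetical host
        service_description     MyService
        check_command           check_http
        contact_groups          admins,app-owners   ; existing group plus the new one
}
```

Both groups are listed on the service because a contact_groups value set on a service replaces the value inherited from its template. Services that don't mention app-owners keep notifying only the original user.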
Re: [Nagios-users] Email Notifications
I wasn't clear enough, I think. Today I have all notifications going to one user. I don't want to change this. But for a specified list of services only, I want to notify another user.

On 21/12/10 16:56, James Pratt wrote:
>
> -Original Message-
> From: Luc MAIGNAN [mailto:luc.maig...@winxpert.com]
> Sent: Tuesday, December 21, 2010 10:36 AM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] Email Notifications
>
> Hi,
>
> I wonder if it is possible to have email notifications but JUST for a
> specified list of monitored services ?
>
> Thanks for any help
>
> --
>
> Hi -
>
> I'm pretty sure you can use :
>
> notifications_enabled 0
>
> in the service or host definition for the ones you don't want
> notifications on ... then again, there are probably other ways as well..
>
> cheers,
> James
Re: [Nagios-users] Email Notifications
-Original Message-
From: Luc MAIGNAN [mailto:luc.maig...@winxpert.com]
Sent: Tuesday, December 21, 2010 10:36 AM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Email Notifications

Hi,

I wonder if it is possible to have email notifications but JUST for a specified list of monitored services ?

Thanks for any help

--

Hi -

I'm pretty sure you can use :

notifications_enabled 0

in the service or host definition for the ones you don't want notifications on ... then again, there are probably other ways as well..

cheers,
James
[Nagios-users] Email Notifications
Hi,

I wonder if it is possible to have email notifications but JUST for a specified list of monitored services ?

Thanks for any help
Re: [Nagios-users] Nagios kept from restarting after reboot by lock file
I didn't realize that it was so difficult to associate the process with the PID. I'm used to perl, and solutions like this are generally pretty portable. Both your solution and Mike's make perfect sense. I was hoping that I had missed something and wanted to get the skinny before I start actually doing work to get around this. I'll just add the process checks to the wrapper script that runs Nagios.

WRT the complexity of Nagios and config editors, I've looked at several, and they do seem to provide some pretty straightforward configuration help. I don't think our environment is quite so simple. We've got about 4 major applications, each of which has multiple services associated with it as well as different support organizations, so we've got a fairly sophisticated setup. To make matters even more interesting, for at least some of these services, the hosts on which they run -- and often the ports -- change on a regular basis, so we're looking toward writing some templated solutions for generating the Nagios configurations...which, as you can imagine, provides some interesting challenges.

Anyway, thanks, folks, for your help. Much appreciated.

Eric

> -Original Message-
> From: Andreas Ericsson [mailto:a...@op5.se]
> Sent: Tuesday, December 21, 2010 2:45 AM
> To: Nagios Users List
> Cc: Berg, Eric: IT (NYK)
> Subject: Re: [Nagios-users] Nagios kept from restarting after reboot by lock file
>
> On 12/21/2010 01:54 AM, eric.b...@barclayscapital.com wrote:
> > If you had any idea how difficult it is just to do the most basic
> > system administration tasks in the environment within which we're
> > working, you'd be shaking your head in empathetic embarrassment.
> > Filling out tickets and -- get this: our company has an official
> > state for our tickets referred to as "Chasing", which is when, after
> > entering your ticket into this multi-million dollar ticket tracking
> > system, you then have to manually pick up the phone, email, or walk
> > over to bug the guy who should now act on the ticket. It's a
> > nightmare. I built it.
> >
>
> Sounds unpleasant, inefficient and costly.
>
> > That was intended to express the irony of my search for a solution to
> > keep my monitoring system up. I'm shocked that nagios can't tell the
> > difference between a pid file that refers to a running process and
> > one that refers to one that's not running any more.
>
> It can, but it's damn near impossible to do portably from a script, and
> hard enough to do from C. It's easy enough to check that *some* sort of
> process is running with the pid in the lockfile, but in order to do it
> correctly, one should also check that it's a Nagios process, and that's
> the hard part.
>
> The quick workaround is to add a boot script that runs before the
> Nagios startup script and unconditionally removes the lockfile. That's
> not a universal solution though, since the lockfile will remain if
> Nagios crashes.
>
> > It's the first
> > thing about Nagios that's got my head scratching...besides the
> > complex set of dependencies that you have to negotiate to do pretty
> > much anything.
> >
>
> Well, there are plenty of tools to help with configuration and such.
> Nacoma is one of them. Nagiosql is another.
>
> --
> Andreas Ericsson andreas.erics...@op5.se
> OP5 AB www.op5.se
> Tel: +46 8-230225 Fax: +46 8-230231
>
> Considering the successes of the wars on alcohol, poverty, drugs and
> terror, I think we should give some serious thought to declaring war
> on peace.

___
This e-mail may contain information that is confidential, privileged or otherwise protected from disclosure.
If you are not an intended recipient of this e-mail, do not duplicate or redistribute it by any means. Please delete it and any attachments and notify the sender that you have received it in error. Unless specifically indicated, this e-mail is not an offer to buy or sell or a solicitation to buy or sell any securities, investment products or other financial product or service, an official confirmation of any transaction, or an official statement of Barclays. Any views or opinions presented are solely those of the author and do not necessarily represent those of Barclays. This e-mail is subject to terms available at the following link: www.barcap.com/emaildisclaimer. By messaging with Barclays you consent to the foregoing. Barclays Capital is the investment banking division of Barclays Bank PLC, a company registered in England (number 1026167) with its registered office at 1 Churchill Place, London, E14 5HP. This email may relate to or be sent from other members of the Barclays Group. ___ -- Lotusphere 2011 Register now for Lotusphere 2011 and learn how to c
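The process check Eric plans to add to his wrapper script could be sketched in shell as below. This is an illustration, not the thread's actual script: the function name lock_is_stale is made up, the default lock file path is the one from the error message quoted in this thread, and the daemon's command name is assumed to be "nagios". As Andreas points out, comparing command names is only a heuristic; it does not prove the process is the right Nagios instance.

```shell
#!/bin/sh
# Sketch of a stale-lock check for a wrapper script that starts Nagios.
# Assumptions: the lock file holds the daemon's PID, and the daemon's
# command name is "nagios". Matching the command name is a heuristic.

# lock_is_stale LOCKFILE [NAME]
# Returns 0 if LOCKFILE exists but does not point at a live process
# named NAME; returns 1 if there is no lock file or the lock looks valid.
lock_is_stale() {
    lockfile=$1
    name=${2:-nagios}
    [ -f "$lockfile" ] || return 1
    pid=$(tr -cd '0-9' < "$lockfile")
    # An empty or zero PID (as in the "PID 0" message) cannot be a live daemon.
    case $pid in ''|0) return 0 ;; esac
    # Prints the command name of $pid, or nothing if no such process exists.
    comm=$(ps -p "$pid" -o comm= 2>/dev/null)
    comm=${comm##*/}                 # some systems print a full path
    [ "$comm" != "$name" ]
}

# Path taken from the error message in this thread; override via LOCKFILE.
LOCKFILE=${LOCKFILE:-/home/nagios/nagios/var/nagios.lock}

if lock_is_stale "$LOCKFILE"; then
    echo "Removing stale lock file $LOCKFILE"
    rm -f "$LOCKFILE"
fi
# ...then start Nagios the way the wrapper already does.
```

Because the check runs as the user that owns the lock file, it needs no root access, which fits the constraint described earlier in the thread.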
Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile
So are you using the actual "reboot" command, not "shutdown -r now", which is a little friendlier? The standard nagios shutdown script should take care of cleaning those up for you. Otherwise putting something like:
rm -f ; service nagios start
in your rc.local would take care of it. But when you mention pid file, are you saying the PID file is still there, or the lock file? They are different things. Again though, if nagios is shut down properly you shouldn't be seeing that.

Dan

-Original Message-
From: eric.b...@barclayscapital.com [mailto:eric.b...@barclayscapital.com]
Sent: Monday, December 20, 2010 6:59 PM
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile

We reboot all of our hosts on a weekly basis. I used to pride myself in keeping my boxes up as long as possible, but having spent years now supporting mission-critical financial production applications, I'm on board with the weekly reboots. It lets you know early if some system or app change is problematic.

Reboot is being done via a standard reboot command.

I've looked around for rc scripts that might address this issue, but haven't found any. Got any pointers?

Regarding the rc.local solution, a) I'd prefer to solve the problem, not just address the symptoms, and b) elsewhere in this thread I've described the roadblocks that we have to doing anything at a system level. Yep, that's right, boys, we survive in the app developer layer, within which we do not have root on these boxes. It's a tedious, time-consuming, frustrating, productivity-killing endeavor to do just about anything you can't do yourself.

So, got any sample RC scripts, or command line params to nagios to make it smart enough to know that the PID that is in its PID file isn't an active process?

Thanks.
Eric > -Original Message- > From: Daniel Wittenberg [mailto:daniel.wittenberg.r...@statefarm.com] > Sent: Monday, December 20, 2010 11:56 AM > To: Nagios Users List > Subject: Re: [Nagios-users] Nagios kept from restarting after > reboot by lockfile > > Couple questions > 1) Why do you have to reboot your monitoring server weekly? > 2) How is the reboot being done? > > Reason I ask 2) is because the standard rc script will remove the > lockfile when nagios is told to stop. So if you are having > this problem > is sounds like you are not doing a clean shutdown and > something could be > wrong. > > Either way, I guess worst case one way to check for this would be put > something like this in your /etc/rc.d/rc.local: > rm -f /var/lock/subsys/nagios > > Assuming that's where your lockfile is. > > Dan > > > -Original Message- > From: eric.b...@barclayscapital.com > [mailto:eric.b...@barclayscapital.com] > Sent: Monday, December 20, 2010 10:16 AM > To: eric.b...@barclayscapital.com; nagios-users@lists.sourceforge.net > Subject: Re: [Nagios-users] Nagios kept from restarting after > reboot by > lockfile > > Alternatively, could you recommend a good system/resource monitoring > tool that would be able to let me know if nagios is down and > restart it > automatically? > > _ > From: Berg, Eric: IT (NYK) > Sent: Monday, December 20, 2010 11:03 AM > To: 'nagios-users@lists.sourceforge.net' > Subject:Nagios kept from restarting after reboot by lock file > > Gee, this seems like an annoying newbie problem, but if Nagios crashes > or is killed (as on system reboot), it leaves a lock file around that > prevents it from starting again until the lock file is > manually removed. > > I see this on Monday mornings after weekend reboots on a Red Hat Linux > box: > > nagios: Lockfile '/home/nagios/nagios/var/nagios.lock' looks like its > already held by another instance of Nagios (PID 0). Bailing out... 
>
> Does anyone know if there's a config option or something else that
> obviates the need to write a wrapper script to check whether Nagios is
> really running and remove the lock file (looks like Nagios already knows
> it's not running by virtue of the value of the PID in this very message!)
> so that it can cleanly start up again?
>
> Thanks.
>
> Eric
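For the wrapper-script idea raised in this thread, here is a minimal sketch in Python. It is not an official Nagios tool. It assumes a Unix host and that the lock file holds a single PID, as the error message quoted above suggests. The function names (read_pid, pid_is_running, remove_stale_lock) are made up for illustration. It only checks whether the PID in the file belongs to a live process and deletes the file if not, which needs no root as long as the nagios user owns the file. One caveat: after a reboot the old PID may have been reused by an unrelated process, in which case this check would wrongly keep the lock.

```python
import os
from pathlib import Path


def read_pid(lockfile: Path):
    """Return the PID stored in lockfile, or None if missing, empty or invalid."""
    try:
        text = lockfile.read_text().strip()
    except FileNotFoundError:
        return None
    if not text.isdigit():
        return None
    pid = int(text)
    # PID 0 (as in the error message in this thread) is never a real daemon.
    return pid if pid > 0 else None


def pid_is_running(pid: int) -> bool:
    """Signal 0 only checks that the process exists; nothing is delivered."""
    try:
        os.kill(pid, 0)
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def remove_stale_lock(lockfile: Path) -> bool:
    """Delete lockfile if it names no live process. Return True if removed."""
    if not lockfile.exists():
        return False
    pid = read_pid(lockfile)
    if pid is not None and pid_is_running(pid):
        return False  # lock looks genuine, leave it alone
    lockfile.unlink()
    return True
```

A wrapper would call remove_stale_lock(Path("/home/nagios/nagios/var/nagios.lock")) and then run whatever start command is already in use.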
Re: [Nagios-users] Not a valid parent
On Tue, Dec 21, 2010 at 08:43:13AM -0500, stan wrote:
> What causes this error message:
>
> Error: 'pm2fwi' is not a valid parent for host 'pm2fw'!
>
> This is the inside, and outside of a firewall. They are in 2 different
> subnets, and 2 different domains. Still, this relationship is correct, and
> I would expect Nagios to allow me to define it.
>
> What am I doing wrong?

Answer to my own question: I had named the config file p,2fwi,cfg, not pm2fwi.cfg.

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?
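If the file is pulled in through a cfg_dir directive (an assumption; the thread does not say how it is included), Nagios only processes files in that directory whose names end in .cfg, so a misnamed file like the one above is skipped without complaint. A small sketch in Python to list such files follows. The function name skipped_config_files is hypothetical, and the directory to scan is whatever your own cfg_dir points at.

```python
from pathlib import Path


def skipped_config_files(config_dir):
    """Return sorted relative paths of regular files that lack a .cfg suffix."""
    root = Path(config_dir)
    return sorted(
        p.relative_to(root).as_posix()
        for p in root.rglob("*")  # cfg_dir is processed recursively
        if p.is_file() and p.suffix != ".cfg"
    )
```

Anything this returns is worth a second look, although some of it may be files you left out of the configuration on purpose.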
Re: [Nagios-users] Not a valid parent
On 21 December 2010 13:43, stan wrote:
> What causes this error message:
>
> Error: 'pm2fwi' is not a valid parent for host 'pm2fw'!
>
> This is the inside, and outside of a firewall. They are in 2 different
> subnets, and 2 different domains. Still, this relationship is correct, and
> I would expect Nagios to allow me to define it.
>
> What am I doing wrong?

I think this usually means that the host 'pm2fwi' is not defined in your Nagios configuration. I expect you might be looking for a slight typo in the hostname, for example pm2fwl rather than pm2fwi or some such.

The kind of mistake I usually make is to set up a new directory with host and service definitions in it, then forget to add a cfg_dir directive in nagios.cfg.
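The "parent is not a defined host" check described above can be approximated with a rough script. This is a simplified sketch in Python, not how Nagios itself parses its configuration. It assumes hosts are written as plain "define host{ ... }" blocks, ignores templates and inheritance via "use", and the names parse_hosts and undefined_parents are invented for illustration. Nagios's own verification run (nagios -v with your main config file) remains the authoritative test.

```python
import re

# Matches "define host{ ... }" but not "define hostgroup{ ... }".
HOST_BLOCK = re.compile(r"define\s+host\s*\{(.*?)\}", re.DOTALL)


def parse_hosts(config_text):
    """Yield one dict of directive -> value per 'define host' block."""
    for match in HOST_BLOCK.finditer(config_text):
        directives = {}
        for raw in match.group(1).splitlines():
            line = raw.split(";", 1)[0].strip()  # drop trailing ';' comments
            if not line or line.startswith("#"):
                continue
            parts = line.split(None, 1)
            directives[parts[0]] = parts[1].strip() if len(parts) > 1 else ""
        yield directives


def undefined_parents(config_text):
    """Return sorted (host, parent) pairs whose parent has no host definition."""
    hosts = list(parse_hosts(config_text))
    defined = {h["host_name"] for h in hosts if "host_name" in h}
    missing = []
    for h in hosts:
        for parent in h.get("parents", "").split(","):
            parent = parent.strip()
            if parent and parent not in defined:
                missing.append((h.get("host_name", "?"), parent))
    return sorted(missing)
```

Feed it the concatenated text of every object file Nagios actually loads. A parent that shows up as missing is either misspelled or defined in a file that is not being read.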
Re: [Nagios-users] Not a valid parent
> What causes this error message:
>
> Error: 'pm2fwi' is not a valid parent for host 'pm2fw'!
>
> This is the inside, and outside of a firewall. They are in 2 different
> subnets, and 2 different domains. Still, this relationship is correct, and
> I would expect Nagios to allow me to define it.
>

- Make sure the 'host_name' value in the Nagios configuration file matches the intended name of the parent. People sometimes use alternate names and get confused by the fact that, when host_name is used, Nagios does not use DNS names.

M.
[Nagios-users] Not a valid parent
What causes this error message:

Error: 'pm2fwi' is not a valid parent for host 'pm2fw'!

This is the inside, and outside of a firewall. They are in 2 different subnets, and 2 different domains. Still, this relationship is correct, and I would expect Nagios to allow me to define it.

What am I doing wrong?
[Nagios-users] Centreon installation question
I don't know if anyone on the list uses the Centreon web front-end, but I have a question. I have to install and set up a new Nagios server. I have already installed RHEL5, and now I'm trying to get the base packages (Apache, PHP, MySQL) that are needed. I have been trying to follow the directions here:

http://en.doc.centreon.com/Setup:Prerequisite/Centos/Fedora/RHEL

and here:

http://www.nagioswiki.com/wiki/index.php/Installing_Centreon_on_Centos_5

But they conflict, and both bring me to the same point: the major packages don't seem to be there. I can set up Yum and install a simple package such as htop from their repository, but the major packages such as Apache, MySQL, and PHP don't seem to be there. I've even gone out manually to look for them, but they're not there:

http://apt.sw.be/redhat/el5/en/i386/dag/RPMS/

or here:

http://packages.sw.be/

Does anyone have any experience with installing Centreon?

Thanks,
Chris
Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile
On Mon, 20 Dec 2010 19:58:47 -0500, wrote:
> Reboot is being done via a standard reboot command.

Don't use `reboot`. Use `shutdown -r` instead, and the system (including Nagios) should close down correctly. But rebooting a monitoring server is rather silly in my view.

Hugo.

--
hvdko...@vanderkooij.org http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc