[Nagios-users] Override service notification time
Hi I am currently using Nagios with host groups to so that I can add the same service check to multiple hosts without having to create the same service multiple times, which is all working fine, the problem is that I don't want non mission critical hosts alerting staff to a service problem in the middle of the night when the problem can wait till the morning. Will I have to duplicate service config so I can change the notification periodor can I have it set on a per host basis and have that override what is set in the service template? Thanks David -- BlackBerryreg; DevCon Americas, Oct. 18-20, San Francisco, CA The must-attend event for mobile developers. Connect with experts. Get tools for creating Super Apps. See the latest technologies. Sessions, hands-on labs, demos much more. Register early save! http://p.sf.net/sfu/rim-blackberry-1___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Override service notification time
maybe you can check this: http://exchange.nagios.org/directory/Addons/Notifications/*-Notification-Managers/Rule-2DBased-Notifier/details and see if it's what you need. On Mon, Aug 8, 2011 at 10:04 PM, David Wilkinson nagios-us...@noroutetohost.net wrote: Hi I am currently using Nagios with host groups to so that I can add the same service check to multiple hosts without having to create the same service multiple times, which is all working fine, the problem is that I don't want non mission critical hosts alerting staff to a service problem in the middle of the night when the problem can wait till the morning. Will I have to duplicate service config so I can change the notification period or can I have it set on a per host basis and have that override what is set in the service template? Thanks David -- BlackBerryreg; DevCon Americas, Oct. 18-20, San Francisco, CA The must-attend event for mobile developers. Connect with experts. Get tools for creating Super Apps. See the latest technologies. Sessions, hands-on labs, demos much more. Register early save! http://p.sf.net/sfu/rim-blackberry-1 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- BlackBerryreg; DevCon Americas, Oct. 18-20, San Francisco, CA The must-attend event for mobile developers. Connect with experts. Get tools for creating Super Apps. See the latest technologies. Sessions, hands-on labs, demos much more. Register early save! http://p.sf.net/sfu/rim-blackberry-1 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Nagios Core Log file
Hello, I was told the NAGIOS CORE logfile would log any changes made to host command execution ie: External commands:who turned off what and when. Upon examination of my logfile it sems that this is not the case. The log file indicates only that a particular service was enabled or disable but it does not indiacte the user id that performed the change. Can anyone point me in the right direction to make this happen? Snippet of log file : [root@its019-lap-v:var]# pwd /usr/local/nagios/var [root@its019-lap-v:var]# ls -lat total 188 drwxrwxr-x 5 nagios nagios 4096 Aug 8 11:15 . -rw-rw-r-- 1 nagios nagios 51204 Aug 8 11:15 status.dat -rw-r--r-- 1 nagios nagios 4561 Aug 8 11:09 nagios.log -rw--- 1 nagios nagios 50697 Aug 8 11:09 retention.dat drwxrwxr-x 2 nagios nagios 4096 Aug 8 00:00 archives -rw-r--r-- 1 nagios nagios 41229 Aug 5 15:09 objects.cache -rw-r--r-- 1 nagios nagios 6 Aug 1 13:24 nagios.lock drwxrwsr-x 2 nagios nagcmd 4096 Aug 1 13:24 rw drwxrwxr-x 9 nagios nagios 4096 Jul 1 15:06 .. drwxrwxr-x 3 nagios nagios 4096 Jun 27 15:54 spool [root@its019-lap-v:var]# cat nagios.log [1312776000] LOG ROTATION: DAILY [1312776000] LOG VERSION: 2.0 [1312776541] Auto-save of retention data completed successfully. [1312780141] Auto-save of retention data completed successfully. [1312783741] Auto-save of retention data completed successfully. [1312787341] Auto-save of retention data completed successfully. [1312790941] Auto-save of retention data completed successfully. [1312794541] Auto-save of retention data completed successfully. [1312798141] Auto-save of retention data completed successfully. [1312801741] Auto-save of retention data completed successfully. [1312805172] EXTERNAL COMMAND: DEL_ALL_SVC_COMMENTS;TPC Server;root space [1312805341] Auto-save of retention data completed successfully. [1312808941] Auto-save of retention data completed successfully. [1312812541] Auto-save of retention data completed successfully. [1312816017] EXTERNAL COMMAND: ENABLE_SVC_NOTIFICATIONS;Nagios Server;HTTP [1312816033] EXTERNAL COMMAND: ENABLE_SVC_NOTIFICATIONS;Nagios Server;SSH [1312816141] Auto-save of retention data completed successfully. -- Thank you, Bob Molerio Systems Administrator New York University ITS Computer Facilities Services/Infrastructure Level C-2 75 Third Avenue New York NY 10003-5527 email:robert.mole...@nyu.edu robert.mole...@nyu.edu -- BlackBerryreg; DevCon Americas, Oct. 18-20, San Francisco, CA The must-attend event for mobile developers. Connect with experts. Get tools for creating Super Apps. See the latest technologies. Sessions, hands-on labs, demos much more. Register early save! http://p.sf.net/sfu/rim-blackberry-1___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios Core Log file
You might want to check your values in nagios.cfg: # INITIAL STATES LOGGING OPTION # If you want Nagios to log all initial host and service states to # the main log file (the first time the service or host is checked) # you can enable this option by setting this value to 1. If you # are not using an external application that does long term state # statistics reporting, you do not need to enable this option. In # this case, set the value to 0. log_initial_states=0 # EXTERNAL COMMANDS LOGGING OPTION # If you don't want Nagios to log external commands, set this value # to 0. If external commands should be logged, set this value to 1. # Note: This option does not include logging of passive service # checks - see the option below for controlling whether or not # passive checks are logged. log_external_commands=1 On Mon, Aug 8, 2011 at 12:01 PM, Robert J Molerio rjm...@nyu.edu wrote: Hello, I was told the NAGIOS CORE logfile would log any changes made to host command execution ie: External commands:who turned off what and when. Upon examination of my logfile it sems that this is not the case. The log file indicates only that a particular service was enabled or disable but it does not indiacte the user id that performed the change. Can anyone point me in the right direction to make this happen? Snippet of log file : [root@its019-lap-v:var]# pwd /usr/local/nagios/var [root@its019-lap-v:var]# ls -lat total 188 drwxrwxr-x 5 nagios nagios 4096 Aug 8 11:15 . -rw-rw-r-- 1 nagios nagios 51204 Aug 8 11:15 status.dat -rw-r--r-- 1 nagios nagios 4561 Aug 8 11:09 nagios.log -rw--- 1 nagios nagios 50697 Aug 8 11:09 retention.dat drwxrwxr-x 2 nagios nagios 4096 Aug 8 00:00 archives -rw-r--r-- 1 nagios nagios 41229 Aug 5 15:09 objects.cache -rw-r--r-- 1 nagios nagios 6 Aug 1 13:24 nagios.lock drwxrwsr-x 2 nagios nagcmd 4096 Aug 1 13:24 rw drwxrwxr-x 9 nagios nagios 4096 Jul 1 15:06 .. drwxrwxr-x 3 nagios nagios 4096 Jun 27 15:54 spool [root@its019-lap-v:var]# cat nagios.log [1312776000] LOG ROTATION: DAILY [1312776000] LOG VERSION: 2.0 [1312776541] Auto-save of retention data completed successfully. [1312780141] Auto-save of retention data completed successfully. [1312783741] Auto-save of retention data completed successfully. [1312787341] Auto-save of retention data completed successfully. [1312790941] Auto-save of retention data completed successfully. [1312794541] Auto-save of retention data completed successfully. [1312798141] Auto-save of retention data completed successfully. [1312801741] Auto-save of retention data completed successfully. [1312805172] EXTERNAL COMMAND: DEL_ALL_SVC_COMMENTS;TPC Server;root space [1312805341] Auto-save of retention data completed successfully. [1312808941] Auto-save of retention data completed successfully. [1312812541] Auto-save of retention data completed successfully. [1312816017] EXTERNAL COMMAND: ENABLE_SVC_NOTIFICATIONS;Nagios Server;HTTP [1312816033] EXTERNAL COMMAND: ENABLE_SVC_NOTIFICATIONS;Nagios Server;SSH [1312816141] Auto-save of retention data completed successfully. -- Thank you, Bob Molerio Systems Administrator New York University ITS Computer Facilities Services/Infrastructure Level C-2 75 Third Avenue New York NY 10003-5527 email:robert.mole...@nyu.edu robert.mole...@nyu.edu -- BlackBerryreg; DevCon Americas, Oct. 18-20, San Francisco, CA The must-attend event for mobile developers. Connect with experts. Get tools for creating Super Apps. See the latest technologies. Sessions, hands-on labs, demos much more. Register early save! http://p.sf.net/sfu/rim-blackberry-1 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- BlackBerryreg; DevCon Americas, Oct. 18-20, San Francisco, CA The must-attend event for mobile developers. Connect with experts. Get tools for creating Super Apps. See the latest technologies. Sessions, hands-on labs, demos much more. Register early save! http://p.sf.net/sfu/rim-blackberry-1___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios Core Log file
On 2011-08-08 18:01, Robert J Molerio wrote: Hello, I was told the NAGIOS CORE logfile would log any changes made to host command execution ie: External commands:who turned off what and when. Upon examination of my logfile it sems that this is not the case. The log file indicates only that a particular service was enabled or disable but it does not indiacte the user id that performed the change. Can anyone point me in the right direction to make this happen? this isn't possible with nagios core. you'd better catch up on your application sending the external commands, logging that to somewhere. e.g. if you are using the nagios classic ui with the cmd.cgi, you might hack that to log the user into a seperate file. we did hack that in icinga classic and cmd.cgi so that each user and ip address are logged into a seperated file (see https://dev.icinga.org/issues/1161 for more information). but vanilla nagios cgis don't support that - supposingly you'll open a feature request (if not already there) on tracker.nagios.org to get that feature into nagios too. kind regards, michael Snippet of log file : [root@its019-lap-v:var]# pwd /usr/local/nagios/var [root@its019-lap-v:var]# ls -lat total 188 drwxrwxr-x 5 nagios nagios 4096 Aug 8 11:15 . -rw-rw-r-- 1 nagios nagios 51204 Aug 8 11:15 status.dat -rw-r--r-- 1 nagios nagios 4561 Aug 8 11:09 nagios.log -rw--- 1 nagios nagios 50697 Aug 8 11:09 retention.dat drwxrwxr-x 2 nagios nagios 4096 Aug 8 00:00 archives -rw-r--r-- 1 nagios nagios 41229 Aug 5 15:09 objects.cache -rw-r--r-- 1 nagios nagios 6 Aug 1 13:24 nagios.lock drwxrwsr-x 2 nagios nagcmd 4096 Aug 1 13:24 rw drwxrwxr-x 9 nagios nagios 4096 Jul 1 15:06 .. drwxrwxr-x 3 nagios nagios 4096 Jun 27 15:54 spool [root@its019-lap-v:var]# cat nagios.log [1312776000] LOG ROTATION: DAILY [1312776000] LOG VERSION: 2.0 [1312776541] Auto-save of retention data completed successfully. [1312780141] Auto-save of retention data completed successfully. [1312783741] Auto-save of retention data completed successfully. [1312787341] Auto-save of retention data completed successfully. [1312790941] Auto-save of retention data completed successfully. [1312794541] Auto-save of retention data completed successfully. [1312798141] Auto-save of retention data completed successfully. [1312801741] Auto-save of retention data completed successfully. [1312805172] EXTERNAL COMMAND: DEL_ALL_SVC_COMMENTS;TPC Server;root space [1312805341] Auto-save of retention data completed successfully. [1312808941] Auto-save of retention data completed successfully. [1312812541] Auto-save of retention data completed successfully. [1312816017] EXTERNAL COMMAND: ENABLE_SVC_NOTIFICATIONS;Nagios Server;HTTP [1312816033] EXTERNAL COMMAND: ENABLE_SVC_NOTIFICATIONS;Nagios Server;SSH [1312816141] Auto-save of retention data completed successfully. -- Thank you, Bob Molerio Systems Administrator New York University ITS Computer Facilities Services/Infrastructure Level C-2 75 Third Avenue New York NY 10003-5527 email:robert.mole...@nyu.edu mailto:robert.mole...@nyu.edu -- BlackBerryreg; DevCon Americas, Oct. 18-20, San Francisco, CA The must-attend event for mobile developers. Connect with experts. Get tools for creating Super Apps. See the latest technologies. Sessions, hands-on labs, demos much more. Register early save! http://p.sf.net/sfu/rim-blackberry-1 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- DI (FH) Michael Friedrich Vienna University Computer Center Universitaetsstrasse 7 A-1010 Vienna, Austria email: michael.friedr...@univie.ac.at phone: +43 1 4277 14359 mobile:+43 664 60277 14359 fax: +43 1 4277 14338 web: http://www.univie.ac.at/zid http://www.aco.net Icinga Core IDOUtils Developer http://www.icinga.org -- BlackBerryreg; DevCon Americas, Oct. 18-20, San Francisco, CA The must-attend event for mobile developers. Connect with experts. Get tools for creating Super Apps. See the latest technologies. Sessions, hands-on labs, demos much more. Register early save! http://p.sf.net/sfu/rim-blackberry-1___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Override service notification time
David Wilkinson wrote: Will I have to duplicate service config so I can change the notification period or can I have it set on a per host basis and have that override what is set in the service template? You can set it on a per-host basis. If you do not define a notification timeperiod in a service's template or definition, the service will inherit its notification timeperiod from its associated host. Check out the section on implied inheritance here: http://nagios.sourceforge.net/docs/nagioscore/3/en/objectinheritance.html I have separate host templates for my development and production servers which have different notification timeperiods, and one set of service definitions associated with both types of hosts. -- -Chris -- Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you. -- BlackBerryreg; DevCon Americas, Oct. 18-20, San Francisco, CA The must-attend event for mobile developers. Connect with experts. Get tools for creating Super Apps. See the latest technologies. Sessions, hands-on labs, demos much more. Register early save! http://p.sf.net/sfu/rim-blackberry-1 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] check_http with multiple outcomes?
We're monitoring a local Jetty (Java webserver) process using an application status page. When everything's going well, it includes the string OK, which we check for. This should be a clearly successful status test. When everything's not going well, we get some sort of 4xx or 5xx error message. This should trigger alerts immediately. When some things are going well and others aren't fully up to speed (slow database), we'll get a DATABASE_TEST_RAN_LONG, which isn't ideal, but at least for a few occurances (n = 5) we can live with. In particular, we DON'T want a single result sounding off pagers in the middle of the night. The current test looks like: define command{ command_namecheck_jetty command_line/usr/lib/nagios/plugins/check_http -H '$HOSTADDRESS$' -u /serviceStatus -e 200 -s OK } What would be a sane process of getting Nagios to: - Report all clear when we get a 200 status and OK text on page? - Wait for 6 consecutive instances of DATABASE_TEST_RAN_LONG before alerting for that result. - Alert immediately on any cases not matching one of the above? I don't believe we can capture this in a single test unless I'm missing something. Thanks in advance. -- Dr. Ed Morbius Chief Scientist Krell Power Systems Unlimited -- uberSVN's rich system and user administration capabilities and model configuration take the hassle out of deploying and managing Subversion and the tools developers use with it. Learn more about uberSVN and get a free download at: http://p.sf.net/sfu/wandisco-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_http with multiple outcomes?
Quoting Edward Morbius dredmorb...@gmail.com: When some things are going well and others aren't fully up to speed (slow database), we'll get a DATABASE_TEST_RAN_LONG, which isn't ideal, but at least for a few occurances (n = 5) we can live with. In particular, we DON'T want a single result sounding off pagers in the middle of the night. You can specify -e OK,DATABASE_TEST_RAN_LONG, then let the plugin decide if it's slow or not, with the -w, -c and -t parameters. Terry [-w warn time] [-c critical time] [-t timeout] [ The current test looks like: define command{ command_name check_jetty command_line /usr/lib/nagios/plugins/check_http -H '$HOSTADDRESS$' -u /serviceStatus -e 200 -s OK } What would be a sane process of getting Nagios to: - Report all clear when we get a 200 status and OK text on page? - Wait for 6 consecutive instances of DATABASE_TEST_RAN_LONG before alerting for that result. - Alert immediately on any cases not matching one of the above? I don't believe we can capture this in a single test unless I'm missing something. Thanks in advance. -- Dr. Ed Morbius Chief Scientist Krell Power Systems Unlimited -- Terry Carmen CNY Support, LLC Web. Database. Business. http://www.cnysupport.com-- uberSVN's rich system and user administration capabilities and model configuration take the hassle out of deploying and managing Subversion and the tools developers use with it. Learn more about uberSVN and get a free download at: http://p.sf.net/sfu/wandisco-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Suggestions for event correlation managers?
Anyone? C'mon, don't be shy! :-) -- Trever From: Furnish, Trever G [tgfurn...@herffjones.com] Sent: Friday, August 05, 2011 4:45 PM To: nagios-users@lists.sourceforge.net Cc: Boeglin, Adam R Subject: [Nagios-users] Suggestions for event correlation managers? Hello, I'm looking for suggestions for applying Nagios' style of event handling (escalations, recoveries, acknowledgements), hopefully with some improvements (aggregation), to events coming from many different (non-Nagios) sources. I know of a few Nagios-specific notification aggregators, but can anyone recommend a good (preferably inexpensive / OSS) way of expanding that to include many other tools? I know about SNARE and RiverMuse, but they're relatively expensive. We make heavy use of Nagios as well as several other tools (MSFT SCOM, HP SIM, Oracle Grid Control, AlertSite.net, etc). They're all sending alerts in various forms to a small group of admins and engineers, so many of us get alerts from all of the tools, sometimes from more than one tool regarding a single event. Nagios does a great job of flexibly managing alerts from its own events, but I don't see how I'd hook in the other tools. Several of the tools (e.g. SCOM and SIM) don't even have any concept of event correlation -- breakage and recovery are two separate events. I see tools like SNARE, RiverMuse ECM, and a few others filling this gap, at least partially, but I don't yet have experience with them and they're relatively expensive. Anyone doing this effectively with OSS tools or low-cost tools or a good home-grown approach you wouldn't mind sharing (and possibly collaborating on)? -- Trever Furnish, tgfurn...@herffjones.com Herff Jones, Inc. Solutions Architect Phone: 317.612.3519 -- BlackBerryreg; DevCon Americas, Oct. 18-20, San Francisco, CA The must-attend event for mobile developers. Connect with experts. Get tools for creating Super Apps. See the latest technologies. Sessions, hands-on labs, demos much more. Register early save! http://p.sf.net/sfu/rim-blackberry-1 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- uberSVN's rich system and user administration capabilities and model configuration take the hassle out of deploying and managing Subversion and the tools developers use with it. Learn more about uberSVN and get a free download at: http://p.sf.net/sfu/wandisco-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null