[Nagios-users] Flexible downtime not working
Hi,

We use Nagios 3.2.0 on SLES 10 SP2 to monitor a few hundred services.

Yesterday, we put a particular service into flexible downtime from around 7am to 2pm. However, it still seems to be stuck in flexible downtime, so any alerts after 2pm haven't been sent out.

I'm not entirely sure why this is happening, as I've used this before and never had any problems. Has anyone come across this before? Is there a way to kick this service back into normal monitoring?

Thanks,
Deborah

This e-mail and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. If you are not the intended recipient, please delete this e-mail immediately. Any unauthorised distribution or copying is strictly prohibited. Whilst Kognitio endeavours to prevent the transmission of viruses via e-mail, we cannot guarantee that any e-mail or attachment is free from computer viruses and you are strongly advised to undertake your own anti-virus precautions. Kognitio grants no warranties regarding performance, use or quality of any e-mail or attachment and undertakes no liability for loss or damage, howsoever caused.

--
This SF email is sponsored by: Try Windows Azure free for 90 days Click Here http://p.sf.net/sfu/sfd2d-msazure
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Flexible downtime not working
Hello Deborah,

If you use flexible downtimes, don't forget to select Flexible from the Type select menu. Otherwise the fixed dates (Start Date, End Date) are used.

To remove a host or service from a scheduled downtime, click on Downtime in the left navigation. There you can delete the relevant entry.

cheers

On Mon, Apr 2, 2012 at 10:42 AM, Deborah Martin deborah.mar...@kognitio.com wrote:

Yesterday, we put a particular service into flexible downtime from around 7am to 2pm. However, it seems to still be stuck in flexible downtime so any alerts after 2pm haven't been sent out.
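Besides the web interface, Nagios also accepts a `DEL_SVC_DOWNTIME;<downtime_id>` external command, which may be handy when a downtime entry has to be removed from a script. The following is only a minimal sketch, assuming external commands are enabled; the downtime ID and the command-file path are placeholders that depend on the local installation, and the helper names are made up for illustration.

```python
import time
from typing import Optional


def format_del_svc_downtime(downtime_id: int, now: Optional[int] = None) -> str:
    """Build one external-command line that deletes a service downtime entry.

    downtime_id is the ID shown in the Downtime overview (placeholder here).
    now is the Unix timestamp to put in brackets; defaults to the current time.
    """
    if downtime_id < 0:
        raise ValueError("downtime_id must not be negative")
    stamp = int(time.time()) if now is None else now
    return f"[{stamp}] DEL_SVC_DOWNTIME;{downtime_id}\n"


def submit(command: str, command_file: str) -> None:
    """Write the command to the Nagios command file (a named pipe).

    command_file is an assumption: its location is set by the command_file
    option in nagios.cfg and differs between installations.
    """
    with open(command_file, "w") as fh:
        fh.write(command)


if __name__ == "__main__":
    # Only prints the line; call submit() with your own command-file path.
    print(format_del_svc_downtime(42), end="")
```

Keeping the formatting separate from the write makes it possible to check the generated line before anything is sent to the running Nagios process.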
Re: [Nagios-users] Flexible downtime not working
Hi Claudio,

Thanks for the info. But I'm a little puzzled.

I clarified the steps that were taken. It turns out that he didn't use Flexible but used Fixed instead. However, he's sure that the start and end dates were both set to 1st April. So, surely once the date changed to 2nd April, normal monitoring should at the very least have resumed, or is that not correct?

In fact, I took your advice and removed the fixed downtime manually.

These are the Nagios log entries I see for this service:

04-01-2012 00:00:00  04-01-2012 06:00:00  0d 6h 0m 0s      SERVICE OK (HARD)        60 rows in dsa.mdf_reservation in last 10 minutes
04-01-2012 06:00:00  04-01-2012 06:46:11  0d 0h 46m 11s    SERVICE DOWNTIME START   Start of scheduled downtime
04-01-2012 06:46:11  04-01-2012 14:00:10  0d 7h 13m 59s    SERVICE CRITICAL (HARD)  0 rows in dsa.mdf_reservation in last 10 minutes
04-01-2012 14:00:10  04-02-2012 11:26:18  0d 21h 26m 8s    SERVICE OK (HARD)        331 rows in dsa.mdf_reservation in last 10 minutes
04-02-2012 11:26:18  04-02-2012 11:47:16  0d 0h 20m 58s+   SERVICE DOWNTIME END     End of scheduled downtime

The last entry is when I manually put the service back to normal monitoring.

Regards,
Deborah

From: Claudio Kuenzler [mailto:c...@claudiokuenzler.com]
Sent: 02 April 2012 10:29
To: Nagios Users List
Subject: Re: [Nagios-users] Flexible downtime not working

To remove a host or service from a scheduled downtime, click on Downtime in the left navigation. There you can delete the wanted entry.
Re: [Nagios-users] Flexible downtime not working
Well, without blaming anyone, the most likely cause is a human mistake in interpreting the schedule downtime form (this often happens with fixed/flexible), resulting in a wrong end date.

When you were in the Downtime overview, did you look at the End Time column? What was written there before you clicked the delete icon?

I suggest you go to your var/archives folder and check the logfile of April 1st for the external command, something like this:

[158804] EXTERNAL COMMAND: SCHEDULE_SVC_DOWNTIME;hostname;NTP;158695;165895;0;0;3600;Nagios Admin;test flexible
[173670] EXTERNAL COMMAND: SCHEDULE_SVC_DOWNTIME;hostname;SSH;173648;184448;1;0;7200;Nagios Admin;test fixed

These timestamps tell you when the external command was launched (the value in square brackets), when the downtime starts and when the downtime ends (the two values after the service name). Also note the 0 and 1 after the end timestamp: 0 indicates a flexible downtime, 1 indicates a fixed scheduled downtime.

To answer your question: yes, after the end of the scheduled downtime, normal monitoring should have resumed.

On Mon, Apr 2, 2012 at 2:54 PM, Deborah Martin deborah.mar...@kognitio.com wrote:

So, surely once the date changed to the 2nd April, at the very least, normal monitoring should have resumed or is that not correct ?
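For anyone who wants to check such log lines without counting semicolons by hand, here is a small sketch that splits a `SCHEDULE_SVC_DOWNTIME` entry into named fields, following the field order host, service, start, end, fixed, trigger ID, duration, author, comment. The class and function names are made up for illustration, and the sample lines are the two example lines from the message above, not real log data.

```python
from dataclasses import dataclass


@dataclass
class SvcDowntime:
    entry_time: int   # when the external command was logged
    host: str
    service: str
    start: int        # downtime start (Unix timestamp)
    end: int          # downtime end (Unix timestamp)
    fixed: bool       # True = fixed downtime, False = flexible
    trigger_id: int
    duration: int     # seconds; only relevant for flexible downtimes
    author: str
    comment: str


def parse_schedule_svc_downtime(line: str) -> SvcDowntime:
    """Parse one '[ts] EXTERNAL COMMAND: SCHEDULE_SVC_DOWNTIME;...' log line."""
    stamp, sep, rest = line.strip().partition("] EXTERNAL COMMAND: ")
    if not sep:
        raise ValueError("not an EXTERNAL COMMAND log line")
    # Split into at most 10 parts so a comment containing ';' stays intact.
    fields = rest.split(";", 9)
    if len(fields) != 10 or fields[0] != "SCHEDULE_SVC_DOWNTIME":
        raise ValueError("not a SCHEDULE_SVC_DOWNTIME command")
    _, host, service, start, end, fixed, trigger_id, duration, author, comment = fields
    return SvcDowntime(
        entry_time=int(stamp.lstrip("[")),
        host=host,
        service=service,
        start=int(start),
        end=int(end),
        fixed=(fixed == "1"),
        trigger_id=int(trigger_id),
        duration=int(duration),
        author=author,
        comment=comment,
    )


if __name__ == "__main__":
    sample = ("[173670] EXTERNAL COMMAND: SCHEDULE_SVC_DOWNTIME;"
              "hostname;SSH;173648;184448;1;0;7200;Nagios Admin;test fixed")
    d = parse_schedule_svc_downtime(sample)
    kind = "fixed" if d.fixed else "flexible"
    print(f"{d.service}: {kind} downtime, window of {d.end - d.start} seconds")
```

Comparing `end` with the date the operator believes was entered is the quickest way to see whether the form was filled in with a wrong end date.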
[Nagios-users] Fix for battery state Learning in check_perc.pl from Matt Stanford
I have no idea if anybody else is using this plugin or not, but I've been using it quite successfully on a bunch of older Dell machines, and it's been doing great, except for throwing a critical error when the battery state is Learning.

I updated the code to handle the Learning state, but can't seem to find the author to submit the patch. If anybody is using the plugin and would find the updated version useful, it's on my website at:

https://www.cnysupport.com/download/check_perc/check_perc.pl

Terry
[Nagios-users] nagios message broker experience
Does anybody have experience with setting up the Nagios message broker from http://snippets.syabru.ch/nagios-msg-broker/ ?

Thanks.
Mikhail.
[Nagios-users] how to detect even a fraction of downtime for the URL or underlying Jboss service on a production server
Hi List,

Hope you are doing fine.

I have a very critical web application (URL) hosted on a production server, running on the JBoss service. At the moment I am monitoring the URL availability and the production server availability using Nagios. But as I am polling these at an interval of 5 minutes, and there is often check latency on my Nagios server because of high load, I am not able to detect even a second's downtime of this URL or server.

An idea I am thinking of is to monitor the JBoss log files for this service's availability using log monitoring (the passive way), with the freshness threshold set very low.

Will it work? What are your suggestions?

Thanks,
Manish Kumar
Open Source Tools Team, IMS, Mahindra Satyam
Mob: +919035435306

DISCLAIMER: This email (including any attachments) is intended for the sole use of the intended recipient/s and may contain material that is CONFIDENTIAL AND PRIVATE COMPANY INFORMATION. Any review or reliance by others or copying or distribution or forwarding of any or all of the contents in this message is STRICTLY PROHIBITED. If you are not the intended recipient, please contact the sender by email and delete all copies; your cooperation in this regard is appreciated.
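To make the passive idea above a little more concrete: a log watcher would typically push its findings into Nagios as passive results using the `PROCESS_SERVICE_CHECK_RESULT` external command. The sketch below only shows how such a line could be built; it is not a tested setup. The host name, service description and plugin output are hypothetical, and the service would still need passive checks enabled and a suitable `freshness_threshold` in its definition.

```python
import time
from typing import Optional


def format_passive_result(host: str, service: str, return_code: int,
                          output: str, now: Optional[int] = None) -> str:
    """Build one PROCESS_SERVICE_CHECK_RESULT external-command line.

    return_code follows the plugin convention:
    0 = OK, 1 = WARNING, 2 = CRITICAL, 3 = UNKNOWN.
    """
    if return_code not in (0, 1, 2, 3):
        raise ValueError("return_code must be 0, 1, 2 or 3")
    if ";" in host or ";" in service:
        raise ValueError("host and service must not contain ';'")
    stamp = int(time.time()) if now is None else now
    # The command is line-based, so keep the plugin output on a single line.
    clean = output.replace("\n", " ")
    return f"[{stamp}] PROCESS_SERVICE_CHECK_RESULT;{host};{service};{return_code};{clean}\n"


if __name__ == "__main__":
    # Hypothetical names: "prodserver" and "JBoss app" are placeholders.
    print(format_passive_result("prodserver", "JBoss app", 2,
                                "error pattern found in server.log"), end="")
```

Whether this catches outages of a second or so depends on the application actually logging something when it is unavailable, which is an assumption worth verifying first.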