Re: [Nagios-users] too much passive check status coming in
On 11/18/2010 03:11 AM, marc pascual wrote:
> On Thu, Nov 18, 2010 at 3:27 AM, Andreas Ericsson a...@op5.se wrote:
>> On 11/17/2010 08:11 PM, marc pascual wrote:
>>> Hello,
>>>
>>> I have too many passive checks coming in (12*880), to the point that some aren't getting processed by Nagios. I've now decided to rewrite the plugins so that they only report state changes via NSCA (OK-to-CRIT and CRIT-to-OK).
>>>
>>> The problem I see with this move is that if something temporarily hampers an NSCA communication (e.g. transient high load, or a temporary network blip), then there's no guaranteed way for Nagios to know the real state of the service on the next check cycle.
>>>
>>> I wonder if anyone out there has implemented their checks in a similar way, and what solutions or workarounds were used to make sure that Nagios eventually gets updated with the correct states.
>>
>> You want freshness checks. Browse the Nagios documentation for it and you'll find what you're looking for.
>>
>> Welcome to the list btw. It seems things finally worked out for you.
>
> But freshness checks imply that checks should be coming in at a regular interval (unless I'm mistaken). In this case, I have no idea when the next check result will come, as I'm only sending results on state changes.

Ah, right. In that case, check out Merlin. It replaces NSCA with a much faster protocol for transmitting check data, and you get failover for free in case the poller node goes to lunch.

http://www.op5.org/community/plugin-inventory/op5-projects/merlin

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.

--
Beautiful is writing same markup. Internet Explorer 9 supports standards for HTML5, CSS3, SVG 1.1, ECMAScript5, and DOM L2 L3. Spend less time writing and rewriting code and more time creating great experiences on the web. Be a part of the beta today http://p.sf.net/sfu/msIE9-sfdev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
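One way to reconcile state-change-only reporting with freshness checks, offered only as a rough sketch and not as something proposed in the thread: send a result when the state changes, and also resend an unchanged state once the last report is older than a heartbeat interval. Results then arrive at a bounded interval again, so a freshness threshold somewhat above that interval could flag a lost report. The names `StateReporter` and `heartbeat_seconds` are hypothetical. The code only builds the tab-separated line that send_nsca reads on stdin (host, service, return code, output); actually invoking send_nsca is left out.

```python
import time
from typing import Callable, Dict, Optional, Tuple


def nsca_line(host: str, service: str, code: int, output: str) -> str:
    """Format one service result the way send_nsca reads it on stdin."""
    return f"{host}\t{service}\t{code}\t{output}\n"


class StateReporter:
    """Decide when a passive result is worth sending (hypothetical helper)."""

    def __init__(self, heartbeat_seconds: float,
                 clock: Callable[[], float] = time.time) -> None:
        self.heartbeat_seconds = heartbeat_seconds
        self.clock = clock
        # (host, service) -> (last return code sent, time it was sent)
        self.last_sent: Dict[Tuple[str, str], Tuple[int, float]] = {}

    def report(self, host: str, service: str, code: int,
               output: str) -> Optional[str]:
        """Return a send_nsca input line, or None if nothing needs sending."""
        now = self.clock()
        key = (host, service)
        previous = self.last_sent.get(key)
        # Send on the first result and on every change of return code.
        changed = previous is None or previous[0] != code
        # Also resend an unchanged state once the heartbeat interval elapsed.
        stale = (previous is not None
                 and now - previous[1] >= self.heartbeat_seconds)
        if not (changed or stale):
            return None
        self.last_sent[key] = (code, now)
        return nsca_line(host, service, code, output)
```

With 12*880 results per cycle, a heartbeat of, say, ten check cycles would cut the steady-state traffic to roughly a tenth while still giving Nagios a regular signal; the right interval is a judgment call for the site.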
[Nagios-users] Some alerts not getting to sendmail
Good morning, or whatever as the case may be...

I have a Nagios 3.2.1 install which is showing a problem I'm unsure how to troubleshoot further. It's either something simple I'm missing, or a deeper, more difficult problem. Or a transient, to be perhaps put on a shelf until it happens again.

First, the questions:

- Is the notifications log absolute?
  - Meaning, if a notification is shown in this log, it has passed all filters (notification options etc.) and Nagios believes it was submitted to the MTA.
- Is there anywhere besides the MTA's log, status.dat and nagios.log to look for clues to mail problems?

== Details

- Running on FreeBSD 7.0, using stock sendmail on localhost.
- In general, everything is working fine. 125 hosts, 1600-ish services. This system has been up and stable for a few months. Host and service notifications of all kinds go out properly all the time.

Last night, I had a host go down. The notification got to my cell phone and the other contacts it's configured for just fine. This morning, I dealt with the problem host and Nagios showed it back up, but no host up notification reached any of the configured contacts.

The notifications log shows the host up notifications as having been sent. There's nothing in /var/log/maillog for the time Nagios says the notifications were sent. In status.dat, the record for my cell contact has a last_host_notification line with the epoch time of the exact second the notification was, in theory, sent.

Host and template records are included at the bottom of this email. I've included one contact definition, but there were 4 contacts, using 2 different scripts, that should have received the notification. As far as I can see, there is nothing in the host configuration or related templates that would keep a host up notification from being sent.

We use custom host-notify scripts which log actions, and again, there are no entries for the specific problem, but lots of other notifications before and after. These scripts could be the problem, but I want to rule out other issues first.

Thank you for your time,
Tim Palmer

===

Host config:

define host{
    host_name    host.foo.bar.tld
    use          dslam
    alias        Anytown DSLAM
    address      xxx.xxx.xxx.xxx
    parents      another.foo.bar.tld
}

define host{
    name                    dslam
    use                     generic-host
    check_period            24x7
    check_interval          5
    retry_interval          1
    max_check_attempts      10
    check_command           check_dslam_uptime_snmp
    notification_period     24x7
    notification_interval   0
    notification_options    d,u,r
    contact_groups          contact1, contact2
    register                0
}

define host{
    name                            generic-host
    notifications_enabled           1
    event_handler_enabled           1
    flap_detection_enabled          1
    failure_prediction_enabled      1
    process_perf_data               0
    retain_status_information       1
    retain_nonstatus_information    1
    notification_period             24x7
    register                        0
}

Contact:

define contact{
    contact_name    me_text
    use             text-contact
    alias           me_text
    email           npanxx_lf...@txt.smx.gateway
}

define contact{
    name                             text-contact
    use                              generic-contact
    service_notification_options     c,r,w
    service_notification_commands    notify-by-textmessage-service
    host_notification_commands       notify-by-textmessage-host
    register                         0
}

define contact{
    name                             generic-contact
    service_notification_period      24x7
    host_notification_period         24x7
    service_notification_options     c,r
    host_notification_options        d,r
    service_notification_commands    notify-by-email-service
    host_notification_commands       notify-by-email-host
    register                         0
}
Re: [Nagios-users] Some alerts not getting to sendmail
On 11/18/2010 03:48 PM, Tim Palmer wrote:
> Good morning, or whatever as the case may be...
>
> I have a Nagios 3.2.1 install which is showing a problem I'm unsure how to troubleshoot further. [...]
>
> First, the questions:
>
> - Is the notifications log absolute?
>   - Meaning, if a notification is shown in this log, it has passed all filters (notification options etc.) and Nagios believes it was submitted to the MTA.

Yes.

> - Is there anywhere besides the MTA's log, status.dat and nagios.log to look for clues to mail problems?

The receiving end comes to mind, or any server(s) in between.

> [...]
>
> We use custom host-notify scripts which log actions, and again, there are no entries for the specific problem, but lots of other notifications before and after. These scripts could be the problem, but I want to rule out other issues first.

Notifications are a pretty integral part of what makes Nagios worth anything at all. Since you're using homebrewed scripts and no one else has reported any problems with notifications, I suggest you first debug your own scripts, or enable debug logging for notifications. The docs will tell you how to do that. It won't help for this occurrence of the failed notifications, but it will definitely help you in the future if it ever happens again.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Re: [Nagios-users] Some alerts not getting to sendmail
Andreas Ericsson wrote:
> On 11/18/2010 03:48 PM, Tim Palmer wrote:
>> - Is the notifications log absolute?
>>   - Meaning, if a notification is shown in this log, it has passed all filters (notification options etc.) and Nagios believes it was submitted to the MTA.
>
> Yes.

Excellent, thank you. That's the critical bit for me regarding Nagios.

>> - Is there anywhere besides the MTA's log, status.dat and nagios.log to look for clues to mail problems?
>
> The receiving end comes to mind, or any server(s) in between.
>
>> [...]
>>
>> We use custom host-notify scripts which log actions, and again, there are no entries for the specific problem, but lots of other notifications before and after. These scripts could be the problem, but I want to rule out other issues first.
>
> Notifications are a pretty integral part of what makes Nagios worth anything at all. Since you're using homebrewed scripts and no one else has reported any problems with notifications, I suggest you first debug your own scripts, or enable debug logging for notifications. The docs will tell you how to do that. It won't help for this occurrence of the failed notifications, but it will definitely help you in the future if it ever happens again.

Agreed on all counts. Now that you've confirmed the final-ness of the notifications log, I am comfortable looking outside Nagios to the scripts, system and sendmail. I'm sure there's a reasonable, logical explanation for a small subset of mail not getting from Nagios to the local MTA...

Thank you
Tim
[Nagios-users] More than one command defined
Hi,

I need to run two different commands when processing the performance data from Nagios (3.2.2). I want the first command to run the process-perfdata.pl script from pnp4nagios and the second command to log various things to a different log file.

The reason is that we've only just had a requirement to process performance data for pnp4nagios graphing. Historically, we've created our own performance data log, which is then used within a database to produce SLA reports developed in-house (no graphing, just numbers produced for the SLA reports).

If I do the following, I don't get any errors when I run nagios -v nagios.cfg:

define command {
    command_name    process-service-perfdata-file
    command_line    first command - process performance data for pnp4nagios
    command_line    second command - log different data to a different file
}

I'd like to avoid creating a script with both commands in it if I can help it, as it's extremely useful to keep it simple and visible in the misccommands.cfg file. Perhaps there is some other way I can do this, or is a script with both commands really the only way?

Any help / pointers would be really appreciated. If I'm being dense about this and the answer is staring right at me, feel free to say so!

Thanks,
Deborah

***
This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. Any unauthorised distribution or copying is strictly prohibited. Whilst Kognitio Limited takes steps to prevent the transmission of viruses via e-mail, we can not guarantee that any email or attachment is free from computer viruses and you are strongly advised to undertake your own anti-virus precautions. Kognitio grants no warranties regarding performance, use or quality of any e-mail or attachment and undertakes no liability for loss or damage, howsoever caused. Kognitio Limited, a company registered in England and Wales. Registered number 0212 7833. Registered Office: 3a Waterside Park, Cookham Road, Bracknell, Berks, RG12 1RB. VAT number 864 4378 92. Kognitio Inc, a company incorporated in Delaware, principal office 180 North Stetson, Suite 3500, Chicago, IL 60601, USA
***
Re: [Nagios-users] More than one command defined
On 11/18/2010 05:25 PM, Deborah Martin wrote:
> Hi,
>
> I need to run two different commands when processing the performance data from Nagios (3.2.2). I want the first command to run the process-perfdata.pl script from pnp4nagios and the second command to log various things to a different log file.
>
> [...]
>
> If I do the following, I don't get any errors when I run nagios -v nagios.cfg:
>
> define command {
>     command_name    process-service-perfdata-file
>     command_line    first command - process performance data for pnp4nagios
>     command_line    second command - log different data to a different file
> }

I'm not surprised. This is a sort of unintentional fallout that comes from using the template system the way it's defined today. I guess only the second command_line will get run though, right?

> I'd like to avoid creating a script with both commands in it if I can help it, as it's extremely useful to keep it simple and visible in the misccommands.cfg file. Perhaps there is some other way I can do this, or is a script with both commands really the only way?

You could use Merlin and set the perfdata_table option in the database section of the daemon compound. Then you can grab the perfdata items from the Merlin database. Note that you'll have to drop old entries from that table every once in a while though, as the table will surely grow huge otherwise.

That doesn't really let you have things visible in the misccommands.cfg file though, so you'll have to pick your poison on this one.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
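For the "script with both commands" fallback raised in the question, a minimal sketch of such a wrapper might look like the following. It is an illustration only: the function name `run_all` is hypothetical, the two real perfdata commands are not known from the thread and would have to be supplied as argument lists, and nothing here is taken from Nagios or pnp4nagios itself.

```python
import subprocess
from typing import List, Sequence


def run_all(commands: Sequence[List[str]]) -> int:
    """Run every command in order; return the largest exit status seen."""
    worst = 0
    for argv in commands:
        # Each command is an argument list, so no shell parsing is involved.
        result = subprocess.run(argv)
        # A negative return code means the child was killed by a signal;
        # abs() makes that count as a failure too.
        worst = max(worst, abs(result.returncode))
    return worst
```

A single command_line could then call the wrapper, which keeps one visible entry in misccommands.cfg while the two steps live in the script. Later commands still run if an earlier one fails, which seems the safer default for two independent logging paths, though that is a design choice rather than a requirement.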
Re: [Nagios-users] More than one command defined
I'm looking at the same issue, and our thinking is to use NDOUtils or possibly a custom NEB module to grab perf data that way, while still letting pnp4nagios run the 'normal' way.

Dan
Re: [Nagios-users] Nagios Core 3.2.3 host check retry interval
On Tue, 2010-11-16 at 22:52 +0100, Andreas Ericsson wrote:
> That one was in 3.2.2 too though. Could you try un-commenting the lines mentioned there and see if that helps?

It looks like something weird is still happening after making that change. I checked some more hosts and the retry_interval is low, but only for HOST UP alerts.

[11-18-2010 01:23:31] SERVICE ALERT: hcsprodnwweb5;Service: Epilog;CRITICAL;SOFT;1;CRITICAL - Socket timeout after 10 seconds
[11-18-2010 01:23:41] HOST ALERT: hcsprodnwweb5;DOWN;SOFT;1;CRITICAL - 10.3.2.177: rta nan, lost 100%
[11-18-2010 01:24:01] HOST ALERT: hcsprodnwweb5;UP;SOFT;2;OK - 10.3.2.177: rta 1.943ms, lost 0%

[11-18-2010 01:32:51] HOST ALERT: wwwhost;DOWN;SOFT;2;CRITICAL - 10.3.1.11: rta nan, lost 100%
[11-18-2010 01:34:02] HOST ALERT: wwwhost;DOWN;HARD;3;CRITICAL - 10.3.1.11: rta nan, lost 100%
[11-18-2010 01:34:21] HOST ALERT: wwwhost;UP;HARD;1;OK - 10.3.1.11: rta 115.733ms, lost 20%

But sometimes it works the way I expect it to.

[11-18-2010 01:38:41] HOST ALERT: wwwhost;DOWN;SOFT;2;CRITICAL - 10.3.1.11: rta nan, lost 100%
[11-18-2010 01:39:51] HOST ALERT: wwwhost;DOWN;HARD;3;CRITICAL - 10.3.1.11: rta 488.367ms, lost 80%
[11-18-2010 01:49:21] HOST ALERT: wwwhost;UP;HARD;1;OK - 10.3.1.11: rta 31.928ms, lost 0%

I'm going to try reverting to Nagios 3.2.1 to see what happens. It's possible I had the problem then but never noticed.

Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you.
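To compare the gaps between host alerts without doing the arithmetic by eye, a short sketch like the one below could help. It assumes log lines in the human-readable timestamp form quoted in this message (a raw nagios.log may use a different timestamp form), and the names `ALERT_RE` and `alert_gaps` are hypothetical.

```python
import re
from datetime import datetime
from typing import Dict, List, Tuple

# Matches the "[MM-DD-YYYY HH:MM:SS] HOST ALERT: host;state;type;attempt;" prefix.
ALERT_RE = re.compile(
    r"^\[(?P<ts>\d{2}-\d{2}-\d{4} \d{2}:\d{2}:\d{2})\] HOST ALERT: "
    r"(?P<host>[^;]+);(?P<state>[^;]+);(?P<type>[^;]+);(?P<attempt>\d+);"
)


def alert_gaps(lines: List[str]) -> Dict[str, List[Tuple[str, int]]]:
    """Map each host to (transition, seconds since its previous host alert)."""
    previous: Dict[str, Tuple[datetime, str]] = {}
    gaps: Dict[str, List[Tuple[str, int]]] = {}
    for line in lines:
        match = ALERT_RE.match(line)
        if match is None:
            continue  # service alerts and unrelated lines are skipped
        when = datetime.strptime(match["ts"], "%m-%d-%Y %H:%M:%S")
        label = f'{match["state"]};{match["type"]};{match["attempt"]}'
        host = match["host"]
        if host in previous:
            prev_when, prev_label = previous[host]
            seconds = int((when - prev_when).total_seconds())
            gaps.setdefault(host, []).append((f"{prev_label} -> {label}", seconds))
        previous[host] = (when, label)
    return gaps


# The first excerpt from the message above.
LOG = [
    "[11-18-2010 01:23:31] SERVICE ALERT: hcsprodnwweb5;Service: Epilog;CRITICAL;SOFT;1;CRITICAL - Socket timeout after 10 seconds",
    "[11-18-2010 01:23:41] HOST ALERT: hcsprodnwweb5;DOWN;SOFT;1;CRITICAL - 10.3.2.177: rta nan, lost 100%",
    "[11-18-2010 01:24:01] HOST ALERT: hcsprodnwweb5;UP;SOFT;2;OK - 10.3.2.177: rta 1.943ms, lost 0%",
    "[11-18-2010 01:32:51] HOST ALERT: wwwhost;DOWN;SOFT;2;CRITICAL - 10.3.1.11: rta nan, lost 100%",
    "[11-18-2010 01:34:02] HOST ALERT: wwwhost;DOWN;HARD;3;CRITICAL - 10.3.1.11: rta nan, lost 100%",
    "[11-18-2010 01:34:21] HOST ALERT: wwwhost;UP;HARD;1;OK - 10.3.1.11: rta 115.733ms, lost 20%",
]

for host, entries in alert_gaps(LOG).items():
    for transition, seconds in entries:
        print(f"{host}: {transition}: {seconds}s")
```

On this excerpt the DOWN-to-UP gaps come out at roughly 20 seconds, against about 70 seconds between the two DOWN alerts; in the second excerpt the same calculation gives 570 seconds before the UP alert.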
[Nagios-users] macro for notified contacts
Is there a macro that expands to the list of contacts that have been, or will be, notified in a host or service notification? I'm currently using $NOTIFICATIONRECIPIENTS$, but it doesn't expand to only the contacts that are actually notified: if a contact's notification period filters them out, they still show up in $NOTIFICATIONRECIPIENTS$.

IMPORTANT: The information contained in this email and/or its attachments is confidential. If you are not the intended recipient, please notify the sender immediately by reply and immediately delete this message and all its attachments. Any review, use, reproduction, disclosure or dissemination of this message or any attachment by an unintended recipient is strictly prohibited. Neither this message nor any attachment is intended as or should be construed as an offer, solicitation or recommendation to buy or sell any security or other financial instrument. Neither the sender, his or her employer nor any of their respective affiliates makes any warranties as to the completeness or accuracy of any of the information contained herein or that this message or any of its attachments is free of viruses.
[Nagios-users] SNMP checks, using multiple snmpgets
Hi,

I'm trying to make my SNMP monitoring a bit more informative. I want to monitor port status on Ethernet switches (which I've tested, and it works quite nicely). I'm using the instructions given here:

http://www.thegeekstuff.com/2008/11/how-to-monitor-network-switch-and-ports-using-nagios/

What would be nice is that in the output for a given port, rather than just saying up or down, I'd also like to include the output from ifDescr along with the port status. Logically this would seem to be possible, given that the how-to above shows, in example 7, a config whereby you monitor multiple ports in a single command.

I tried this config:

check_command check_snmp!-C public -o ifDescr.52 -m RFC1213-MIB -o ifOperStatus.52 -r 1 -m RFC1213-MIB

The output of the command is fine: I see the string returned from ifDescr first, then the status of the port. But Nagios takes this as an error state and marks the service as permanently faulted.

If anyone can understand that, do you know if what I want is possible to achieve? :P

Thanks!
Andy.
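One possible approach, sketched here under assumptions and not taken from the how-to or from check_snmp itself, is a small custom plugin that fetches the two values separately and lets only ifOperStatus decide the exit code, with ifDescr used purely as descriptive text. The function name `port_status` is hypothetical, and how the two values are retrieved (for example with snmpget) is deliberately left out. The status values up(1) and down(2) follow the ifOperStatus definition in RFC 1213.

```python
from typing import Tuple

OK, CRITICAL, UNKNOWN = 0, 2, 3  # standard Nagios plugin exit codes


def port_status(if_descr: str, oper_status: str) -> Tuple[int, str]:
    """Build a plugin result from an ifDescr string and an ifOperStatus value."""
    value = oper_status.strip()
    # Accept the numeric form "1" as well as MIB-translated forms like "up(1)".
    if value in ("1", "up(1)", "up"):
        return OK, f"OK - {if_descr} is up"
    if value in ("2", "down(2)", "down"):
        return CRITICAL, f"CRITICAL - {if_descr} is down"
    # Anything else (testing, unexpected text) is reported as UNKNOWN.
    return UNKNOWN, f"UNKNOWN - {if_descr} reports ifOperStatus {value}"
```

A plugin built around this would print the message and exit with the returned code, so the description appears in the service output without influencing the state.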
[Nagios-users] server profiling options
I'm looking at minimizing the CPU impact that Nagios has on our server and have done some of the basic performance tuning, but what I see right now is a lot of the Nagios worker procs using a lot of CPU. I'm curious whether there is a way people have used to watch what those processes and threads are doing, to find what is taking the most cycles and try to reduce it.

Thanks!
Dan
Re: [Nagios-users] hostgroup with no members - Enhancement
I was curious if we were able to make any progress on getting this patch included?

Thanks!
Dan

-----Original Message-----
From: Max Schubert [mailto:m...@webwizarddesign.com]
Sent: Wednesday, November 03, 2010 7:04 AM
To: Andreas Ericsson; Nagios Users List
Subject: Re: [Nagios-users] hostgroup with no members - Enhancement

Will do my best to do this this week; if I don't get to it this week I will forget about it for sure. All the patches our team has made to get Nagios working for us (some very, very specific to our env) will be available on our team GitHub account. I will do my best to get that updated this week as well and post the URL along with the patch / docs to this list, or I will forget to do that for another few months too.

Max

On 11/3/10, Andreas Ericsson a...@op5.se wrote:
> On 11/02/2010 04:54 PM, Max Schubert wrote:
>> allow_empty_hostgroups = 1
>
> I'll take this, but rename the option to something more mnemonic, as empty hostgroups are already allowed and it's a bit ambiguous what it means.
>
> Any chance you could re-roll the patch against the latest CVS head and include a brief documentation update? Name the var allow_empty_hostgroups in the docs so I can find it when I sed it in.
>
> If you want any special attributions (company or person thanks, beyond the addition of your name to the THANKS file and your name and email in the commit log message), let me know.
>
> --
> Andreas Ericsson andreas.erics...@op5.se
> OP5 AB www.op5.se
Re: [Nagios-users] Some alerts not getting to sendmail
Tim Palmer wrote: Andreas Ericsson wrote: On 11/18/2010 03:48 PM, Tim Palmer wrote:

Good morning, or whatever as the case may be... I have a Nagios 3.2.1 install which is showing a problem I'm unsure how to troubleshoot further. It's either something simple I'm missing, or a deeper, more difficult problem. Or a transient to be perhaps put on a shelf until it happens again.

First, the questions:

- Is the notifications log absolute? Meaning, if a notification is shown in this log, it has passed all filters (notification options etc.) and Nagios believes it was submitted to the MTA.

Yes.

Excellent, thank you. That's the critical bit for me regarding Nagios.

- Is there anywhere besides the MTA's log, status.dat and nagios.log to look for clues to mail problems?

The receiving end comes to mind, or any server(s) in between.

== Details

- Running on FreeBSD 7.0, using stock sendmail on localhost.
- In general, everything is working fine. 125 hosts, 1600-ish services. This system has been up and stable for a few months. Host and service notifications of all kinds go out properly all the time.

Last night, I had a host go down. The notification got to my cell phone and the other contacts it's configured for just fine. This morning, I dealt with the problem host and Nagios showed it back up. But no host up notification reached any of the configured contacts.

The notifications log shows the host up notifications as having been sent. There's nothing in /var/log/maillog for the time Nagios says the notifications were sent. In status.dat, the record for my cell contact has a last_host_notification line with the epoch time version of the exact second the notification was in theory sent.

Host and template records included at the bottom of this email. I've included one contact def, but there were 4 contacts, using 2 different scripts, that should have received the notification.

As far as I can see, there is nothing in the host configuration or related templates that would keep a host up notification from being sent. We use custom host-notify scripts which log actions, and again, there are no entries for the specific problem, but lots of other notifications before and after. These scripts could be the problem, but I want to rule out other issues first.

Notifications are a pretty integral part of what makes Nagios worth anything at all. Since you're using homebrewed scripts and no one else has reported any problems with them, I suggest you first debug your own scripts, or enable debug logging for notifications. The docs will tell you how to do that. It won't help for this occurrence of the failed notifications, but it will definitely help you in the future if it ever happens again.

Agreed on all counts. Now that you've confirmed the final-ness of the notifications log, I am comfortable looking outside Nagios to the scripts, system and sendmail. I'm sure there's a reasonable, logical explanation for a small subset of mail not getting from Nagios to the local MTA...

Thank you
Tim

Note to self, and whoever else might be listening: properly quoting plugin output before releasing it into the shell is a Good Thing. Blaming ex-employees for the oversight is tempting, but cowardly. And Tim's Trouble Shooting Rule #1 holds again: "It's your fault, find what you did wrong", or, "Never forget you're an idiot".

Tim
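Tim's closing note about quoting plugin output can be illustrated with a small sketch. This is not Tim's script (which was never posted); the function names are hypothetical, and the child process is only a stand-in for a real notification command. It shows two common ways to keep plugin output from being interpreted by the shell: quoting it with `shlex.quote`, or skipping the shell entirely by passing it as one argv element.

```python
import shlex
import subprocess
import sys


def quoted_for_shell(plugin_output):
    """Return the plugin output as one safely quoted /bin/sh word."""
    return shlex.quote(plugin_output)


def notify_without_shell(plugin_output):
    """Hand the output to a child process as a single argv element.

    The child here is a stand-in for a real notification command: it
    just echoes its first argument back. No shell ever parses the text.
    """
    child = [sys.executable, "-c", "import sys; print(sys.argv[1])", plugin_output]
    result = subprocess.run(child, capture_output=True, text=True, check=True)
    return result.stdout.rstrip("\n")
```

Output containing apostrophes, backticks, `$(...)` or `&` is the typical trigger: unquoted, the shell may fail to parse the command line or run something unintended, and the notification never reaches the MTA even though Nagios has logged it as sent.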
Re: [Nagios-users] More than one command defined
Thanks for the input everyone. Much appreciated. I'll try with Merlin and see where that goes. And the script idea as well. The trouble is Nagios is a complicated beast (to me anyway). Couple that with getting pnp4nagios working as well this week, and my head hurts from config file overload :) Anyway, all food for thought! Thanks again.

Deborah

-Original Message-
From: Andreas Ericsson [mailto:a...@op5.se]
Sent: 18 November 2010 17:10
To: Nagios Users List
Cc: Deborah Martin
Subject: Re: [Nagios-users] More than one command defined

On 11/18/2010 05:25 PM, Deborah Martin wrote:

Hi, I have a need to run two different commands when processing the performance data from Nagios (3.2.2). I want the first command to run the process-perfdata.pl script from pnp4nagios and the second command to log various things to a different log file. The reason is that we've only just had a requirement to process performance data for pnp4nagios graphing. But historically we've created our own performance data log, which is then used within a database to produce SLA reports developed in-house (no graphing, just numbers produced for the SLA reports).

If I do the following I don't get any errors when I run nagios -v nagios.cfg:

define command {
    command_name  process-service-perfdata-file
    command_line  first command - process performance data for pnp4nagios
    command_line  second command - log different data to a different file
}

I'm not surprised. This is a sort of unintentional fallout that comes from using the template system the way it's defined today.

I guess only the second command_line will get run though, right? I'd like to avoid creating a script with both commands in it if I can help it, as it's extremely useful to keep it simple and visible in the misccommands.cfg file. Perhaps there is some other way I can do this, or is a script with both commands really the only way?

You could use Merlin and set the perfdata_table option in the database section of the daemon compound. Then you can grab the perfdata items from the Merlin database. Note that you'll have to drop old entries from that table every once in a while though, as the table will surely grow huge otherwise. That doesn't really let you have things visible in the misccommands.cfg file though, so you'll have to pick your poison on this one.

-- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231
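For the "script with both commands" route discussed in this thread, a minimal wrapper might look like the sketch below. It is only an illustration: `run_all` is a hypothetical helper, and the two real commands (the pnp4nagios process-perfdata.pl call and the in-house logging command) are not shown in the thread, so they are left for the caller to supply as argv lists.

```python
import subprocess
import sys


def run_all(commands):
    """Run every argv list in order and return their exit codes.

    A command that cannot be started is recorded as 127 so that the
    remaining commands still run.
    """
    codes = []
    for argv in commands:
        try:
            codes.append(subprocess.run(argv).returncode)
        except OSError:
            codes.append(127)
    return codes
```

A single command_line would then call the wrapper, which keeps the command definition itself short, at the cost Deborah mentions: the two underlying commands are no longer visible in misccommands.cfg.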
Re: [Nagios-users] server profiling options
On Thu, Nov 18, 2010 at 3:18 PM, Daniel Wittenberg daniel.wittenberg.r...@statefarm.com wrote:

I’m looking at minimizing the CPU impact that Nagios has on our server, and have done some of the basic performance tuning, but what I see right now is a lot of the Nagios worker procs using a lot of CPU. I'm curious whether there is a way people have used to watch what those processes and threads are doing, to find what might be taking the most cycles and try to reduce it?

If you have a large number of monitored hosts and services, you may want to look into use_large_installation_tweaks: http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html

-- Breandan Dezendorf brean...@dezendorf.com bwdez...@gmail.com
Re: [Nagios-users] server profiling options
Yup, already enabled...

Dan

-Original Message-
From: Breandan Dezendorf [mailto:brean...@dezendorf.com]
Sent: Thursday, November 18, 2010 3:05 PM
To: Nagios Users List
Subject: Re: [Nagios-users] server profiling options

On Thu, Nov 18, 2010 at 3:18 PM, Daniel Wittenberg daniel.wittenberg.r...@statefarm.com wrote:

I'm looking at minimizing the CPU impact that Nagios has on our server, and have done some of the basic performance tuning, but what I see right now is a lot of the Nagios worker procs using a lot of CPU. I'm curious whether there is a way people have used to watch what those processes and threads are doing, to find what might be taking the most cycles and try to reduce it?

If you have a large number of monitored hosts and services, you may want to look into use_large_installation_tweaks: http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html

-- Breandan Dezendorf brean...@dezendorf.com bwdez...@gmail.com
Re: [Nagios-users] server profiling options and pre-caching macros
Watching Nagios in debug mode for a while made me think about the processing of macros and its performance impact. It seems that most macros, like user-defined ones, don't change. So does it really make sense to keep processing them over and over and over? Could that be something that is done in the pre-cache stage when Nagios starts? Would that cause more problems?

Dan

-Original Message-
From: Breandan Dezendorf [mailto:brean...@dezendorf.com]
Sent: Thursday, November 18, 2010 3:05 PM
To: Nagios Users List
Subject: Re: [Nagios-users] server profiling options

On Thu, Nov 18, 2010 at 3:18 PM, Daniel Wittenberg daniel.wittenberg.r...@statefarm.com wrote:

I'm looking at minimizing the CPU impact that Nagios has on our server, and have done some of the basic performance tuning, but what I see right now is a lot of the Nagios worker procs using a lot of CPU. I'm curious whether there is a way people have used to watch what those processes and threads are doing, to find what might be taking the most cycles and try to reduce it?

If you have a large number of monitored hosts and services, you may want to look into use_large_installation_tweaks: http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html

-- Breandan Dezendorf brean...@dezendorf.com bwdez...@gmail.com
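Dan's pre-caching idea can be sketched in a few lines. This is a toy model, not how Nagios expands macros internally: `expand` and `precache` are hypothetical helpers, and the plugin path is an assumed value. Static macros such as $USER1$ are substituted once at startup, while per-check macros such as $HOSTADDRESS$ are left in place and filled in at run time.

```python
import re

# Matches $NAME$ style macros, e.g. $USER1$ or $HOSTADDRESS$.
MACRO = re.compile(r"\$([A-Z0-9_]+)\$")


def expand(template, macros):
    """Replace every known $NAME$ with its value; leave unknown ones intact."""
    return MACRO.sub(lambda m: macros.get(m.group(1), m.group(0)), template)


def precache(command_lines, static_macros):
    """Expand static macros once, at startup, for every command line."""
    return {name: expand(line, static_macros) for name, line in command_lines.items()}
```

With `{"check_ping": "$USER1$/check_ping -H $HOSTADDRESS$"}` as input, `precache` resolves $USER1$ once, and each later check only has to call `expand` for the remaining volatile macros. The saving depends on how much of the per-check cost is macro expansion, which the thread does not measure.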