Re: [Nagios-users] check_ntp_peer parsing error
On Sat, Jun 29, 2013 at 04:01:55PM +0200, Holger Weiß wrote: JFTR: * Marc Haber mh+nagios-us...@zugschlus.de [2013-06-26 21:22]: I have a system running Debian oldstable with Nagios-Plugins 1.4.15. A few weeks ago, my check_ntp_peer checks have started acting up: $ /usr/lib/nagios/plugins/check_ntp_peer --hostname=2001:1b18:f:4::2 --warning 3 --critical 5 --jwarn 10 --jcrit 20 --twarn 2: --tcrit 3: --swarn 2 --scrit 2 -v 3 candidate peers available synchronization source found Getting offset, jitter and stratum for peer e20a parsing offset from peer e20a: error: unable to read server offset response. parsing jitter from peer e20a: error: unable to read server jitter/dispersion response. parsing stratum from peer e20a: error: unable to read server stratum response. NTP CRITICAL: Offset unknown, jitter=-1,00, stratum=-1, truechimers=6| jitter=-1,00;10,00;20,00;0,00 stratum=-1;2;2;0;16 truechimers=6;0;0;0 This is probably caused by a bug in Force10 switches mentioned here: http://news.ntppool.org/2013/06/ipv6-monitoring-problems-for-g.html Due to that bug, the check_ntp_peer requests got duplicated on their way to the server, and the server therefore sent multiple responses per request. check_ntp_peer then stumbled over those duplicated responses. That's a bug, I'll fix it later today. Thanks to Marc for providing tcpdump output and for his help with tracking the issue down. This was really fun to investigate. Thanks for help ans inspiration. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 31958061 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 31958062 -- This SF.net email is sponsored by Windows: Build for Windows Store. http://p.sf.net/sfu/windows-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] check_ntp_peer parsing error
Hi, I have a system running Debian oldstable with Nagios-Plugins 1.4.15. A few weeks ago, my check_ntp_peer checks have started acting up: $ /usr/lib/nagios/plugins/check_ntp_peer --hostname=2001:1b18:f:4::2 --warning 3 --critical 5 --jwarn 10 --jcrit 20 --twarn 2: --tcrit 3: --swarn 2 --scrit 2 -v 3 candidate peers available synchronization source found Getting offset, jitter and stratum for peer e20a parsing offset from peer e20a: error: unable to read server offset response. parsing jitter from peer e20a: error: unable to read server jitter/dispersion response. parsing stratum from peer e20a: error: unable to read server stratum response. NTP CRITICAL: Offset unknown, jitter=-1,00, stratum=-1, truechimers=6| jitter=-1,00;10,00;20,00;0,00 stratum=-1;2;2;0;16 truechimers=6;0;0;0 The server itself is reachable and gives plausible answers: $ ntpq -c pe 2001:1b18:f:4::2 remote refid st t when poll reach delay offset jitter == +ptbtime1.ptb.de .PTB.1 u 17 64 177 26.8911.663 1.340 +ptbtime2.ptb.de .PTB.1 u 14 64 177 27.015 -0.286 1.271 -ns1.customer-re 192.53.103.104 2 u 18 64 1778.0543.469 0.835 -130.149.220.2 130.133.1.10 2 u 20 64 177 21.243 -0.139 1.154 *ntp0.rrze.ipv6. .GPS.1 u 19 64 177 21.170 -5.249 1.345 -stratum2-2.NTP. 129.70.130.702 u 15 64 177 21.216 -2.147 1.039 $ ntpq -c associations 2001:1b18:f:4::2 ind assid status conf reach auth condition last_event cnt === 1 57862 943a yes yes none candidatesys_peer 3 2 57863 9424 yes yes none candidate reachable 2 3 57864 9324 yes yes none outlyer reachable 2 4 57865 9324 yes yes none outlyer reachable 2 5 57866 963a yes yes none sys.peersys_peer 3 6 57867 9324 yes yes none outlyer reachable 2 $ This behavior does not happen with all of my check_ntp_peer checks. I have not yet found out under which circumstances this behavior happens. For your reference, I have currently opened the ntp server on the IPv6 address listed above for in-depth queries from anywhere. What is going wrong? Is this a bug with check_ntp_peer? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 31958061 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 31958062 -- This SF.net email is sponsored by Windows: Build for Windows Store. http://p.sf.net/sfu/windows-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How to deal with hosts that are not always on
On Thu, Aug 30, 2012 at 03:34:49AM -0700, Patrick Morris wrote: On 7/29/2012 1:51 AM, Marc Haber wrote: I have a number of hosts that are not always turned on, such as my Notebook, my home server (which sleeps if not used) and a number of test VMs. To get a reasonable volume of checks in my test installation of Nagios, I would like to have those machines monitored by it. Obviously, I have notifications for those hosts turned off. This, however, leads to non-zero numbers in the critical, warning and unknown fields in the CGI, which gives a wrong impression of my health status since it is normal and expected that those hosts/services are down. Is there any solution for this dilemma? Can I have Nagios monitor hosts without reporting them as down when they are down? I know this may sound strange, but I would like to have statistics about how long the machines have been up and when. If you want to know how long the machines have been up and when, don't you also need to know when they're not up? I'm not sure how you'd get any idea of your uptimes if you completely ignore downtimes -- Nagios would think they'd always been up, and you'd have no usable statistics. You're right, but this is only my play installation, so it delivers not useable statistics anyway. Otoh, the case host up, but no ssh server would be a reportable offense, for example. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 31958061 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 31958062 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How to deal with hosts that are not always on
On Thu, Aug 30, 2012 at 01:11:45PM +0200, Werner Flamme wrote: we use action handler scripts to schedule downtime in this case. very dirty hack, but neat. Thanks. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 31958061 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 31958062 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] How to deal with hosts that are not always on
Hi, I have a number of hosts that are not always turned on, such as my Notebook, my home server (which sleeps if not used) and a number of test VMs. To get a reasonable volume of checks in my test installation of Nagios, I would like to have those machines monitored by it. Obviously, I have notifications for those hosts turned off. This, however, leads to non-zero numbers in the critical, warning and unknown fields in the CGI, which gives a wrong impression of my health status since it is normal and expected that those hosts/services are down. Is there any solution for this dilemma? Can I have Nagios monitor hosts without reporting them as down when they are down? I know this may sound strange, but I would like to have statistics about how long the machines have been up and when. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 31958061 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 31958062 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_ntp_peer reports stratum off-by-one?
Hi, On Thu, Jul 19, 2012 at 02:51:31PM +0200, Holger Weiß wrote: * Marc Haber mh+nagios-us...@zugschlus.de [2012-07-14 21:02]: $ ntpq -c pe torres.zugschlus.de remote refid st t when poll reach delay offset jitter == *ptbtime1.ptb.de .PTB.1 u 1044 1024 377 18.521 -1.513 0.258 +ptbtime2.ptb.de .PTB.1 u 903 1024 377 18.132 -1.277 0.373 -node01.mserverz 85.10.240.2533 u 651 1024 377 18.721 -7.067 6.829 zefiris.semidef 192.53.103.108 2 u 490 1024 377 16.4964.906 0.680 -evie.fra1.as411 78.47.148.1743 u 974 1024 377 18.8794.630 0.616 +geiger-online.c 81.94.123.17 2 u 556 1024 377 18.2004.669 0.694 $ /usr/lib/nagios/plugins/check_ntp_peer -H torres.zugschlus.de --warning=3 --critical=5 --jwarn=1 --jcrit=2 --twarn=5 --tcrit=5 --swarn=2 --scrit=2 NTP OK: Offset -0,001 secs, jitter=0,00, stratum=1, truechimers=5|offset=-0,001000s;3,00;5,00; jitter=0,00;1,00;2,00;0,00 stratum=1;2;2;0;16 truechimers=5;5;5;0 $ Does this mean that the --swarn und --twarn options actually check the stratum of the best peer of the queried server, not the queried server itself? The --swarn and --scrit options (not --twarn), yes.¹ ¹ http://permalink.gmane.org/gmane.network.nagios.plugins/4333 I would be in favour of sending a second query to get the results right. Correcting the documentation to reflect what the software really does is only the second best solution. The current combination of documentation and software behavior is confusing at best. And, while we're at it, why is this: $ /usr/lib/nagios/plugins/check_ntp_peer -H torres.zugschlus.de --twarn=5 --tcrit=4 -v 3 candidate peers available synchronization source found Getting offset, jitter and stratum for peer 20cf parsing offset from peer 20cf: -0,001 NTP CRITICAL: Offset -0,001 secs, truechimers=5|offset=-0,001000s;60,00;120,00; truechimers=5;5;4;0 $ critical? --tcrit=4 is equivalent to --tcrit=0:4 and tells the plugin to raise a critical alert if the number of truechimers is outside the range {0..4}. Now that you mention it, things are clear. And it is also mentioned in the --help. But, is the example, -m :5 -n :3 not the other way round? Thanks for helping. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 31958061 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 31958062 -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Nagios monitor Windows diskspace
Hi all. I am currently deploying a Nagios installation to monitor several hundreds of nodes within our company. I installed NSClient++ on the WXP machines and configured Nagios to query it. Standard monitoring plugins work flawlessly (CPU load, Memory usage, NSC++ version), but I can't seem to make the disk space check work. The error reported by nagios is wrong -l argument and the configuration is as follows: define service{ use generic-service hostgroup_name win-workstations service_description C:\ Drive Space check_command check_nt!USEDDISKSPACE!-l c:\ -w 80 -c 90 } Any clues as why it's not working? Marc -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios monitor Windows diskspace
On Thu, June 30, 2011 1:51 pm, Assaf Flatto wrote: Change the command to be : check_command check_nt!USEDDISKSPACE!-l c -w 80 -c 90 (drop the :\) Marc Haber wrote: Hi all. I am currently deploying a Nagios installation to monitor several hundreds of nodes within our company. I installed NSClient++ on the WXP machines and configured Nagios to query it. Standard monitoring plugins work flawlessly (CPU load, Memory usage, NSC++ version), but I can't seem to make the disk space check work. The error reported by nagios is wrong -l argument and the configuration is as follows: define service{ use generic-service hostgroup_name win-workstations service_description C:\ Drive Space check_command check_nt!USEDDISKSPACE!-l c:\ -w 80 -c 90 } Any clues as why it's not working? Marc Thanks for the response! Unfortunately it still returns the same error. Should I enable anything in particular in NSC++ conf files? Marc -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] IPv6 support
On Fri, Jun 10, 2011 at 01:02:04PM +0200, Andreas Ericsson wrote: In fact, not being able to handle the more-than-one-IP-per-host in a non-complex way is one of the greatest beefs I have with Nagios (see web server, or a host which serves multiple services on different IP addresses). You can, but with custom variables. How many official addresses should Nagios support? 3? 5? It should be an array. Arbitrary limits suck. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- EditLive Enterprise is the world's most technically advanced content authoring tool. Experience the power of Track Changes, Inline Image Editing and ensure content is compliant with Accessibility Checking. http://p.sf.net/sfu/ephox-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] IPv6 support
On Thu, Jun 09, 2011 at 06:19:40PM +0200, Andreas Ericsson wrote: Why? If the host is reachable via ip6, it's reachable via ip6 and that's what you configure. If it's not, you configure ip4 instead. So one will need to configure a dual stack host twice, once for IPv4, and once for IPv6, and resort to fancy parent-host or check_multi setups to not be alarmed twice if it's the _host_ being down, and not one of the IP procotols? In fact, not being able to handle the more-than-one-IP-per-host in a non-complex way is one of the greatest beefs I have with Nagios (see web server, or a host which serves multiple services on different IP addresses). Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- EditLive Enterprise is the world's most technically advanced content authoring tool. Experience the power of Track Changes, Inline Image Editing and ensure content is compliant with Accessibility Checking. http://p.sf.net/sfu/ephox-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How to Monitor VMware ESX Server and Storage Units
On Wed, Jan 05, 2011 at 10:38:31AM -0500, Max Hetrick wrote: On 01/05/2011 10:28 AM, Marc Haber wrote: Are there other possibilities of connecting Nagios to ESX and MSA devices? I'd appreciate your input. Have a look at the check_esx.pl plugin. You can do quite a bit with it. About the same that I can already monitor using SNMP. Judging from the web page, it doesn't allow finding out whether we still have all power supplies and all local disks. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How to Monitor VMware ESX Server and StorageUnits
On Wed, Jan 05, 2011 at 11:02:46AM -0500, James Pratt wrote: hi, you mention you are using HP's management agents on ESX - if so, check out these SNMP plugins to replace check_snmp - http://nagios.manubulon.com/ What do these plugins better than the stock snmp plugins? I'm not sure if you can get any SMART drive status from snmpd/HP, but you can check RAID status on local disks, and there are also ways to check ESX datastore free space usage too. Power Supply Status? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] How to Monitor VMware ESX Server and Storage Units
Hi, up to now, I have usually been monitoring real Unix systems: One installs a bunch of Nagios plugins directly on the target systems and monitors away. Now, I have been asked to monitor a cluster of VMware ESX servers and a fibrechannel-based HP MSA Storage Unit. On the MSA, I cannot install any software at all, and on the ESX servers, I'd rather not install any software since this endangers VMware support just in case something would act up. The site in question is very interested in being able to yell at their vendors if something doesn't work. Both ESX and MSA can speak SNMP, but both boxes are missing information in their SNMP data set that I'd really love to monitor: Both boxes' SNMP sets are missing information about power supply status and whether we have any failed hard disks. I have installed all MIBs that VMware and HP have delivered, and full snmpwalk output does not contain any numerically written OIDs. But there is nothing identifying whether a hard disk or a power supply is even online, and I'd have expected to find a hard disk's SMART info at least partially in the data set which is pollable via SNMP. Are there other possibilities of connecting Nagios to ESX and MSA devices? I'd appreciate your input. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Service Escalation via Host Group
On Sat, Jan 01, 2011 at 03:19:24PM +0800, Yueh-Hung Liu wrote: if only services of norm-notify should be escalated, host_name * is no need. That was helpful, thanks! Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Service Escalation via Host Group
On Fri, Dec 31, 2010 at 12:06:21PM +0800, Yueh-Hung Liu wrote: but by your configuration, all services will be escalated, not only services of norm-notify. Why? And how do I configure nagios to do what I want? define serviceescalation { host_name * service_description * hostgroup_name norm-notify contact_groups admins-mail first_notification 1 last_notification 0 notification_interval 120 escalation_options w,u,c,r escalation_period 24x7 } define serviceescalation { host_name * service_description * hostgroup_name norm-notify contact_groups admins-sms first_notification 2 last_notification 0 notification_interval 120 escalation_options w,u,c,r escalation_period 24x7 } Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Service Escalation via Host Group
On Fri, Dec 31, 2010 at 11:46:04AM +0100, Hugo van der Kooij wrote: On Fri, 31 Dec 2010 11:15:29 +0100, Marc Haber mh+nagios-us...@zugschlus.de wrote: On Fri, Dec 31, 2010 at 12:06:21PM +0800, Yueh-Hung Liu wrote: but by your configuration, all services will be escalated, not only services of norm-notify. Why? And how do I configure nagios to do what I want? The question you have not answered is wether you have true regex matching set. $ grep use.*regexp nagios.cfg use_regexp_matching=0 use_true_regexp_matching=0 $ If that is the case you need to use .* instead of * as true wildcard name. If that were the case, the identical hostescalation configuration wouldn't work either. Or am i missing something? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Service Escalation via Host Group
Hi, I would like to have host and service escalations for all hosts that are in a certain host group. The following hostescalation definition seems to work fine: define hostescalation { hostgroup_name norm-notify contact_groups admins-mail first_notification 1 last_notification 0 notification_interval 120 escalation_options d,u,r escalation_period 24x7 } define hostescalation { hostgroup_name norm-notify contact_groups admins-sms first_notification 2 last_notification 0 notification_interval 120 escalation_options d,u,r escalation_period 24x7 } The analogous service escalation definition is ignored: define serviceescalation { host_name * service_description * hostgroup_name norm-notify contact_groups admins-mail first_notification 1 last_notification 0 notification_interval 120 escalation_options w,u,c,r escalation_period 24x7 } define serviceescalation { host_name * service_description * hostgroup_name norm-notify contact_groups admins-sms first_notification 2 last_notification 0 notification_interval 120 escalation_options w,u,c,r escalation_period 24x7 } When a service on a host which is member of host group norm-notify goes down, notifications go out to the contact group that is mentioned in the service definition. The service escalation definition seems to be ignored. Any idea what might be going wrong? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Host/service escalation notification
On Tue, Dec 28, 2010 at 04:19:21PM +0200, Дмитрий Леоненко wrote: I want to create several levels of host and service escalation. Say 3 levels. In notification I want to know on which escalation level this particular notification occurred. Can't find any variable reflecting escalation level. Escalation levels are connected to the notification number, and escalations can be kind of orthogonal. Do the Notification Number ($SERVICENOTIFICATIONNUMBER$ and/or $HOSTNOTIFICATIONNUMBER$) macros the job? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NAGIOS_ environment variables in a notification script
Hi, I am trying to write a notification script which is called via the following command definition: define command { command_namenotify-service-by-email command_line/path/bin/notify --mail=$CONTACTEMAIL$ } To save myself from handing in all macros to the script via the command line, I'd like to use the macros that are written to the environment as NAGIOS_foo. Despite having set enable_environment_macros=1 in my nagios.cfg, the notification script only sees NAGIOS_PLUGIN=/path/bin/notify. What am I doing wrong? I'm using Nagios 3.0.6 from Debian lenny. Any hints will be appreciated. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- Forrester recently released a report on the Return on Investment (ROI) of Google Apps. They found a 300% ROI, 38%-56% cost savings, and break-even within 7 months. Over 3 million businesses have gone Google with Google Apps: an online email calendar, and document program that's accessible from your browser. Read the Forrester report: http://p.sf.net/sfu/googleapps-sfnew ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_multi advice to check Internet available
Hi, On Mon, May 24, 2010 at 04:58:21PM -0700, Gordon Messmer wrote: On 05/24/2010 02:51 PM, Marc Haber wrote: On Mon, May 24, 2010 at 03:37:13PM -0400, Matt Simmons wrote: Have you considered adding your upstream router as a host, and making it the parent of your remote hosts? Yes, but that's a workaround which will misfire in the case where it's the own ISP that's faulty at its uplik. If that's something you're concerned about, your check_multi will also misfire when the problem is at the uplink for the machines that you're monitoring. Monitoring a gateway and making it the parent is in no way a workaround. An ideal installation for monitoring WAN-connected hosts would monitor your default gateway and make that the parent of all hosts. You'd also monitor the router at each remote site and make that the parent of all hosts at that site. I know all this. My first Nagios experiences date back to when it was still called Netsaint. Of course, the default gateway, the CPE and the COE of the local ISP are monitored. But that's all that can be comfortably monitored since the Internet uses dynamic routing and the paths are subject to change without further notice. Furthermore, a lot of housing providers configure their backbone and access routers to not respond to pings, so it's a challenge to monitor these sites. http://blog.zugschlus.de/archives/770-Nagios,-Parent-Hosts,-and-traceroute-on-the-Internet.html has my thoughts on that matter (thankfully written in English). Be aware that this article is more than a year old, so things have changed since then. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] check_multi advice to check Internet available
Hi, I am checking a number of servers which are on the Internet, and are thus likely to become unreachable for my Nagios installations in case of an outage out there on the Internet. I would like to have a host which I could use as parent host for all those servers so that I don't get buried under an avalanche of HOST DOWN alarms should my Internet go partly or fully down. I have investigated using check_multi do help me with that, and have come up with a solution which can help me with this task. I would like to solicit your comments to that solution. I have defined a number of hosts which are checked by Nagios proper, just as normal hosts are. Their individual notifications may be switched off, but I haven't done so since I like to see what's going on at least in the start phase. Their hostnames all start with mh-internet, so I can use a single check_multi config line to pull them all into a single check_multi check. check_multi is then used as service and host check on a virtual host called internet, which goes CRITICAL when more than one mh-internet-foo host is CRITICAL. Here is my detailed configuration: /etc/nagios3/conf.d/internet.cmd: statusdat [ internet ] = /^mh-internet-.*$/:my-defaultping state[WARNING] = COUNT(WARNING) 1 state[CRITICAL] = COUNT(CRITICAL) 1 state[UNKNOWN] = COUNT(UNKNOWN) 1 Nagios configuration: define host { use mh-host host_name mh-internet alias internet address 127.0.0.1 parents localhost check_command check_multi!-f /etc/nagios3/conf.d/internet.cmd } define service { namemy-internet use my-service service_description my-internet check_command check_multi!-f /etc/nagios3/conf.d/internet.cmd host_name mh-internet } define command { command_namecheck_multi command_line/usr/lib/nagios/plugins/check_multi $ARG1$ $ARG2$ $ARG3$ $ARG4$ } define host { use mh-host host_name mh-internet-torres hostgroups mh, ping alias torres.zugschlus.de address 85.214.68.41 parents mh-internet } define host { use mh-host host_name mh-internet-heise hostgroups mh, ping alias www.heise.de address www.heise.de parents mh-internet } define host { use mh-host host_name mh-internet-google-de hostgroups mh, ping alias www.google.de address www.google.de parents mh-internet } I am wondering whether this is a valid approach, or whether there are better solutions for this challenge. If somebody familiar with check_multi configuration, I would like to know whether there are any possible issues hiding in my use of check_multi. Any comments will be appreciated. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_multi advice to check Internet available
On Mon, May 24, 2010 at 03:37:13PM -0400, Matt Simmons wrote: Have you considered adding your upstream router as a host, and making it the parent of your remote hosts? Yes, but that's a workaround which will misfire in the case where it's the own ISP that's faulty at its uplik. Also, remember to take off the 'u' notification flag of the uplink's children, otherwise you'll still be buried under a pile of notifications. Sure. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] re-execute an event handler just in case the service stays DOWN
Hi Ludse, this issue was completely lost in space, sorry about that. [quoting fixed to get things back in context] On Thu, Jan 22, 2009 at 06:05:00PM +0100, Ludse Verhoeven wrote: Marc Haber wrote: I would like one event handler to be re-executed just in case it failed for the first time. The docs (http://nagios.sourceforge.net/docs/3_0/eventhandlers.html) say you don't want that, but I am fairly confident that I want it in this case. I forgot to mention that this is a host event handler. I have a nagios host in a unattended site which generally gets its internet access from the site network. In case of a site network failure, the host needs to dial in to the Internet via modem to be reachable for debugging purposes. The nagios installation monitors a host on the Internet and has an event handler connected with this host which will initiate the dial-up when the Internet host goes into a HARD DOWN state. I would like this event handler to be re-executed in case the dial-up connection doesn't come up or goes down _before_ nagios has detected that the Internet host is UP again. Is there any way to accomplish this or am I better of with a cronjob which will initiate the dial-up from outside nagios? Maybe the is_volatile option is something for you (see: http://nagios.sourceforge.net/docs/3_0/volatileservices.html) Unfortunately, is_volatile seems to be a service only option, and I am checking the host. When I define a service to that host which has the is_volatile option set, the service event handler is never called since the service is not checked again once the host is in DOWN state. Any additional hints? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- OpenSolaris 2009.06 is a cutting edge operating system for enterprises looking to deploy the next generation of Solaris that includes the latest innovations from Sun and the OpenSource community. Download a copy and enjoy capabilities such as Networking, Storage and Virtualization. Go to: http://p.sf.net/sfu/opensolaris-get ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] re-execute an event handler just in case the service stays DOWN
On Fri, Jan 23, 2009 at 06:41:20AM +0100, Joerg Linge wrote: just use a notification to fire the eventhandler script. Disadvantage: only triggered on HARD states. Advantage: notification_interval to re-execute the script. That would prevent me from actually notifying. At least if one does not want to be notified once a minute ;) Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- This SF.net email is sponsored by: SourcForge Community SourceForge wants to tell your story. http://p.sf.net/sfu/sf-spreadtheword ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] re-execute an event handler just in case the service stays DOWN
Hi, I would like one event handler to be re-executed just in case it failed for the first time. The docs (http://nagios.sourceforge.net/docs/3_0/eventhandlers.html) say you don't want that, but I am fairly confident that I want it in this case. I have a nagios host in a unattended site which generally gets its internet access from the site network. In case of a site network failure, the host needs to dial in to the Internet via modem to be reachable for debugging purposes. The nagios installation monitors a host on the Internet and has an event handler connected with this host which will initiate the dial-up when the Internet host goes into a HARD DOWN state. I would like this event handler to be re-executed in case the dial-up connection doesn't come up or goes down _before_ nagios has detected that the Internet host is UP again. Is there any way to accomplish this or am I better of with a cronjob which will initiate the dial-up from outside nagios? Any hints will be appreciated., Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 -- This SF.net email is sponsored by: SourcForge Community SourceForge wants to tell your story. http://p.sf.net/sfu/sf-spreadtheword ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] SMS Gateway
On Sat, Jun 21, 2008 at 11:36:33AM +0600, Shoaibi wrote: Well i cannot buy an sms gateway by myself, What keeps you from plugging an UMTS card into a notebook and using the smstools package? are there any alternates? I search for some open source gateway software which i could setup, but i couldnt manage to get a url like the gateway sellers i.e. clickatell, gives that we use in notify with sms plugin. Any solution or links? You'd need to write some interface code for nagios, but that's only a few lines of script in any case. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Log rotation with Logrotate
Hi, Packaging Nagios for Debian means doing things in a way that users might expect them to be done to avoid confusion. One of these expected things is that logs are rotated using logrotate, which allows people to specify, for example, how many log generations are to be held. Unfortunately, to do so, we need to disable nagios' built-in log rotation or we get double rotated logs. Is it possible to tell nagios not to rotate the logs? How will nagios react when logrotate moves the log file away? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Debian Bug #401738: statuswml.cgi cannot schedule check commands
Hi, this is from the Debian BTS, http://bugs.debian.org/401738: | When trying to re-schedule a check for a service from the wml pages I | get the following error message: Start time must be non-zero. If I | manually submit start_time=epoch to cmd.cgi the commands works fine. | The same method works fine from the normal status.cgi. The bug submitter confirms that this still holds for Nagios 2.6. I report this bug here because the Nagios project does not seem to have a publicly visible bug tracker. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] User-defined host/service attributes
On Sat, Oct 14, 2006 at 03:11:02PM -0400, John P. Rouillard wrote: Nagios 3.0 has native support for this. But Nagios 3.0 is not yet released. I use a simple wrapper script and define things like: Do I see correctly that your wrapper script is a workaround for Nagios 2.x? I surely hope that the way of solving this with Nagios 3 is a little less ugly. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] How to not send out first service notifications?
Hi, I am monitoring some hosts on the Internet for informational reasons. Since these hosts quite frequently have failed services, I'd like my Nagios to refrain from notifying me if a service is down at the first notification. Subsequent notifications, however, should be sent out. Is there a way to do this any easier than having no notifications set in the service definition and have a service escalation having the list of contacts that used to be in the service definition? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Service escalation for service groups?
Hi, in the Nagios 2.x docs, a serviceescalation item can be configured for a host name and a service description. Is there any possibility to define escalation items that automatically apply for all members of a service group? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Service escalation for service groups?
On Thu, Oct 19, 2006 at 11:12:56AM +0200, [EMAIL PROTECTED] wrote: [EMAIL PROTECTED] schrieb am 19.10.2006 11:08:58: in the Nagios 2.x docs, a serviceescalation item can be configured for a host name and a service description. Is there any possibility to define escalation items that automatically apply for all members of a service group? Example: define serviceescalation { servicegroup_name WUT-SERVICEGROUP first_notification 1 last_notification 0 contact_groups HOST-CONTACTGROUP-SMS,HOST-CONTACTGROUP-MAIL notification_interval 10 escalation_period 24x7 escalation_options w,c,r } Ah. That possiblity is not in the docs. Thanks. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Antwort: How to not send out first service notifications?
On Thu, Oct 19, 2006 at 11:10:56AM +0200, [EMAIL PROTECTED] wrote: [EMAIL PROTECTED] schrieb am 19.10.2006 11:04:19: I am monitoring some hosts on the Internet for informational reasons. Since these hosts quite frequently have failed services, I'd like my Nagios to refrain from notifying me if a service is down at the first notification. Subsequent notifications, however, should be sent out. Is there a way to do this any easier than having no notifications set in the service definition and have a service escalation having the list of contacts that used to be in the service definition? If you always only want the 2nd notification, then your approach sounds wrong. You shouldn't suppress the first notification always, but instead maybe raise the number of consecutive failed checks until you throw a hard state, so you do not get too many false warnings. Actually, it is fine (and desired) for the service to show up in the web interface as DOWN, and that the history is correctly written. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] User-defined host/service attributes
Hi, recently, I was confused by my Nagios reporting a host DOWN event while the host was alive and kicking. After debugging for a while, I found out that the host had changed IP addresses and nagios was, of course, still checking the old address since it was configured to check the IP address. To catch such error situations earlier, I'd now like to have Nagios check that the host name indeed still points to the IP address we're actually checking. This is easy enough: define service { use my-service hostgroup_name hostnameip service_description my_hostnameip check_command check_hostnameip } define command { command_name check_hostnameip command_line $USER1$/check_dns --hostname=$HOSTNAME$ --expected-address=$HOSTADDRESS$ } This, however, means that one has to have the DNS name of the host in the host_name field: define host { use my-linux-host host_name ipdesc.iface.host.loc.mydomain.example hostgroups defaultping, snmp-w68, ssh, hostnameip alias foo server running bar and baz services, colocated in blu address 10.1.1.47 parents localhost } This is a bit clumsy since our host names are complicated and lengthy. I'd like to have something like define command { command_name check_hostnameip command_line $USER1$/check_dns --hostname=$CUSTOMATTR$ --expected-address=$HOSTADDRESS$ } define host { use my-linux-host host_name host.mydomain.example custom_attributeipdesc.iface.host.loc.mydomain.example hostgroups defaultping, snmp-w68, ssh, hostnameip alias foo server running bar and baz services, colocated in blu address 10.1.1.47 parents localhost } If this possible in Nagios2? If not, are there other ideas how to solve this? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NRPE vs NCSA
On Wed, Sep 13, 2006 at 12:14:30PM +0100, Hari Sekhon wrote: Does anybody have any advice as to which I should go with? What have you used and what are your experiences of this? I'd take a security-based approach. I do not feel too comfortable with a server holding valueable or vital data is open to connection from the nagios host, which, as a host running a web interface, might be more easily compromised than the actual server. This is a minus for NRPE. With NSCA, the actual server would initiate the connection to the Nagios host which decreases security exposure. otoh, NSCA has a gap in data flow as nsca_send wants to see the output in a different format than the plugins put out. There are a bunch of shell scripts around that translate between the plugin and nsca_send, but these are like foreign objects in the nagios setup in my feelings. I think that there should be a glue binary written in C that calls the plugin and then hands over the results to nsca_send. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] what mail server to use?
On Sat, Jul 29, 2006 at 08:50:27PM -0400, nagios wrote: Does anyone have any recommendations on what mail server to use? I'm planning on going with Sendmail, but I know that there are a few others out there. Any suggestions are welcome. I'd go with whatever is the default on your OS. Being the maintainer for Debian's exim packages, I am kind of biased. If you have a working mailserver installation on the network, you might want to go with one of the featureless mailers like ssmtp or nullmailer. If so, take care what your local mail server does when the smtp server is not reachable. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Parsing a Nagios 2 configuration file from perl
Hi, I'd like to have a list of all host_name directives in host definitions in a nagios 2 template style configuration in a perl script. Did anybody already write a nagios 2 configuration file parser in perl? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 --- All the advantages of Linux Managed Hosting--Without the Cost and Risk! Fully trained technicians. The highest number of Red Hat certifications in the hosting industry. Fanatical Support. Click to learn more http://sel.as-us.falkag.net/sel?cmd=lnkkid=107521bid=248729dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nagios-to-visio?
On Thu, Mar 30, 2006 at 02:56:52PM -0600, Rusty Hall wrote: Try looking at nagvis it doesn't automate it but very simple to creat custom views Unfortunately, the suits insist on using that visio tool :-( Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 --- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] I have a problem!!! Help
On Mon, Feb 13, 2006 at 11:33:13AM -0300, João Paulo Vaz wrote: Internal Server Error The server encountered an internal error or misconfiguration and was unable to complete your request. Please contact the server administrator, [EMAIL PROTECTED] and inform them of the time the error occurred, and anything you might have done that may have caused the error. More information about this error may be available in the server error log. _ Apache/2.0.53 (Fedora) Server at 200.252.90.27 Port 80 It will be that somebody could feel a help, because I already tried of everything and I don't get to solve. Well, I have a gut feeling that more information about this error might be available in the server error log. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] I have a problem!!! Help
On Mon, Feb 13, 2006 at 11:49:21AM -0300, João Paulo Vaz wrote: [Mon Feb 13 12:24:40 2006] [error] [client 200.252.90.21] (13)Permission denied: exec of '/usr/lib/nagios/cgi/extinfo.cgi' failed, referer: http://200.252.90.27/nagios/side.html[Mon Feb 13 12:24:40 2006] [error] Looks like a permission problem on /usr/lib/nagios/cgi/extinfo.cgi. Sheesh. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Two questions about outage processing
Hi, I do not have too much Experience with Nagios, and my Netsaint experience is like three years unused. I have the following issues: (1) When hostA is parent host of hostB, hostA is down, and that outage is acknowledged - why does hostsB show up as Unreachable, Unhandled Problem in the tactical overview? I'd expect the acknowledgement to ACK the unreachable host as well. Otherwise, one would be with a _lot_ of unhandled problems in case of big outages which could mask out more important outages somewhere else. (2) When I see a service in CRITICAL soft state (thus, no notification has gone out yet), I cannot ACK that state (even if I know that the service is going to stay CRITICAL for a while and the problem is already being worked on). IMO, it would be good to be able to ACK even SOFT states. Am I missing something? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Why the distinction between host definition and hostextinfo definition?
On Tue, Feb 07, 2006 at 07:27:47AM +0100, Frederik Vanhee wrote: Why don't you use the hostgroup_name in the hostextinfo definition ? Because I didn't know that hostgroup_name was possible in hostextinfo. It is not documented in xodtemplate.html for 2.0rc2, but has now been verified to work. Thanks for pointing this out. The availability of hostgroup_name in hostexinfo should definetely be in the docs for the 2.x release. Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Why the distinction between host definition and hostextinfo definition?
Hi, to me, the distinction between host definition and hostextinfo definition seems artificial, awkward and clumsy - and it prevents some of the more powerful features from being used with the information that belongs into hostextinfo. Hostextinfo doesn't seem to support templates, and it doesn't seem to be possible to set information that is set as hostextinfo from a host group or via wildcards. This makes it extraordinarily hard to - for example - set host icons for the status map, since it needs a dedicated hostextinfo definition setting icon_image and statusmap_image for each host which might have all other properties set from a template. Is there any reason for keeping this distinction and not including hostextinfo into main configuration? Am I missing something here? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Service on an IP address different from the host
Hi, some systems I am responsible for have multiple IP addresses, with services that are not running on all IP addresses. I thus need service definitions that don't check the service on the host address, but on a dedicated service address while still being associated to the host which has a different address. I have come up with various ways to do this, each of them ugly, but differently so. (1) Duplicate each command definition for a service check like define command{ command_namecheck_http command_line$USER1$/check_http -H $HOSTADDRESS$ } define command{ command_namecheck_http_addr command_line$USER1$/check_http -H $ARG1$ } (2) One other idea I have come up with would be having a thing like an optional argument, which could be configured like define command{ command_namecheck_http command_line$USER1$/check_http -H $IFDEF $ARG1$$ $ARG1$ $ELSE$ $HOSTADDRESS$ $ENDIF$ } (3) A different approach (which is what I did back in netsaint days) was a metaplugin, which allows command configuration like define command { command_namecheck_http command_line$USER2$/modify_address $HOSTADDRESS$-$ARG1$ $USER1$/check_dns_random modify_address } which allows service definitions like define service { ... check_command check_http } which checks the service on $HOSTADDRESS$ and define service { ... check_command check_http!192.168.130.5 } which checks the service on 192.168.130.5. In the example, the metaplugin replaces the occurence of the string modify_address with $HOSTADDRESS$ if $ARG1$ is empty, and $ARG1$ otherwise. This has the advantage of having a very straightforward service configuration, at the price of having a confusing and complex command definition which does not use nagios standard mechanisms. (4) The last approach would be to have host definitions for both IP addresses, but that would probably be confusing for notification recipients who are not so intimately familiar with system architecture. Which approach would, in your opinion, be appropriate for a new installation? I am currently inclined towards (4). What is your opinion? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Service on an IP address different from the host
On Thu, Feb 02, 2006 at 11:48:49AM -0800, Guy B. Purcell wrote: Regarding recipient confusion with this method, you'll notice my sample defs used a common alias. The alias is what gets reported, so recipients wouldn't see different hostnames--just different service names. Wouldn't they then be looking for the wrong host in the web interface? Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How to check Internet access and/or DNS?
On Wed, Jan 25, 2006 at 04:34:34PM +1300, Steve Shipway wrote: So many different ways to do this. We do it thusly. 1) Set up a host for your border router. 2) Within this, sefine a number of http check services that try to collect web pages from popular internet sites ( google, microsoft, ibm...). Disable notifications on them. 3) Use check_summary (from nagiosexchange.org) to define a service which is OK if any of the http check services are OK, and critical if they are all down. That's a quite neat idea. I like that. Thanks! Greetings Marc -- - Marc Haber | I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things.Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 621 72739835 --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null