Re: [Nagios-users] Problem: Server is UP??
> I really need some help with this, guys. My setup was working for the longest time and I can't think of any changes made to it that would cause this behavior. I get random (and quite annoying!) Nagios alerts that say "Problem: $SERVER is UP!" (with the actual host name), or sometimes it just literally says "Problem: HOST is UP". Note, this only happens during my off hours. I've never seen this during the day.

You have probably configured your contact details to send a RECOVERY alert for hosts when it is out of hours. This means you will get a 'host is UP' alert when the host status changes to UP. Your message format probably prepends the 'Problem: ' to it.

Admittedly, your configuration doesn't look like this is the case. I suppose you should check the message headers and make sure the message really originated on your host, and check that you don't have two versions of Nagios running (did you have an old or test instance set up that might be sending these emails?).

Steve

-
Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] SMS notifications
> On Jun 16, 2008, at 11:06 AM, Luc MAIGNAN wrote:
>> Isn't there a free way to send SMS via Nagios?
>
> Most cell phone companies have an e-mail to SMS gateway service.

Count yourselves lucky: here in New Zealand they charge you to use it in either direction. You need to pay to allow people to email your phone, and/or you need to pay to allow yourself to email to SMS. New Zealand mobile operators really screw the customer. Prices are astronomical compared to Europe or the USA.

The cheapest way for us was to get a phone and connect it to a Linux box, making our own email to SMS gateway. Of course, you mustn't tell the phone company you're doing this, or else they want to charge you on a different (much higher) billing plan...

Steve
Re: [Nagios-users] SMS notifications
For free SMS you can use ICQ SMS (if available for your country).

I have connected a cell phone (Nokia) to a Windows XP box:

- Installed Nokia PC Suite
- Installed Microsoft SMS Sender
- Enabled telnet on the Windows box
- Created a BAT file with the SMS Sender syntax
- Used BAT2EXE to convert the file to an EXE
- Created an expect script on the Linux box to telnet to the Windows box and execute the SMS script

My opinion is that using a cell phone is better than some external SMS gateway, because if your internet connection fails, how will you reach the external SMS gateway?

-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Steve Shipway
Sent: Tuesday, June 17, 2008 09:31
To: 'Nagios Users Mailinglist'
Subject: Re: [Nagios-users] SMS notifications

> On Jun 16, 2008, at 11:06 AM, Luc MAIGNAN wrote:
>> Isn't there a free way to send SMS via Nagios?
>
> Most cell phone companies have an e-mail to SMS gateway service.

Count yourselves lucky: here in New Zealand they charge you to use it in either direction. You need to pay to allow people to email your phone, and/or you need to pay to allow yourself to email to SMS. New Zealand mobile operators really screw the customer. Prices are astronomical compared to Europe or the USA.

The cheapest way for us was to get a phone and connect it to a Linux box, making our own email to SMS gateway. Of course, you mustn't tell the phone company you're doing this, or else they want to charge you on a different (much higher) billing plan...

Steve
[Nagios-users] Antwort: Problem: Server is UP??
[EMAIL PROTECTED] wrote on 17.06.2008 04:28:23:

> I really need some help with this, guys. My setup was working for the longest time and I can't think of any changes made to it that would cause this behavior. I get random (and quite annoying!) Nagios alerts that say "Problem: $SERVER is UP!" (with the actual host name), or sometimes it just literally says "Problem: HOST is UP". Note, this only happens during my off hours. I've never seen this during the day.
>
> I'm running Nagios on Debian Etch and I've tried version 3.02 from the backports and that had the same issue. It has to be my config but I can't figure out for the life of me what it could possibly be! Any help you can give me will be SO much appreciated! I'm going to attach my configs. Thanks everyone.

After looking through the config, I cannot find any culprits which would cause such behaviour. Maybe you have some bogus notification commands? The "Problem: ... is UP" doesn't make sense to me and does not correspond to the default configuration. Recovery alerts should start with "Recovery: ...".

Please post your host-notify-by-email command definitions.

Regards
Sascha

--
Sascha Runschke
Network and Systems Management
Phone: +49 (201) 102-1879
Mobile: +49 (173) 5419665
Fax: +49 (201) 102-1102105
GFKL Financial Services AG
Executive Board: Dr. Peter Jänsch (Chair), Jürgen Baltes, Dr. Till Ergenzinger, Dr. Tom Haverkamp
Chairman of the Supervisory Board: Dr. Georg F. Thoma
Registered office: Limbecker Platz 1, 45127 Essen, Amtsgericht Essen, HRB 13522
Re: [Nagios-users] How to monitor windows software raid
Hello Hari.

> Have you checked NagiosExchange.org? I didn't see anything, but then I didn't look too closely. Otherwise you could just write one. VBS is an easy language and universal enough for this. I'd write this in fact, if I had any servers poor enough to have Windows software RAID...

Of course I've checked NagiosExchange, but there are only plugins for hardware RAID.

I've found a VB script which uses DISKPART (http://www.anchor.com.au/hosting/dedicated/monitoring_windows_software_raid). Unfortunately DISKPART didn't work on Windows 2000 (only Win 2000 Pro, Win 2000 server, Win 2003 ...)

But I've got a list of event IDs, so I can monitor the event log and do passive checks. That is better than nothing...

Greetings,
Rene

--
Tips & Tricks for Linux, MySQL, Web Design - http://home.telebel.de/~referti/blog/
[Nagios-users] Problem with timeperiods with exact days (Nagios 3.0)
Hello,

I am using nagios-3.0 (with a patched trends.cgi from 3.0.1).

Yesterday, I added Easter dates to the German holidays (etc/objects/timeperiods.cfg):

define timeperiod{
        name                    de-holidays
        timeperiod_name         de-holidays
        alias                   German Holidays
        january 1               00:00-00:00     ; Neujahr
        january 6               00:00-00:00     ; Heilige drei Koenige
        may 1                   00:00-00:00     ; Maifeiertag
        october 3               00:00-00:00     ; Tag d. dt. Einheit
        november 1              00:00-00:00     ; Allerheiligen
        december 25             00:00-00:00     ; 1. Weihnachtstag
        december 26             00:00-00:00     ; 2. Weihnachtstag
        2009-04-10              00:00-00:00     ; Karfreitag
        2009-04-13              00:00-00:00     ; Ostermontag
        2009-05-21              00:00-00:00     ; Christi Himmelfahrt
        2009-06-01              00:00-00:00     ; Pfingstmontag
        2009-06-11              00:00-00:00     ; Fronleichnam
        ... (2010/2011/2012 added as well)
        }

However, when I look in the web interface under View Config -> Object Type: Timeperiods, I get a different list, with the months off by minus 2:

de-holidays    German Holidays
        ...
        ...
        2009-04-11 00:00:00 - 00:00:00
        2009-04-01 00:00:00 - 00:00:00
        2009-03-21 00:00:00 - 00:00:00
        2009-02-13 00:00:00 - 00:00:00
        2009-02-10 00:00:00 - 00:00:00

(2009-04-11 appears instead of 2009-06-11, and so on.)

Something is wrong here. I think the web interface should increment the internal representation by 1 (internal months 0-11 mean Jan-Dec), but this does not explain the offset of 2. According to the Changelog, I cannot see that this has been fixed in 3.0.2 (or has it?).

Regards,
Reiner

--
Reiner Huober (Extern) [EMAIL PROTECTED] +49(0)731 9533-1760
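The offset described in this report can be checked with a few lines of Python. This is only a sketch that compares the five 2009 dates from the configuration with the five dates shown by the web interface, both copied from the message; the helper name `date_deltas` is made up for illustration and has nothing to do with the Nagios code base.

```python
from datetime import date

# Dates as configured in timeperiods.cfg (from the message above).
configured = [date(2009, 4, 10), date(2009, 4, 13), date(2009, 5, 21),
              date(2009, 6, 1), date(2009, 6, 11)]

# Dates as listed by the web interface (from the message above).
displayed = [date(2009, 4, 11), date(2009, 4, 1), date(2009, 3, 21),
             date(2009, 2, 13), date(2009, 2, 10)]


def date_deltas(expected, shown):
    """Pair the dates in chronological order and return (month, day) deltas."""
    pairs = zip(sorted(expected), sorted(shown))
    return [(s.month - e.month, s.day - e.day) for e, s in pairs]


deltas = date_deltas(configured, displayed)
print(deltas)
```

Every pair differs by exactly two months while the day of the month is unchanged. A single missing "+1" on a 0-based month would shift the display by one month only, so the data is consistent with the poster's remark that one off-by-one error alone does not explain what the web interface shows.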
Re: [Nagios-users] How to monitor windows software raid
Rene Fertig wrote:
> Hello Hari.
>
>> Have you checked NagiosExchange.org? I didn't see anything, but then I didn't look too closely. Otherwise you could just write one. VBS is an easy language and universal enough for this. I'd write this in fact, if I had any servers poor enough to have Windows software RAID...
>
> Of course I've checked NagiosExchange, but there are only plugins for hardware RAID.
>
> I've found a VB script which uses DISKPART (http://www.anchor.com.au/hosting/dedicated/monitoring_windows_software_raid). Unfortunately DISKPART didn't work on Windows 2000 (only Win 2000 Pro, Win 2000 server, Win 2003 ...)
>
> But I've got a list of event IDs, so I can monitor the event log and do passive checks. That is better than nothing...
>
> Greetings,
> Rene

If you can do VBS, it should be possible to do this without DISKPART by using WMI classes.

On the other hand, perhaps you can just consider this a legacy problem; after all, Windows 2000 Server should be retired or at least on its way out by now...

-h

--
Hari Sekhon
[Nagios-users] Monitoring large (ish) numbers of servers with exceptions to the rules...
Hi All,

I currently help maintain and monitor around 50 servers across various parts of the UK using Nagios 2.

At the moment, we have a configuration file for each host (%hostname%.cfg) and in that file we specify all the services for the named host.

We are trying to reduce the number of configuration files as we take on more and more servers, because there are a large number of checks that we need to roll out to all servers and we feel that we are duplicating our workload.

I'm open to ideas on how to achieve this; however, my thoughts were a setup along the lines of the following:

- A master host template is created in which all services are defined for a host.
- If a check does not need to be run for a given host (for example, it is not a web server), a stanza is added to that particular host's config file that effectively tells Nagios "don't check for this service on this host".

I've tried defining all the services in a master templates file and this works perfectly; however, when I come to exclude certain services, I am at a loss on how to do it. Initially I tried adding a stanza with the same service name and "register 0" as one of the options, but this didn't work.

We have used HostGroups in the past to achieve a similar goal; however, we ran into the issue that whilst we need to check the CPU usage on all of the servers, a few of the servers that we monitor can take a lot more of a beating than the majority. This led to us defining the CPU checks on a per-host basis, because when we defined the check separately from the hostgroup for the more powerful servers, we were presented with a load of errors regarding duplicate service names.

I hope I've made myself clear on what we're after and I look forward to receiving your input on this.

Kind regards,
Matt

--
Matt Wallace
[EMAIL PROTECTED]
http://www.truthisfreedom.org.uk/
Re: [Nagios-users] Monitoring large (ish) numbers of servers with exceptions to the rules...
-Original Message-
From: nagios-users On Behalf Of Matthew Macdonald-Wallace
Sent: 17 June 2008 13:14

> I currently help maintain and monitor around 50 servers across various parts of the UK using Nagios 2.
>
> At the moment, we have a configuration file for each host (%hostname%.cfg) and in that file we specify all the services for the named host.
>
> We are trying to reduce the number of configuration files as we take on more and more servers, because there are a large number of checks that we need to roll out to all servers and we feel that we are duplicating our workload.
>
> I'm open to ideas on how to achieve this; however, my thoughts were a setup along the lines of the following:
>
> - A master host template is created in which all services are defined for a host.
> - If a check does not need to be run for a given host (for example, it is not a web server), a stanza is added to that particular host's config file that effectively tells Nagios "don't check for this service on this host".
>
> I've tried defining all the services in a master templates file and this works perfectly; however, when I come to exclude certain services, I am at a loss on how to do it. Initially I tried adding a stanza with the same service name and "register 0" as one of the options, but this didn't work.
>
> We have used HostGroups in the past to achieve a similar goal; however, we ran into the issue that whilst we need to check the CPU usage on all of the servers, a few of the servers that we monitor can take a lot more of a beating than the majority. This led to us defining the CPU checks on a per-host basis, because when we defined the check separately from the hostgroup for the more powerful servers, we were presented with a load of errors regarding duplicate service names.
>
> I hope I've made myself clear on what we're after and I look forward to receiving your input on this.
One thing that I use in the configuration that I maintain is something like this:

define service{
        use                     generic-hung-mounts
        hostgroup_name          experiments
        hosts                   !lcg0448
        contact_groups          experiments
        }

where lcg0448 is a host in host group experiments and I want to apply the generic-hung-mounts check to all hosts in that group except for lcg0448.

This can lead to configuration like this:

define service{
        use                     check-pbs-offline
        hostgroup_name          workers
        hosts                   !lcg0614,!lcg0617,!lcg0618,!lcg0626
        contact_groups          tier1a
        }

define service{
        use                     check-pbs-offline
        hosts                   lcg0614,lcg0617,lcg0618,lcg0626
        contact_groups          tier1a,grid-team
        }

where the only difference is that the hosts in the second definition have a second contact group.

HTH

Jonathan Wheeler
e-Science Centre
Rutherford Appleton Laboratory
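The effect of combining a hostgroup with "!" exclusions can be pictured as simple set arithmetic. The following Python sketch is not Nagios code; `resolve_hosts` and the membership of the `workers` group are assumptions made up for illustration, and only the four excluded host names come from the message above.

```python
def resolve_hosts(hostgroup_members, hosts_directive):
    """Return the sorted hosts one service definition would apply to.

    hostgroup_members: hosts contributed by the hostgroup (may be empty).
    hosts_directive:   comma-separated value of the 'hosts' line; an entry
                       with a leading '!' is excluded, any other is included.
    """
    included = set(hostgroup_members)
    excluded = set()
    for entry in hosts_directive.split(","):
        entry = entry.strip()
        if not entry:
            continue
        if entry.startswith("!"):
            excluded.add(entry[1:])
        else:
            included.add(entry)
    return sorted(included - excluded)


# Hypothetical membership of the 'workers' hostgroup.
workers = ["lcg0610", "lcg0614", "lcg0617", "lcg0618", "lcg0626", "lcg0630"]

# First definition: whole group minus the four special hosts.
general = resolve_hosts(workers, "!lcg0614,!lcg0617,!lcg0618,!lcg0626")
# Second definition: only the four special hosts, no hostgroup.
special = resolve_hosts([], "lcg0614,lcg0617,lcg0618,lcg0626")

print(general)
print(special)
```

With the two definitions written this way, every host in the group receives the service exactly once, which is what avoids the duplicate service errors mentioned in the original question.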
Re: [Nagios-users] About check_udp plugin
Hello,

I just want to confirm whether UDP port 161 is available or not. We don't need to know if the service is up or not.

As you mentioned, I think the best way is to access the agent. However, is there any plugin that just confirms whether UDP port 161 is available or not? If not, I will use check_snmp and access the agent.

Best Regards,
Yu Watanabe

Marc Powell wrote:
> On Jun 16, 2008, at 8:04 PM, Yu Watanabe wrote:
>> Hello, I used the check_udp service, like check_tcp, to check whether the 161 is useful or not. Is this not the right way to use it?
>
> It depends on what you consider useful. It's entirely possible for a port to establish a connection but not actually be working at higher levels (i.e. answering queries). In your case, you've only determined that there's something listening on the port, not that it's snmpd, actually working, or returning data you expect. IMHO, if there's a higher level test that's possible, like check_snmp in this case, that will be a more accurate and useful test.
>
> --
> Marc
[Nagios-users] check_hpasm error
Hi,

I monitor my HP stuff using check_hparray and check_hpasm, and generally things work OK. However, I have noticed that one box keeps alerting that the chassis is not OK, with the following error:

UNKNOWN - hanging hpasmdcli processes

Does anyone know of a workaround for this, apart from perhaps increasing the retry attempts for this service to try to catch it in a good state before the alarm triggers?

Thanks
Re: [Nagios-users] Antwort: Problem: Server is UP??
Here are the notify commands that come with Debian. I don't modify them.

# 'host-notify-by-email' command definition
define command{
        command_name    host-notify-by-email
        command_line    /usr/bin/printf "%b" "* Nagios *\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTALIAS$\nState: $HOSTSTATE$ for $HOSTDURATION$\nAddress: $HOSTADDRESS$\nInfo:\n\n$HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n\nACK by: $HOSTACKAUTHOR$\nComment: $HOSTACKCOMMENT$\n" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ alert $NOTIFICATIONNUMBER$ - $HOSTALIAS$ host is $HOSTSTATE$ **" $CONTACTEMAIL$
        }

# 'host-notify-by-epager' command definition
define command{
        command_name    host-notify-by-epager
        command_line    /usr/bin/printf "%b" "Host '$HOSTALIAS$' is $HOSTSTATE$\nInfo: $HOSTOUTPUT$\nTime: $LONGDATETIME$" | /usr/bin/mail -s "$NOTIFICATIONTYPE$ alert - Host $HOSTNAME$ is $HOSTSTATE$" $CONTACTPAGER$
        }

# 'notify-by-email' command definition
define command{
        command_name    notify-by-email
        command_line    /usr/bin/printf "%b" "* Nagios *\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nState: $SERVICESTATE$ for $SERVICEDURATION$\nAddress: $HOSTADDRESS$\n\nInfo:\n\n$SERVICEOUTPUT$\n\nDate/Time: $LONGDATETIME$\n\nACK by: $SERVICEACKAUTHOR$\nComment: $SERVICEACKCOMMENT$\n" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ alert $NOTIFICATIONNUMBER$ - $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$
        }

# 'notify-by-epager' command definition
define command{
        command_name    notify-by-epager
        command_line    /usr/bin/printf "%b" "Service: $SERVICEDESC$\nHost: $HOSTNAME$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\nInfo: $SERVICEOUTPUT$\nDate: $LONGDATETIME$" | /usr/bin/mail -s "$NOTIFICATIONTYPE$: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$" $CONTACTPAGER$
        }

Thanks for looking into this.

The only other thing I can possibly think of is that the pattern indicates it's only doing this for Solaris servers which are using the NRPE from Blastwave... could this be caused by NRPE?
Mark
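To see what subject lines the posted commands would produce, the macro substitution can be imitated in Python. This is a rough sketch, not how Nagios expands macros internally: `expand_macros` is a made-up helper, the macro values (host `web01`, notification number 1) are hypothetical, and only the two subject templates are copied from the host command definitions in this message.

```python
import re

# Subject templates copied from the posted host notification commands.
EMAIL_SUBJECT = ("** $NOTIFICATIONTYPE$ alert $NOTIFICATIONNUMBER$ - "
                 "$HOSTALIAS$ host is $HOSTSTATE$ **")
PAGER_SUBJECT = "$NOTIFICATIONTYPE$ alert - Host $HOSTNAME$ is $HOSTSTATE$"


def expand_macros(template, macros):
    """Replace each $NAME$ with macros[NAME]; unknown macros stay untouched."""
    return re.sub(r"\$([A-Z]+)\$",
                  lambda m: macros.get(m.group(1), m.group(0)),
                  template)


# Hypothetical macro values for a host recovery notification.
recovery = {
    "NOTIFICATIONTYPE": "RECOVERY",
    "NOTIFICATIONNUMBER": "1",
    "HOSTALIAS": "web01",
    "HOSTNAME": "web01",
    "HOSTSTATE": "UP",
}

email_subject = expand_macros(EMAIL_SUBJECT, recovery)
pager_subject = expand_macros(PAGER_SUBJECT, recovery)
print(email_subject)
print(pager_subject)
```

Under these assumptions neither subject begins with "Problem:", which matches the observation elsewhere in the thread that the text of the unwanted alerts does not correspond to these command definitions.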
Re: [Nagios-users] Problem: Server is UP??
On Tue, Jun 17, 2008 at 06:27:04PM +1200, Steve Shipway wrote:
>> I really need some help with this, guys. My setup was working for the longest time and I can't think of any changes made to it that would cause this behavior. I get random (and quite annoying!) Nagios alerts that say "Problem: $SERVER is UP!" (with the actual host name), or sometimes it just literally says "Problem: HOST is UP". Note, this only happens during my off hours. I've never seen this during the day.
>
> You have probably configured your contact details to send a RECOVERY alert for hosts when it is out of hours. This means you will get a 'host is UP' alert when the host status changes to UP. Your message format probably prepends the 'Problem: ' to it.
>
> Admittedly, your configuration doesn't look like this is the case. I suppose you should check the message headers and make sure the message really originated on your host, and check that you don't have two versions of Nagios running (did you have an old or test instance set up that might be sending these emails?).
>
> Steve

I can confirm 100% that there is no other Nagios instance running. I also can't check the message headers, because these messages arrive through an email to SMS gateway, so I don't have access to the headers when they show up on my phone.

Mark
Re: [Nagios-users] About check_udp plugin
On Jun 17, 2008, at 7:37 AM, Yu Watanabe wrote:
> Hello,
>
> I just want to confirm whether UDP port 161 is available or not. We don't need to know if the service is up or not.
>
> As you mentioned, I think the best way is to access the agent. However, is there any plugin that just confirms whether UDP port 161 is available or not? If not, I will use check_snmp and access the agent.

Yes, check_udp will verify that something is listening on that port.

--
Marc
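Because UDP has no handshake, "something is listening" is harder to establish than with TCP. The sketch below shows one common way to probe a UDP port from Python; it is not how check_udp is implemented, and the function name `probe_udp`, the one-byte payload and the three result strings are assumptions chosen for illustration.

```python
import socket


def probe_udp(host, port, payload=b"\x00", timeout=1.0):
    """Classify a UDP port as 'open', 'closed' or 'open|filtered'.

    'open'          : a datagram came back from the port.
    'closed'        : the OS reported an ICMP port-unreachable error.
    'open|filtered' : nothing came back before the timeout, so the port is
                      either open but silent, or packets are being dropped.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.settimeout(timeout)
        # connect() on a UDP socket only fixes the peer address; it lets the
        # kernel hand ICMP errors for that peer back to this socket.
        sock.connect((host, port))
        try:
            sock.send(payload)
            sock.recv(4096)
            return "open"
        except socket.timeout:
            return "open|filtered"
        except ConnectionError:
            return "closed"


# Example call (hypothetical address): probe_udp("192.0.2.10", 161)
```

A silent timeout is ambiguous, which is one reason a protocol-level check such as check_snmp gives a more reliable answer for an SNMP agent than a bare port probe.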
[Nagios-users] Request to test pst3 on solaris systems
Hi!

This is a request to test the latest pst3 in the nagiosplugins snapshot.

My colleague, Duncan Ferguson, has been making some optimisations so that it runs much faster on systems with a large number of processes: on a Solaris 10 server with multiple zones, it runs about 94% faster!

I've just committed a change that fixes a possible segfault. The best way to get the change is via a snapshot, when it next runs, at http://nagiosplug.sourceforge.net/snapshot

I'd be interested to hear whether the before and after output is the same (within reason), along with timings, the number of processes, and which Solaris versions you tested.

Ton

http://www.altinity.com
UK: +44 (0)870 787 9243
US: +1 866 879 9184
Fax: +44 (0)845 280 1725
Skype: tonvoon
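For anyone collecting the requested before/after numbers, a small Python helper can run both builds and record timing and output. This is only a sketch: the function names `run_timed` and `compare` are made up, and the paths in the usage comment are placeholders for wherever the old and new binaries live on your system.

```python
import subprocess
import time


def run_timed(argv):
    """Run a command and return (elapsed_seconds, stdout_text)."""
    start = time.perf_counter()
    result = subprocess.run(argv, capture_output=True, text=True, check=False)
    return time.perf_counter() - start, result.stdout


def compare(old_argv, new_argv):
    """Run the old and new command once each and summarise the difference."""
    old_seconds, old_out = run_timed(old_argv)
    new_seconds, new_out = run_timed(new_argv)
    return {
        "same_output": old_out == new_out,
        "old_lines": len(old_out.splitlines()),
        "new_lines": len(new_out.splitlines()),
        "old_seconds": old_seconds,
        "new_seconds": new_seconds,
    }


# Example call with placeholder paths:
# compare(["/path/to/old/pst3"], ["/path/to/new/pst3"])
```

Since the process table changes between two runs, an exact match is not guaranteed; the line counts give a rough process count to report alongside the timings.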
Re: [Nagios-users] How to monitor windows software raid
Try this: it is a Python script.

#!/usr/bin/python
# check_swraid - plugin for nagios to check the status of linux swraid devices
#
# This program is free software; you can redistribute it and/or modify
# it under the terms of the GNU General Public License as published by
# the Free Software Foundation; either version 2 of the License, or
# (at your option) any later version.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
# GNU Library General Public License for more details.
#
# You should have received a copy of the GNU General Public License
# along with this program; if not, write to the Free Software
# Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA 02111-1307, USA.
#
# Copyright 2004 Duke University
# Written by Sean Dilda [EMAIL PROTECTED]
# Version: 0.3
# patched with the rouilj's patch (03.02.2006) by Virer (28.07.2006)

import os
import sys
import string

mdstat = '/proc/mdstat'

mdFile = open(mdstat).readlines()

# two cases: two starting lines or just one starting line.
# Remove the first and lasts lines as we don't need them
mdFile = mdFile[1:-1]
if (len(mdFile) % 3) != 0:
    mdFile = mdFile[1:]
    if (len(mdFile) % 3) != 0:
        # must have two starting lines
        print 'Error with mdstat file'
        sys.exit(3)

if len(mdFile) == 0:
    print 'Error no md devices configured'
    sys.exit(3)

# Each device takes three lines; keep the first two of every group
mdData = []
while len(mdFile) > 0:
    mdData.append((mdFile[0],mdFile[1]))
    mdFile = mdFile[3:]

overallStatus = 0
errorMsg = ''
devices = ''
for tup in mdData:
    device, colon, status, type, drives = string.split(tup[0], None, 4)
    drives = string.split(drives)
    # The [n/m] field: configured drives / working drives
    values = string.split(tup[1])[-2]
    values = values[1:-1]
    normal, current = string.split(values, '/')
    normal = int(normal)
    current = int(current)
    devices = devices + " " + device

    # Status of 0 == Ok, 1 == Warning, 2 == Critical
    status = 0
    failed = 0
    degraded = 0
    msg = ''

    failed = []
    for drive in drives:
        if drive[-3:] == '(F)':
            failed.append(drive[:string.index(drive, '[')])
            status = 1
    failed = ' (' + string.join(failed, ', ') + ').'

    if status == 'inactive':
        status = 2
        msg = device + ' is inactive.'

    if type == 'raid5':
        if current < (normal -1):
            msg = device + ' failed' + failed
            status = 2
        elif current < normal:
            msg = device + ' degraded' + failed
            status = 1
    else:
        if current < normal:
            msg = device + ' failed' + failed
            status = 2

    if len(msg) > 0:
        if len(errorMsg) > 0:
            errorMsg = errorMsg + '; '
        errorMsg = errorMsg + msg
        overallStatus = max(overallStatus, status)

if overallStatus == 0:
    print 'All md devices (' + devices + ' ) Ok.'
    sys.exit(0)
else:
    print errorMsg
    sys.exit(overallStatus)
#EOF

Rene Fertig wrote:

Hello Hari.

Have you checked NagiosExchange.org? I didn't see anything but then I didn't look too closely. Otherwise you could just write one. VBS is an easy language and universal enough for this. I'd write this in fact, if I had any servers poor enough to have windows software raid...

Of course I've checked NagiosExchange, but there are only plugins for hardware RAIDs. I've found a VB script which uses DISKPART (http://www.anchor.com.au/hosting/dedicated/monitoring_windows_software_raid). Unfortunately DISKPART didn't work on Windows 2000 (only Win 2000 Pro, Win 2000 server, Win 2003 ...) But I've got a list of event IDs so I can monitor the event log and do passive checks - more than nothing...

Greetings, Rene
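For anyone who wants to see what the posted script is actually reading: a minimal Python 3 sketch of the same idea, run against a text sample instead of the live /proc/mdstat. This is not Sean Dilda's plugin. The names parse_mdstat and status_for and the sample text are made up for illustration, and the sketch only handles active arrays.

```python
import re

# Hypothetical sample of /proc/mdstat content; real output varies by kernel.
SAMPLE = """\
Personalities : [raid1] [raid5]
md0 : active raid1 sdb1[1] sda1[0]
      104320 blocks [2/2] [UU]

md1 : active raid5 sdd1[3](F) sdc1[2] sdb2[1] sda2[0]
      312576 blocks level 5, 64k chunk, algorithm 2 [4/3] [UUU_]

unused devices: <none>
"""


def parse_mdstat(text):
    """Return a list of (device, level, expected, active, failed_drives)."""
    results = []
    lines = text.splitlines()
    for i, line in enumerate(lines):
        m = re.match(r"^(md\d+) : (\S+) (\S+) (.*)$", line)
        if not m or i + 1 >= len(lines):
            continue
        device, _state, level, drives = m.groups()
        # Drives flagged (F) have failed; strip the [n] slot suffix.
        failed = [d.split("[")[0] for d in drives.split() if d.endswith("(F)")]
        # The [n/m] field on the next line: configured / working drives.
        counts = re.search(r"\[(\d+)/(\d+)\]", lines[i + 1])
        if not counts:
            continue
        results.append((device, level, int(counts.group(1)),
                        int(counts.group(2)), failed))
    return results


def status_for(level, expected, active):
    """Nagios code: 0 OK, 1 WARNING, 2 CRITICAL (same rule as the script)."""
    missing = expected - active
    if missing <= 0:
        return 0
    if level == "raid5" and missing == 1:
        return 1  # raid5 survives a single missing drive
    return 2


if __name__ == "__main__":
    for dev, level, expected, active, failed in parse_mdstat(SAMPLE):
        print(dev, level, status_for(level, expected, active), failed)
```

As Hari points out in the reply, none of this applies to Windows software RAID; it only illustrates the Linux md format.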
Re: [Nagios-users] How to monitor windows software raid
No, don't try that. Python is a great language, I code in it myself, but...

1. It is not portable across windows systems without installing add-ons
2. It checks *LINUX* software raid, not windows software raid.

-h

Marcos Marinho wrote:
Try this : It is a python script
Re: [Nagios-users] Install difficulty
This did not work either.

----- Original Message -----
From: Yost, Karl [EMAIL PROTECTED]
To: Jim Miller [EMAIL PROTECTED]; nagios-users@lists.sourceforge.net
Sent: Tuesday, June 17, 2008 11:14:39 AM
Subject: RE: [Nagios-users] Install difficulty

Try using http://localhost/nagios/
The trailing slash may be needed.

-----Original Message-----
From: Jim Miller [EMAIL PROTECTED]
Sent: Tuesday, June 17, 2008 11:11 AM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Install difficulty

Hello everyone,

I have downloaded and installed Nagios per the instructions posted for Fedora. Everything worked according to the instructions; however, when I start Nagios and try to navigate to http://localhost/nagios I get a 404 error. I suspect my webserver is unaware of Nagios' existence, but I am unsure how to begin troubleshooting this problem. Can someone offer some guidance?

Thank you.
Jim
Re: [Nagios-users] Install difficulty
On Jun 17, 2008, at 10:09 AM, Jim Miller wrote:

Hello everyone,
I have downloaded and installed Nagios per the instructions posted for Fedora. Everything worked according to the instructions; however, when I start Nagios and try to navigate to http://localhost/nagios I get a 404 error. I suspect my webserver is unaware of Nagios' existence, but I am unsure how to begin troubleshooting this problem. Can someone offer some guidance? Thank you

Did you 'make install-webconf' and restart apache?

-- Marc
Re: [Nagios-users] Install difficulty
Did you restart apache after you configured the aliases and directory options?

Thanks, Karl

From: Jim Miller [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, June 17, 2008 11:21 AM
To: Yost, Karl; nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Install difficulty

This did not work either.
[Nagios-users] Newbie question about nagios and snmp
Hi all,

I have about 15 servers to monitor; some of them run Windows and the rest run Debian. To do so, I have installed Nagios to send me alarms and Cacti to show graphs and historical data. The things I'd like to monitor for the time being are very basic: CPU usage, disk space, ... To monitor the remote servers I've installed the SNMP daemon. In Cacti I have everything running, but there is one thing in Nagios that I don't know how to do: I'd like to monitor the available space on the server partitions, not on the disks.

1.- Should I create a service for each partition and define a hostgroup? Or is there a way to say "check all the partitions and shout if the available space is lower than 10%"?
2.- Is there a way to ask the free disk of / instead of /dev/sda1?

Thanks all,
-- Arnau
Re: [Nagios-users] Install difficulty
Yes. I could be going down the wrong path, but how does the Nagios install know what the document root of the webserver is? It would seem to me that there should be some reference to Nagios in the document root so the webserver knows where to direct the requests, or does Nagios not work that way?

----- Original Message -----
From: Yost, Karl [EMAIL PROTECTED]
To: Jim Miller [EMAIL PROTECTED]; nagios-users@lists.sourceforge.net
Sent: Tuesday, June 17, 2008 11:26:46 AM
Subject: RE: [Nagios-users] Install difficulty

Did you restart apache after you configured the aliases and directory options?

Thanks, Karl
[Nagios-users] different notification email
Hi,

I have a small question. Is it possible to use different email templates for notifications: one for warning and critical, and another one for recovery?

Many thanks
Re: [Nagios-users] Install difficulty
Yes. I could be going down the wrong path, but how does the Nagios install know what the document root of the webserver is? It would seem to me that there should be some reference to Nagios in the document root so the webserver knows where to direct the requests, or does Nagios not work that way?

# cat /etc/httpd/conf.d/nagios.conf
# SAMPLE CONFIG SNIPPETS FOR APACHE WEB SERVER
# Last Modified: 11-26-2005
#
# This file contains examples of entries that need
# to be incorporated into your Apache web server
# configuration file. Customize the paths, etc. as
# needed to fit your system.

ScriptAlias /nagios/cgi-bin /usr/lib/nagios/cgi

<Directory /usr/lib/nagios/cgi>
#  SSLRequireSSL
   Options ExecCGI
   AllowOverride None
   Order allow,deny
   Allow from all
#  Order deny,allow
#  Deny from all
#  Allow from 127.0.0.1
   AuthName "Nagios Access"
   AuthType Basic
   AuthUserFile /etc/nagios/static/htpasswd.users
   Require valid-user
</Directory>

Alias /nagios /usr/share/nagios

<Directory /usr/share/nagios>
#  SSLRequireSSL
   Options None
   AllowOverride None
   Order allow,deny
   Allow from all
#  Order deny,allow
#  Deny from all
#  Allow from 127.0.0.1
   AuthName "Nagios Access"
   AuthType Basic
   AuthUserFile /etc/nagios/static/htpasswd.users
   Require valid-user
</Directory>

etc etc etc etc
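To make the point of that config concrete: Nagios does not need to live under the document root, because the Alias and ScriptAlias lines map URL prefixes to directories elsewhere on disk. Below is a rough Python sketch of that mapping. It is a simplified model, not Apache's code; the function name resolve and the document root /var/www/html are assumptions for illustration.

```python
# Hypothetical alias table mirroring the two directives in the config above,
# listed in the order they appear in the file.
ALIASES = [
    ("/nagios/cgi-bin", "/usr/lib/nagios/cgi"),  # ScriptAlias
    ("/nagios", "/usr/share/nagios"),            # Alias
]


def resolve(url_path, aliases=ALIASES, document_root="/var/www/html"):
    """Map a URL path to a filesystem path; the first matching alias wins."""
    for prefix, target in aliases:
        # Match only on a path-segment boundary, so /nagiosx is not /nagios.
        if url_path == prefix or url_path.startswith(prefix + "/"):
            return target + url_path[len(prefix):]
    # No alias matched: fall back to the (assumed) document root.
    return document_root + url_path


if __name__ == "__main__":
    print(resolve("/nagios/"))
    print(resolve("/nagios/cgi-bin/status.cgi"))
```

If the conf.d file is missing, or Apache was not restarted after it was installed, no alias exists and the request falls through to the document root, which would be consistent with the 404 Jim reports.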
Re: [Nagios-users] Newbie question about nagios and snmp
2.- Is there a way to ask the free disk of / instead of /dev/sda1?

check_snmp_disk
Re: [Nagios-users] Newbie question about nagios and snmp
Hi Tom,

Tom Brown wrote:
2.- Is there a way to ask the free disk of / instead of /dev/sda1?
check_snmp_disk

In my Debian I can't find that check_snmp_disk; the only thing I have found in /etc/nagios-plugins/config/snmp.cfg is:

# 'snmp_disk' command definition
define command{
        command_name    snmp_disk
        command_line    /usr/lib/nagios/plugins/check_snmp -H $HOSTADDRESS$ -C $ARG1$ -o .1.3.6.1.4.1.2021.9.1.7.$ARG2$,.1.3.6.1.4.1.2021.9.1.9.$ARG2$ -w $ARG3$:,:$ARG4$ -c $ARG5$:,:$ARG6$ -u 'kB free (','% used)' -l 'disk space'
        }

Am I missing anything?

-- Arnau
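A note on the snmp_disk definition above: the two OIDs are columns 7 and 9 of the UCD-SNMP-MIB dskTable (dskAvail and dskPercent), and $ARG2$ is the row index. Column 2 of the same table (dskPath) holds the mount point, so asking for / rather than /dev/sda1 comes down to finding the row whose dskPath is /. A minimal Python sketch of that lookup follows; it assumes the dskPath column has already been walked into a dict, and the helper names and sample data are hypothetical.

```python
# Base OID of a dskTable row entry in UCD-SNMP-MIB.
DSK_ENTRY = ".1.3.6.1.4.1.2021.9.1"
DSK_PATH, DSK_AVAIL, DSK_PERCENT = 2, 7, 9  # column numbers


def index_for_path(dsk_paths, mount_point):
    """Find the table row index whose dskPath equals mount_point.

    dsk_paths is assumed to be {row_index: path}, e.g. built from an
    snmpwalk of the dskPath column.
    """
    for index, path in sorted(dsk_paths.items()):
        if path == mount_point:
            return index
    raise KeyError(mount_point)


def oids_for_index(index):
    """OIDs the snmp_disk command queries for one row (free kB, % used)."""
    return ["%s.%d.%d" % (DSK_ENTRY, col, index)
            for col in (DSK_AVAIL, DSK_PERCENT)]


if __name__ == "__main__":
    walked = {1: "/", 2: "/var"}  # hypothetical walk result
    row = index_for_path(walked, "/var")
    print(row, oids_for_index(row))
```

The resulting index is what would go in as $ARG2$. Note that this table normally only lists the filesystems the agent has been configured to report in snmpd.conf.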
Re: [Nagios-users] Newbie question about nagios and snmp
On Jun 17, 2008, at 12:30 PM, Arnau wrote:

Hi Tom,
Tom Brown wrote:
2.- Is there a way to ask the free disk of / instead of /dev/sda1?
check_snmp_disk
In my debian I can't find that check_snmp_disk, the only thing I have found in /etc/nagios-plugins/config/snmp.cfg is:
am I missing anything?

A google search. http://www.google.com/search?q=nagios+check_snmp_disk

-- Marc
Re: [Nagios-users] Newbie question about nagios and snmp
In my debian I can't find that check_snmp_disk, the only thing I have found in /etc/nagios-plugins/config/snmp.cfg is:
am I missing anything?

yes - you can get the plugin from nagiosexchange
Re: [Nagios-users] How to monitor windows software raid
Oops, sorry, I missed the Windows part; my mistake. I will look for an alternative; again, sorry for this.

As an alternative, there is a place called the Windows Script Center, and I think there would be a script there to check software RAID on Windows, or at least an example showing how to get this information. I would advise looking in that repository.

Again, my apologies.

Hari Sekhon wrote:
No, don't try that. Python is a great language, I code in it myself, but...
1. It is not portable across windows systems without installing add-ons
2. It checks *LINUX* software raid, not windows software raid.
-h
Re: [Nagios-users] check_hpasm error
Hi Tom,

UNKNOWN - hanging hpasmdcli processes

check_hpasm executes the hpasmcli command, which connects to the hpasmd daemon. If this daemon becomes unresponsive, hpasmcli just hangs. The next run of check_hpasm detects this hanging process and exits with the above error message. It would not make sense for the plugin to continue, as another hpasmcli would also hang.

I don't know the reason why the daemon stops responding. Maybe an update of the system BIOS helps. In case of such an alert the only known workaround is to restart hpasm with /etc/init.d/hpasm restart.

Gerhard

Does anyone know of a workaround to this apart from perhaps increasing the retry attempts for this service to try and catch it in a good state before the alarm triggers?

thanks
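On the general problem of a check that hangs on an unresponsive daemon: one common pattern is to run the command under a timeout and report UNKNOWN when it does not answer. To be clear, this is not what check_hpasm does (per Gerhard, it detects the leftover process on the next run), and it does not fix the daemon. The sketch below is a generic Python 3 wrapper with the hypothetical name run_with_timeout.

```python
import subprocess
import sys

UNKNOWN = 3  # Nagios plugin exit code for "could not determine state"


def run_with_timeout(argv, timeout_s):
    """Run argv and return (exit_code, output); a hang maps to UNKNOWN."""
    try:
        done = subprocess.run(
            argv,
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,
            universal_newlines=True,
            timeout=timeout_s,  # the child is killed when this expires
        )
    except subprocess.TimeoutExpired:
        return UNKNOWN, "UNKNOWN - command did not answer within %ss" % timeout_s
    return done.returncode, done.stdout.strip()


if __name__ == "__main__":
    # Stand-in for a hanging CLI: a child process that sleeps too long.
    hang = [sys.executable, "-c", "import time; time.sleep(5)"]
    code, text = run_with_timeout(hang, 0.5)
    print(code, text)
```

With a wrapper like this the stuck child is killed instead of piling up, but the daemon behind it would still need the restart described above.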
Re: [Nagios-users] How to monitor windows software raid
Hi Rene,

Sorry for joining this thread late.

For checking software RAID on Windows, the best way is to use a Windows plugin that gives connectivity to a standard Windows location of the data you need. With that said, I recommend NC_Net, not only because I wrote it but also because it provides mechanisms for retrieval using several of the methods hinted at in this thread:

1) Using the Windows event log. There are several plugins that allow checking the Windows event log (including NC_Net).
2) Writing a script or executable. There are several plugins that allow this as well (including NC_Net and NRPE).
3) Writing a WMI query (NC_Net has this as well, but there are also some scripts under NRPE that can do this).
4) You may be able to configure notification on degraded RAID to submit via NSCA, and then again on recovery.
5) You may be able to configure a submit-by-email on degraded/recovered RAID (I have not tried this).
6) You may be able to configure SNMP on the software RAID (I have not tried this), then check by SNMP.

For options 4, 5 and 6 you may want to look into volatile service checks (see the Nagios manual).

Good luck,
Tony (Author of NC_Net)

On Tue, Jun 17, 2008 at 3:32 AM, Rene Fertig [EMAIL PROTECTED] wrote:

Hello Hari.

Have you checked NagiosExchange.org? I didn't see anything but then I didn't look too closely. Otherwise you could just write one. VBS is an easy language and universal enough for this. I'd write this in fact, if I had any servers poor enough to have windows software raid...

Of course I've checked NagiosExchange, but there are only plugins for hardware RAIDs. I've found a VB script which uses DISKPART (http://www.anchor.com.au/hosting/dedicated/monitoring_windows_software_raid). Unfortunately DISKPART didn't work on Windows 2000 (only Win 2000 Pro, Win 2000 server, Win 2003 ...) But I've got a list of event IDs so I can monitor the event log and do passive checks - more than nothing...

Greetings, Rene

--
Tips and tricks for Linux, MySQL, web design - http://home.telebel.de/~referti/blog/
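Since both Rene (event IDs plus passive checks) and Tony (option 4, NSCA) point at passive results, here is a small Python sketch of the two line formats involved: the PROCESS_SERVICE_CHECK_RESULT external command that Nagios reads from its command file, and the tab-separated line that send_nsca takes on standard input. The host name, service description and message in the example are made up, and how an event ID gets mapped to a return code is left out entirely.

```python
import time


def _one_line(text):
    """Passive results are single lines; flatten any embedded newlines."""
    return " ".join(str(text).splitlines())


def external_command_line(host, service, return_code, output, now=None):
    """Line for the Nagios external command file (passive service result)."""
    stamp = int(time.time() if now is None else now)
    return "[%d] PROCESS_SERVICE_CHECK_RESULT;%s;%s;%d;%s" % (
        stamp, host, service, return_code, _one_line(output))


def send_nsca_line(host, service, return_code, output):
    """Tab-separated line as expected on send_nsca's standard input."""
    return "%s\t%s\t%d\t%s\n" % (host, service, return_code, _one_line(output))


if __name__ == "__main__":
    # Hypothetical host/service names; 2 is the Nagios CRITICAL code.
    print(external_command_line("winsrv01", "Software RAID", 2,
                                "mirror degraded", now=1213700000))
```

The service on the Nagios side would need passive checks enabled; Tony's pointer to volatile services is about how repeated non-OK results are then handled.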
Re: [Nagios-users] Monitoring large (ish) numbers of servers with exceptions to the rules...
Hi,

Using regular expressions and object templates is key to optimizing maintenance. I read some good details on what needs to be configured, and what can be inherited and automatically associated, in the current Nagios 3 documentation. I think much of the framework was already in Nagios 2, but the documentation is a bit easier to read in Nagios 3, so look at that for some tips, then check the Nagios 2 docs to see if the option is also in there.

A few years ago I converted a Nagios 1.2 setup, where all hosts and services were defined in a single file, to a scalable configuration similar to what was initially described here. I found that if you need to support different clients with daily changes, it is convenient to have one config directory for each client, and in that directory a single host file plus a separate config file for each host. When a host is removed, it is just a matter of removing it from the host file and renaming its config file. Adding a new host only means adding it to the host file, copying an existing service file, and cutting and pasting to get all the services defined. Then maintain the entire directory substructure through CVS or some other version control.

This, as noted, does get tedious to maintain, but it allows for customization of services per host without much thinking. The disadvantage is the time involved in maintenance when few changes are being made.

Other options work well too: templates, inheritance, regular expressions, and techniques using hostgroups all help with organizing the files, but depending on skill levels they sometimes lead to less readability (while for other admins they would lead to easier maintenance).

Hope this helps,

On Tue, Jun 17, 2008 at 8:22 AM, Wheeler, JF (Jonathan) [EMAIL PROTECTED] wrote:

-----Original Message-----
From: nagios-users On Behalf Of Matthew Macdonald-Wallace
Sent: 17 June 2008 13:14

I currently help maintain and monitor around 50 servers across various parts of the UK using Nagios 2. At the moment, we have a configuration file for each host (%hostname%.cfg) and in that file we specify all the services for the named host.

We are trying to reduce the number of configuration files as we take on more and more servers, because there are a large number of checks that we need to roll out to all servers and we feel that we are duplicating our workload.

I'm open to ideas on how to achieve this; however, my thoughts were a setup along the lines of the following:

- A master host template is created in which all services are defined for a host.
- If a check does not need to be run for a given host (for example, it is not a web server), a stanza is added to that particular host's config file that effectively tells Nagios "don't check for this service on this host".

I've tried defining all the services in a master templates file and this works perfectly; however, when I come to exclude certain services, I am at a loss on how to do it. Initially I tried adding a stanza with the same service name and "register 0" as one of the options, but this didn't work.

We have used hostgroups in the past to achieve a similar goal; however, we ran into the issue that whilst we need to check the CPU usage on all of the servers, a few of the servers that we monitor can take a lot more of a beating than the majority. This led to us defining the CPU checks on a per-host basis, because if we defined them separately from the hostgroup for the more powerful servers we were presented with a load of errors regarding duplicate service names.

I hope I've made myself clear on what we're after and I look forward to receiving your input on this.

One thing that I use in the configuration that I maintain is to have something like this:

define service{
        use             generic-hung-mounts
        hostgroup_name  experiments
        hosts           !lfc0448
        contact_groups  experiments
        }

where lcg0448 is a host in host group experiments and I want to apply the generic-hung-mounts check to all hosts in that group except for lcg0448. This can lead to configuration like this:

define service{
        use             check-pbs-offline
        hostgroup_name  workers
        hosts           !lcg0614,!lcg0617,!lcg0618,!lcg0626
        contact_groups  tier1a
        }

define service{
        use             check-pbs-offline
        hosts           lcg0614,lcg0617,lcg0618,lcg0626
        contact_groups  tier1a,grid-team
        }

where the only difference is that the hosts in the second definition have a second contact group.

HTH
Jonathan Wheeler
e-Science Centre
Rutherford Appleton Laboratory
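To illustrate how Jonathan's exclusion syntax plays out: a rough Python sketch that expands a hostgroup plus a host list containing !name entries into the final set of hosts the service applies to. This is only a model of the behaviour he describes, not Nagios's own parser; expand_hosts is a made-up name and lcg0612 is a made-up extra member of the workers group.

```python
def expand_hosts(hostgroups, hostgroup_name=None, hosts=""):
    """Return the sorted hosts a service applies to.

    hostgroups: {group_name: [member, ...]}
    hosts: comma-separated list; a leading '!' excludes that host.
    """
    members = set(hostgroups.get(hostgroup_name, [])) if hostgroup_name else set()
    excluded = set()
    for name in (h.strip() for h in hosts.split(",")):
        if not name:
            continue
        if name.startswith("!"):
            excluded.add(name[1:])   # '!host' removes it from the result
        else:
            members.add(name)        # a plain name adds the host explicitly
    return sorted(members - excluded)


if __name__ == "__main__":
    # Hypothetical group membership for illustration only.
    groups = {"workers": ["lcg0612", "lcg0614", "lcg0617"]}
    print(expand_hosts(groups, "workers", "!lcg0614,!lcg0617"))
    print(expand_hosts(groups, None, "lcg0614,lcg0617"))
```

Run on the two check-pbs-offline definitions, the first call covers the group minus the listed hosts and the second covers exactly those hosts, so every host ends up with one definition and no duplicate service names.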
[Nagios-users] Custom Status Field strings and warning levels
Hi all,

I would like to format the data returned from check commands in the Status Information field of the GUI. For example, I would like to add text after a check command that returns only an integer value, or, as another example I have brought up previously, divide the result by, say, 1024 to view it in KB (when returned in bytes) or MB (when returned in KB). Any ideas?

I am using COUNTERs with check_nt (NSClient++) on a RHEL 4 server running Nagios v3.0.1 with the latest plugins. I have tried different combinations of the description field:

    check_nt!COUNTER!-l \\Memory\\Available MBytes,%%.f Physical Memory Available
    check_nt!COUNTER!-l \\Memory\\Available MBytes,%%.2f Physical Memory Available
    check_nt!COUNTER!-l \\Memory\\Available MBytes,%%.f Physical Memory Available

All of these return at least one percent sign, and no value, such as:

    %f Physical Memory Available

If I leave out the description field (check_nt!COUNTER!-l \\Memory\\Available MBytes) then I get a correct integer value. I would like the GUI to show (assuming a machine with 2 GB of memory, of which 1348 MB is available):

    1348 available out of 2048

or maybe even a percentage available, as the built-in Memory Usage command does, but that command looks at the page file, not physical memory.

On another note, since I am using a large number of COUNTER definitions to check Windows servers, can someone tell me how to set the warning and critical levels in the check_nt!COUNTER! command? Say the available memory in the above example drops below 500: I want a warning, and below 250 a critical.

Thanks in advance,
Izz
Re: [Nagios-users] Custom Status Field strings and warning levels
OK... I guess I should have played around a bit more before posting. The -w and -c switches work as expected for the COUNTER definitions. I would still appreciate assistance with formatting the Status Information field.

Regards,
Izz

-----Original Message-----
From: [EMAIL PROTECTED]
Sent: Tue, 17 Jun 2008 16:53:43 -0800
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Custom Status Field strings and warning levels
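For anyone finding this thread later, the -w and -c usage mentioned in the reply could be written roughly as in the sketch below. It assumes check_nt is defined so that $ARG1$ is the variable and $ARG2$ carries the remaining options, which is how I recall the sample configuration shipped with Nagios 3 defining it. The template generic-service and the host winserver1 are hypothetical, and the quoting around the counter name is my own addition because of the space in it.

```cfg
# Sketch only: host name and template are hypothetical; the thresholds are
# the ones from the question (warning below 500 MB, critical below 250 MB).
define service{
        use                     generic-service
        host_name               winserver1
        service_description     Physical Memory Available
        check_command           check_nt!COUNTER!-l "\\Memory\\Available MBytes" -w 500 -c 250
        }
```

Whether the plugin alerts when the value drops below the thresholds, rather than when it rises above them, may depend on the check_nt version, so it is worth running the command by hand against a test machine before relying on it.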