[Nagios-users] Nagios check with multiple results and graphing
Hi,

I would like to graph the CPU usage of a Xen host using NagiosGrapher. I created a custom check that uses 'xentop', and I can have that script show the total CPU usage as well as the CPU usage of each VM, like this:

    XEN_DOMU_PERF - domU1 CpuUsage:.35% MemUsage:9.4%
    XEN_DOMU_PERF - domU2 CpuUsage:.02% MemUsage:6.3%
    XEN_DOMU_PERF - domU3 CpuUsage:.25% MemUsage:1.6%
    XEN_DOMU_PERF - Domain-0 CpuUsage:4.35% MemUsage:3.1%
    XEN_DOMU_PERF - domU4 CpuUsage:1.30% MemUsage:6.3%
    XEN_DOMU_PERF - domU5 CpuUsage:32.72% MemUsage:6.3%
    XEN_DOMU_PERF - domU6 CpuUsage:.20% MemUsage:6.3%
    XEN_DOMU_PERF - domU7 CpuUsage:.25% MemUsage:3.1%
    XEN_DOMU_PERF - domU8 CpuUsage:.05% MemUsage:4.7%
    XEN_DOMU_PERF - domU9 CpuUsage:.02% MemUsage:6.3%
    XEN_DOMU_PERF - domU10 CpuUsage:13.97% MemUsage:37.5%
    XEN_PERFORMANCE OK - CPUs:4 CpuUsage:53.50% MemTotal:16378MB MemUsage:90.9%

The problem is that the domUs are dynamic: they get moved between Xen hosts, new VMs are created, some are shut down, and so on. Is there any way to create such a dynamic graph without having to constantly update the Nagios and NagiosGrapher configuration?

Danilo

-- Centralized Desktop Delivery: Dell and VMware Reference Architecture Simplifying enterprise desktop deployment and management using Dell EqualLogic storage and VMware View: A highly scalable, end-to-end client virtualization framework. Read more! http://p.sf.net/sfu/dell-eql-dev2dev
___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
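One possible direction, sketched under assumptions: a single check can emit one performance-data label per domU, so the set of labels follows whatever domains are present at check time. The function below converts the XEN_DOMU_PERF lines shown in the message into standard Nagios perfdata (label=value with a % unit). The label scheme (<domain>_cpu, <domain>_mem) and the function name are hypothetical, and whether NagiosGrapher picks up new labels without a configuration change is not confirmed here.

```shell
#!/bin/bash
# Convert lines of the form
#   XEN_DOMU_PERF - domU1 CpuUsage:.35% MemUsage:9.4%
# into one line of Nagios performance data.
# Label names ("<domain>_cpu", "<domain>_mem") are an assumption for illustration.
xen_perfdata() {
    awk '$1 == "XEN_DOMU_PERF" {
        name = $3
        cpu = $4; sub(/^CpuUsage:/, "", cpu); sub(/%$/, "", cpu)
        mem = $5; sub(/^MemUsage:/, "", mem); sub(/%$/, "", mem)
        # "+ 0" normalizes values such as ".35" to "0.35"
        printf "%s%s_cpu=%g%% %s_mem=%g%%", sep, name, cpu + 0, name, mem + 0
        sep = " "
    }
    END { print "" }'
}
```

The plugin would then print its usual status text, a "|" character, and the output of xen_perfdata on the same line.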
[Nagios-users] upgrade nagios 1.4.1 to nagios 3.2.3
Hi,

Can someone describe the steps to upgrade Nagios 1.4.1 to Nagios 3.2.3? I am currently monitoring over 500 hosts and 3000 services using Nagios 1.4.1. I am also using NSCA and OCSP for Nagios failover.

Nagios 1.4.1 is installed from these RPMs:

    nagios-plugins-1.3.1-10.rhel3.dag
    nagios-plugins-nrpe-2.0-3.rhel3.dag
    nagios-nrpe-2.0-3.rhel3.dag
    nagios-nsca-2.4-2.rhel3.dag
    nagios-1.4.1-1.el3.rf

I wish to install the newer version using RPM. These are the RPMs I found:

    nagios-3.2.3-2.el5.rf.i386.rpm
    nagios-devel-3.2.3-2.el5.rf.i386.rpm

Thanks
Sunny
[Nagios-users] check_by_ssh help
I have an appliance that I cannot install anything on. However, I can connect via ssh, and I can run remote commands with check_by_ssh, e.g.

    check_by_ssh -H hmc -C lsled -m ratbat -r sa -t phys -l hscroot -s /home/nagios/.ssh/id_rsa

This checks my HMC for the status of the attention LED on an AIX box. However, whatever the output is, it comes back as OK in Nagios. Is there a way to use check_by_ssh to check for a specific word?

Also, the list active/deactive systems command puts each system on a separate line, and I would rather have them all on one continuous line. It is not possible to write anything to the remote machine.

Regards
B
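A minimal sketch of one way to approach this, assuming the filtering is done on the Nagios server rather than on the appliance: the function reads the remote command's output on stdin, joins it onto one line, and maps the presence of a keyword to a Nagios exit code. The function name, the choice of CRITICAL when the keyword is found, and the use of plain ssh in place of check_by_ssh are all assumptions for illustration.

```shell
#!/bin/bash
# Read command output on stdin, join it onto a single line, and map the
# presence of a keyword to a Nagios exit code (0 = OK, 2 = CRITICAL).
# Treating "keyword present" as CRITICAL is an assumption; invert as needed.
check_for_word() {
    local word=$1 joined
    joined=$(tr '\n' ' ' | sed 's/ *$//')
    if printf '%s' "$joined" | grep -qiw -- "$word"; then
        echo "CRITICAL - found '$word': $joined"
        return 2
    fi
    echo "OK - '$word' not found: $joined"
    return 0
}

# Hypothetical use on the Nagios server (host, user, key path and remote
# command are taken from the message above; the keyword "on" is a placeholder):
#   ssh -i /home/nagios/.ssh/id_rsa hscroot@hmc 'lsled -m ratbat -r sa -t phys' | check_for_word on
```

Because nothing is written to the remote machine, this fits the constraint that the appliance cannot be modified; the wrapper would be registered as its own command definition in Nagios.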
Re: [Nagios-users] Notifications on passive service checks
On 11/11/2010 11:27 PM, Hall, JC wrote:
> Is it accurate that Nagios will only send 1 notification for a passive service check? IE, the notification_interval definition for a passively checked service won't instruct Nagios to re-send a notification such as with actively checked services?

To be honest, I haven't got the faintest idea. An educated guess is that it will re-send the notification if it receives another passive check-result and enough time has passed though, or that it simply re-sends the notification when enough time has passed. If you try and find out, let me know either way and I'll amend the docs.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
Re: [Nagios-users] upgrade nagios 1.4.1 to nagios 3.2.3
On 11/12/2010 10:02 AM, Sunny Jaisinghani wrote:
> Hi, Can someone describe steps to upgrade nagios 1.4.1 to nagios 3.2.3. Currently i am monitoring over 500 hosts and 3000 services using nagios 1.4.1 I am also using nsca and ocsp for nagios failover.
> Nagios 1.4.1 is installed using rpms
>     nagios-plugins-1.3.1-10.rhel3.dag
>     nagios-plugins-nrpe-2.0-3.rhel3.dag
>     nagios-nrpe-2.0-3.rhel3.dag
>     nagios-nsca-2.4-2.rhel3.dag
>     nagios-1.4.1-1.el3.rf
> I wish to install the newer version using rpm. These are the rpms i found
>     nagios-3.2.3-2.el5.rf.i386.rpm
>     nagios-devel-3.2.3-2.el5.rf.i386.rpm

RPM packages quite often contain trigger-scripts that handle everything automagically. That might not be the case if you go from rhel3 to rhel5 packages though. Make backups before you try anything.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
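The "make backups" advice could look something like the sketch below, which archives a directory into a timestamped tarball before the packages are touched. The function name is hypothetical and the directories in the usage comment are placeholders; the real configuration and data locations depend on how the rhel3 RPMs laid things out.

```shell
#!/bin/bash
# Archive a directory into a timestamped tarball and print the archive path.
# Both arguments are placeholders; real locations depend on the RPM layout.
backup_dir() {
    local src=$1 dest_dir=$2 stamp archive
    stamp=$(date +%Y%m%d-%H%M%S)
    archive="$dest_dir/$(basename "$src")-$stamp.tar.gz"
    mkdir -p "$dest_dir" &&
        tar -czf "$archive" -C "$(dirname "$src")" "$(basename "$src")" &&
        echo "$archive"
}

# Hypothetical use before upgrading (paths are assumptions):
#   backup_dir /etc/nagios /root/nagios-backup
#   backup_dir /var/log/nagios /root/nagios-backup
```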
[Nagios-users] debugging eventhandler via nrpe
Hello all,

I am trying to implement an event handler on a remote machine, and I am having a problem with the way the status arguments are transferred over the NRPE channel. My config is as follows:

    define service{
        snip
        max_check_attempts 3
        event_handler check_nrpe_nonssl!solr-restart!$SERVICESTATE$ $SERVICESTATETYPE$ $SERVICEATTEMPT$!-t 200
        snip
    }

On the remote server, nrpe was compiled with --allow-command-args, and in nrpe.cfg I have the following parameters:

    dont_blame_nrpe=1
    debug=1
    command_timeout=300
    # Event Handler
    command[solr-restart]=/usr/local/nagios/libexec/eventhandlers/restart-solr.sh

The event handler script is:

    #!/bin/bash
    #
    # Event handler script for restarting the web server on the local machine
    #
    # Note: This script will only restart the web server if the service is
    #       retried 3 times (in a soft state) or if the web service somehow
    #       manages to fall into a hard error state.
    #
    case $1 in
    OK)
        # The service just came back up, so don't do anything...
        ;;
    WARNING)
        ;;
    UNKNOWN)
        ;;
    CRITICAL)
        case $2 in
        SOFT)
            case $3 in
            2)
                echo Too early - not restarting yet
                exit 0
                ;;
            esac
            case $3 in
            3)
                echo Too early - not restarting yet
                exit 0
                ;;
            esac
            ;;
        HARD)
            cd /usr/local/nagios/libexec/eventhandlers/
            curl -s -v -u *:*** --request PUT -d @solr7down.xml http://LB
            sleep 5
            sudo /etc/init.d/jboss stop
            sleep 60
            sudo /etc/init.d/jboss start
            sleep 15
            curl -s -v -u *:** --request PUT -d @solr7up.xml http://LB
            sleep 3
            echo Event handler restarted the solr service
            ;;
        esac
        ;;
    esac
    echo Event handler restarted the solr service
    exit 0

I can see in the syslog that the script is initiated:

    Nov 11 18:15:44 gbc1-solr-07 nrpe[29687]: Host address is in allowed_hosts
    Nov 11 18:15:44 gbc1-solr-07 nrpe[29687]: Handling the connection...
    Nov 11 18:15:44 gbc1-solr-07 nrpe[29687]: Host is asking for command 'solr-restart' to be run...
    Nov 11 18:15:44 gbc1-solr-07 nrpe[29687]: Running command: /usr/local/nagios/libexec/eventhandlers/restart-solr.sh
    Nov 11 18:15:44 gbc1-solr-07 nrpe[29687]: Command completed with return code 0 and output:

But the event handler is not performing the tasks it is meant to. When I initiate the command manually,

    ~/libexec/check_nrpe -H host -n -c solr-restart -a Critical HARD 3 -t 200

it runs fine. I know it is an issue with the transfer of the arguments to nrpe, but I am missing something to make sure they are parsed and sent properly. Can anyone point me to the sign in front of me that I am blindly missing?

Thanks

--
Never,Ever Cut A Deal With a Dragon
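One way to narrow this down, sketched under assumptions: have the handler record exactly which arguments it receives. NRPE substitutes the values passed with -a only where the command definition contains $ARG1$, $ARG2$, ... placeholders, and the command[solr-restart] line quoted above has none, so it is possible that the script is being started with no arguments at all. Note also that the script's case labels are upper case (CRITICAL) while the manual test passes "Critical". The helper name and the log path below are hypothetical.

```shell
#!/bin/bash
# Append a timestamp, the argument count and each argument to a log file,
# so the values that actually reach the handler can be inspected afterwards.
# The log path passed as the first parameter is a placeholder.
log_handler_args() {
    local logfile=$1; shift
    local i=1 arg
    {
        printf '%s argc=%d' "$(date '+%F %T')" "$#"
        for arg in "$@"; do
            printf ' $%d=[%s]' "$i" "$arg"
            i=$((i + 1))
        done
        printf '\n'
    } >> "$logfile"
}

# At the top of the handler script one might call (path is an assumption):
#   log_handler_args /tmp/restart-solr.args "$@"
```

If the log shows argc=0 when Nagios triggers the handler, the arguments are being lost before the script starts rather than inside it.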
Re: [Nagios-users] Notifications on passive service checks
Actually, whether the check is passive or active is somewhat irrelevant in this context. What determines the notification policy is your central Nagios and what you defined in the notification directives of your service/host. If you also define an escalation for that service, that will kick off the notifications.

On 12/11/10 12:00, Andreas Ericsson wrote:
> On 11/11/2010 11:27 PM, Hall, JC wrote:
>> Is it accurate that Nagios will only send 1 notification for a passive service check? IE, the notification_interval definition for a passively checked service won't instruct Nagios to re-send a notification such as with actively checked services?
>
> To be honest, I haven't got the faintest idea. An educated guess is that it will re-send the notification if it receives another passive check-result and enough time has passed though, or that it simply re-sends the notification when enough time has passed. If you try and find out, let me know either way and I'll amend the docs.

--
Never,Ever Cut A Deal With a Dragon

Next year I will be doing the London to Paris bike ride to raise money for the DogTrust (www.dogstrust.co.uk). Please Sponsor me at http://www.justgiving.com/Assaf-Flatto
[Nagios-users] Suppress Max concurrent service checks messages.
We're running Nagios 3.2.3 with concurrent service checks set to 40. We can't go much higher than this due to resource constraints outside of Nagios but we're running 329 services at 5 minute intervals (this is a load test of sorts not production load ... yet). Average execution time/latency is 36/11 seconds so we're seeing quite a few messages like this in the Nagios log file:

    (Informational Message) [11-11-2010 14:55:57] Max concurrent service checks (40) has been reached. Nudging host:service by 9 seconds...

Is there any way to suppress these messages from being logged? I don't see an option for logging these in the config file documentation.

Thanks,
Paul Dubuc
Re: [Nagios-users] Suppress Max concurrent service checks messages.
On 11/12/2010 04:30 PM, Paul M. Dubuc wrote:
> We're running Nagios 3.2.3 with concurrent service checks set to 40. We can't go much higher than this due to resource constraints outside of Nagios but we're running 329 services at 5 minute intervals (this is a load test of sorts not production load ... yet). Average execution time/latency is 36/11 seconds so we're seeing quite a few messages like this in the Nagios log file:

If you're doing a load test on a system that clearly doesn't handle production load and thus forces you to run with less than optimal settings, you're doing things wrong.

> (Informational Message) [11-11-2010 14:55:57] Max concurrent service checks (40) has been reached. Nudging host:service by 9 seconds...
>
> Is there any way to suppress these messages from being logged? I don't see an option for logging these in the config file documentation.

Not really, no. See my previous comment though. It's equally valid now, even though about 12 seconds have passed since I wrote it.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
Re: [Nagios-users] Suppress Max concurrent service checks messages.
On 12 Nov 2010, at 15:30, Paul M. Dubuc wrote:
> We're running Nagios 3.2.3 with concurrent service checks set to 40. We can't go much higher than this due to resource constraints outside of Nagios but we're running 329 services at 5 minute intervals (this is a load test of sorts not production load ... yet). Average execution time/latency is 36/11 seconds so we're seeing quite a few messages like this in the Nagios log file:
>
> (Informational Message) [11-11-2010 14:55:57] Max concurrent service checks (40) has been reached. Nudging host:service by 9 seconds...
>
> Is there any way to suppress these messages from being logged? I don't see an option for logging these in the config file documentation.

I put those messages in.

Firstly, 40 doesn't necessarily mean there are 40 concurrent service checks running, as they may have finished but not been reaped yet (to decrement the counter).

Secondly, if you are getting these messages, then either (1) this limit is too low: increase it and keep an eye on the load on your Nagios server; or (2) you've got too many checks running: reduce frequencies/numbers or set up a slave server.

The trouble with the way the nudging works is that it hides the fact that you have latency issues (as the check is rescheduled to a future time). This means nagiostats will not include the additional latency time here. If someone has a better way of working this out, I'm all ears.

Ton
[Nagios-users] check_http requests - SSL Not available
I've been digging at this one for a while and not getting any closer to resolution. Has anyone else encountered problems with getting check_http to work with the -S/--ssl option?

Nagios 3.2.1 running on a CentOS box. I have openssl and openssl-devel installed:

    Package openssl-0.9.8e-12.el5_4.6.x86_64 already installed and latest version
    Package openssl-0.9.8e-12.el5_4.6.i686 already installed and latest version
    Nothing to do
    Package openssl-devel-0.9.8e-12.el5_4.6.x86_64 already installed and latest version
    Package openssl-devel-0.9.8e-12.el5_4.6.i386 already installed and latest version
    Nothing to do

I recompiled my plugins and got confirmation of openssl:

    --with-apt-get-command:
    --with-ping6-command: /bin/ping6 -n -U -w %d -c %d %s
    --with-ping-command: /bin/ping -n -U -w %d -c %d %s
    --with-ipv6: yes
    --with-mysql: no
    --with-openssl: yes
    --with-gnutls: no
    --enable-extra-opts: no
    --with-perl: /usr/bin/perl
    --enable-perl-modules: no
    --with-cgiurl: /nagios/cgi-bin
    --with-trusted-path: /bin:/sbin:/usr/bin:/usr/sbin
    --enable-libtap: no

For some reason, after this, I'm still not able to get a response to --ssl checks using check_http.

    [r...@nagios nagios-plugins-1.4.14]# /usr/local/nagios/libexec/check_http --ssl -H siteaddress
    check_http: Invalid option - SSL is not available

Kevin Davison
Network Administrator
Innosphere SDG Ltd.
147 Wyndham St. N., Ste 306
Guelph, ON, N1H 4E9
(519) 766-9726 X223
Email: kdavi...@innosphere.ca
Website: www.innosphere.ca
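A possible diagnostic, offered as a sketch rather than a confirmed fix: check whether the installed check_http binary is actually linked against libssl. If it is not, the binary in /usr/local/nagios/libexec may predate the recompile (for example, if the new build was never installed over it, or the build reused earlier results). The helper name is hypothetical; ldd is the standard Linux tool for listing shared-library dependencies.

```shell
#!/bin/bash
# Succeed if ldd-style output on stdin mentions libssl.
links_libssl() {
    grep -q 'libssl'
}

# Possible use (binary path taken from the message above):
#   ldd /usr/local/nagios/libexec/check_http | links_libssl \
#       && echo "linked against libssl" || echo "not linked against libssl"
```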
Re: [Nagios-users] check_http requests - SSL Not available
On 12/11/10 17:15, Kevin Davison wrote:
> I've been digging at this one for a while and not getting any closer to resolution. Has anyone else encountered problems with getting check_http to work with the -S/--ssl option?
>
> Nagios 3.2.1 running on a CentOS box. I have openssl and openssl-devel installed
>
>     Package openssl-0.9.8e-12.el5_4.6.x86_64 already installed and latest version
>     Package openssl-0.9.8e-12.el5_4.6.i686 already installed and latest version
>     Nothing to do
>     Package openssl-devel-0.9.8e-12.el5_4.6.x86_64 already installed and latest version
>     Package openssl-devel-0.9.8e-12.el5_4.6.i386 already installed and latest version
>     Nothing to do
>
> I recompiled my plugins and got confirmation of openssl
>
>     --with-apt-get-command:
>     --with-ping6-command: /bin/ping6 -n -U -w %d -c %d %s
>     --with-ping-command: /bin/ping -n -U -w %d -c %d %s
>     --with-ipv6: yes
>     --with-mysql: no
>     --with-openssl: yes
>     --with-gnutls: no
>     --enable-extra-opts: no
>     --with-perl: /usr/bin/perl
>     --enable-perl-modules: no
>     --with-cgiurl: /nagios/cgi-bin
>     --with-trusted-path: /bin:/sbin:/usr/bin:/usr/sbin
>     --enable-libtap: no
>
> For some reason, after this, I'm still not able to get a response to --ssl checks using check_http.
>
>     [r...@nagios nagios-plugins-1.4.14]# /usr/local/nagios/libexec/check_http --ssl -H siteaddress
>     check_http: Invalid option - SSL is not available
>
> Kevin Davison
> Network Administrator

I am able to run check_http with the -S option:

    ~/libexec/check_http -H 10.0.X.XX -S
    HTTP WARNING: HTTP/1.1 401 Authorization Required - 1726 bytes in 0.036 second response time |time=0.035854s;;;0.00 size=1726B;;;0

openssl might be installed, but do you have the devel package? It will be required for the compilation to work.

--
Never,Ever Cut A Deal With a Dragon

Next year I will be doing the London to Paris bike ride to raise money for the DogTrust (www.dogstrust.co.uk). Please Sponsor me at http://www.justgiving.com/Assaf-Flatto
Re: [Nagios-users] Suppress Max concurrent service checks messages.
Ton Voon wrote:
> On 12 Nov 2010, at 15:30, Paul M. Dubuc wrote:
>> We're running Nagios 3.2.3 with concurrent service checks set to 40. We can't go much higher than this due to resource constraints outside of Nagios but we're running 329 services at 5 minute intervals (this is a load test of sorts not production load ... yet). Average execution time/latency is 36/11 seconds so we're seeing quite a few messages like this in the Nagios log file:
>>
>> (Informational Message) [11-11-2010 14:55:57] Max concurrent service checks (40) has been reached. Nudging host:service by 9 seconds...
>>
>> Is there any way to suppress these messages from being logged? I don't see an option for logging these in the config file documentation.
>
> I put those messages in.
>
> Firstly, 40 doesn't necessarily mean there are 40 concurrent service checks running as they may have finished but not been reaped yet (to decrement the counter).
>
> Secondly, if you are getting these messages, then either (1) this limit is too low - increase and keep an eye on the load on your nagios server; (2) you've got too many checks running - reduce frequencies/numbers or set up a slave server.
>
> The trouble with the way the nudging works is that it hides the fact that you have latency issues (as the check is rescheduled to a future time). This means nagiostats will not include the additional latency time here. If someone has a better way of working this out, I'm all ears.
>
> Ton

Thanks, Ton. This is helpful information and advice. The services we're running require web browsers, which are CPU- and memory-intensive, and for the time being we need to manage that load on the Nagios server. In production we shouldn't have these limitations, but for now I just wanted to keep all these messages from flooding the log.

Andreas, I know it's doing things wrong, but there's not much I can do about it right now. Since I already know what problem these messages are telling me about, I'd just like to keep them from flooding the logs so I can see what else is happening more easily. That's all.

Thanks,
Paul Dubuc
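Since the thread indicates there is no option to stop Nagios from writing these lines, one workaround is to hide them when reading the log. The sketch below only filters what is displayed; it does not change what Nagios logs. The function name is hypothetical and the log path in the usage comment is an assumption; the pattern is taken from the message text quoted above.

```shell
#!/bin/bash
# Pass log lines from stdin to stdout, dropping the
# "Max concurrent service checks (N) has been reached" nudge messages.
filter_nudges() {
    grep -v 'Max concurrent service checks (.*) has been reached'
}

# Possible use (log path is an assumption):
#   filter_nudges < /usr/local/nagios/var/nagios.log | less
```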
Re: [Nagios-users] Suppress Max concurrent service checks messages.
Ton Voon wrote:
> ...
> The trouble with the way the nudging works is that it hides the fact that you have latency issues (as the check is rescheduled to a future time). This means nagiostats will not include the additional latency time here. If someone has a better way of working this out, I'm all ears.

Would it cause other problems if the total nudging time for a service were included in its latency time?
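Until something like that exists in Nagios itself, the nudge time can be estimated from the log. The sketch below sums the "by N seconds" values per host:service, assuming the log lines follow the format quoted in this thread; the function name is hypothetical, and the totals are only a rough indication of hidden latency, not a figure that nagiostats reports.

```shell
#!/bin/bash
# Sum the nudge delays per "host:service" from Nagios log lines on stdin.
# Assumes the message format quoted in this thread:
#   ... Max concurrent service checks (40) has been reached.  Nudging host:service by 9 seconds...
nudge_totals() {
    awk '/Max concurrent service checks/ && match($0, /Nudging .* by [0-9]+ seconds/) {
        s = substr($0, RSTART + 8, RLENGTH - 8)   # "host:service by 9 seconds"
        n = split(s, w, " ")
        secs = w[n - 1]                           # the number before "seconds"
        sub(/ by [0-9]+ seconds$/, "", s)         # leaves "host:service"
        total[s] += secs
    }
    END { for (k in total) printf "%s %d\n", k, total[k] }' | sort
}
```

Each output line is a host:service pair followed by its total nudged seconds over the log that was read.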
Re: [Nagios-users] different notification_intervals by contact
From: Duncan Berriman [mailto:dun...@dcl.co.uk]
Sent: Wednesday, November 10, 2010 1:00 PM
To: 'Nagios Users List'
Subject: Re: [Nagios-users] different notification_intervals by contact

> Escalations are a little pesky to get working correctly. Here is an example. ...

Thanks, Duncan. I've decided to take a somewhat different approach. Ultimately, what they want is for the pager to fire at 4x the interval of the e-mail (15 minutes versus 1 hour). This doesn't wind up being all that hard if I make a contact that calls a simple shell script. That shell script then looks at the NOTIFICATIONNUMBER to (in this case) determine whether it's a multiple of 4 and, if so, sends the alert.

In fact, I'm going to make the script take an argument that sets the number to perform the modulo on. So in theory this could be reused if someone wanted to have something run on every other notification number, every 6th, and so on.

The downside as I see it is that Nagios won't have an accurate record of who got which notifications. From Nagios' perspective, it sent an alert to the mailing list, but really the script acts as a gateway that determines whether a message was actually sent. So the Notifications for the host/service as shown in the UI will not be quite correct. But I think they can live with that.

Mark
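The gateway described above might be sketched as follows. This is an illustration under assumptions, not Mark's actual script: the function name is hypothetical, and how the notification number reaches the script (for example as a command-line argument filled in from a Nagios macro in the notification command) is left to the command definition.

```shell
#!/bin/bash
# Succeed when the notification number is a multiple of the modulus.
# $1 = notification number, $2 = modulus (defaults to 4, as in the message).
should_notify() {
    local number=$1 modulus=${2:-4}
    [ "$modulus" -gt 0 ] && [ $((number % modulus)) -eq 0 ]
}

# Hypothetical use inside the notification script:
#   if should_notify "$1" "$2"; then
#       ... send the page here ...
#   fi
```

One consequence of a strict modulo test is that the very first notification (number 1) is not forwarded; whether that is acceptable depends on the paging policy.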
Re: [Nagios-users] check_http requests - SSL Not available
Yes. openssl-devel is installed.

    Package openssl-devel-0.9.8e-12.el5_4.6.x86_64 already installed and latest version
    Package openssl-devel-0.9.8e-12.el5_4.6.i386 already installed and latest version

From: Assaf Flatto [mailto:nag...@flatto.net]
Sent: November-12-10 12:27 PM
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] check_http requests - SSL Not available

> On 12/11/10 17:15, Kevin Davison wrote:
>> I've been digging at this one for a while and not getting any closer to resolution. Has anyone else encountered problems with getting check_http to work with the -S/--ssl option?
>>
>> Nagios 3.2.1 running on a CentOS box. I have openssl and openssl-devel installed
>>
>>     Package openssl-0.9.8e-12.el5_4.6.x86_64 already installed and latest version
>>     Package openssl-0.9.8e-12.el5_4.6.i686 already installed and latest version
>>     Nothing to do
>>     Package openssl-devel-0.9.8e-12.el5_4.6.x86_64 already installed and latest version
>>     Package openssl-devel-0.9.8e-12.el5_4.6.i386 already installed and latest version
>>     Nothing to do
>>
>> I recompiled my plugins and got confirmation of openssl
>>
>>     --with-apt-get-command:
>>     --with-ping6-command: /bin/ping6 -n -U -w %d -c %d %s
>>     --with-ping-command: /bin/ping -n -U -w %d -c %d %s
>>     --with-ipv6: yes
>>     --with-mysql: no
>>     --with-openssl: yes
>>     --with-gnutls: no
>>     --enable-extra-opts: no
>>     --with-perl: /usr/bin/perl
>>     --enable-perl-modules: no
>>     --with-cgiurl: /nagios/cgi-bin
>>     --with-trusted-path: /bin:/sbin:/usr/bin:/usr/sbin
>>     --enable-libtap: no
>>
>> For some reason, after this, I'm still not able to get a response to --ssl checks using check_http.
>>
>>     [r...@nagios nagios-plugins-1.4.14]# /usr/local/nagios/libexec/check_http --ssl -H siteaddress
>>     check_http: Invalid option - SSL is not available
>>
>> Kevin Davison
>> Network Administrator
>
> I am able to run the check_http with the -S option
>
>     ~/libexec/check_http -H 10.0.X.XX -S
>     HTTP WARNING: HTTP/1.1 401 Authorization Required - 1726 bytes in 0.036 second response time |time=0.035854s;;;0.00 size=1726B;;;0
>
> openssl might be installed , but do you have the devel package - it will be required for the compilation to work .
>
> --
> Never,Ever Cut A Deal With a Dragon
>
> Next year I will be doing the London to Paris bike ride to raise money for the DogTrust (www.dogstrust.co.uk) . Please Sponsor me at http://www.justgiving.com/Assaf-Flatto
[Nagios-users] check_openmanage -- question about battery check
hey gang,

i'm using check_openmanage (http://folk.uio.no/trondham/software/check_openmanage.html) to monitor some dell servers. i'm using version 3.5.6 in production, but have also tested with 3.6.1 (latest, released Nov. 2, 2010).

the problem i'm having is that the check is reporting battery charging WARNINGS even though I'm blacklisting that check.

===
r...@nagios:/opt/plugins# perl ./check_openmanage-3.6.1 -H server1 -C public -e -s -i -b bat_charge
WARNING: [ABCDEFG] Cache Battery 0 in controller 0 is Learning (Active) [probably harmless]
WARNING: [ABCDEFG] Cache Battery 0 in controller 1 is Learning (Active) [probably harmless]
--
SYSTEM: PowerEdge R710, SN: ABCDEFG
r...@nagios:/opt/plugins#
==

is there a way to ignore these that works? am i doing something wrong?

thanks,
J
Re: [Nagios-users] check_openmanage -- question about battery check
> the problem i'm having is that the check is reporting battery charging WARNINGS even though I'm blacklisting that check.
> ===
> r...@nagios:/opt/plugins# perl ./check_openmanage-3.6.1 -H server1 -C public -e -s -i -b bat_charge

No, you're not... Not quite, anyway. :)

Re-visit the documentation for blacklisting - you need to specify *which* battery you're blacklisting. This is the case for all blacklist directives.

Hint: I use '-b bat_charge=ALL' in my service definition.

Benny

--
No matter how many shorts we have in the system, my guards will be instructed to treat every surveillance camera malfunction as a full-scale emergency.
  -- Peter Anspach's Evil Overlord List, #67
Re: [Nagios-users] Suppress Max concurrent service checks messages.
On 11/12/2010 06:03 PM, Ton Voon wrote:
> On 12 Nov 2010, at 15:30, Paul M. Dubuc wrote:
>> We're running Nagios 3.2.3 with concurrent service checks set to 40. We can't go much higher than this due to resource constraints outside of Nagios but we're running 329 services at 5 minute intervals (this is a load test of sorts, not production load ... yet). Average execution time/latency is 36/11 seconds so we're seeing quite a few messages like this in the Nagios log file:
>>
>> (Informational Message) [11-11-2010 14:55:57] Max concurrent service checks (40) has been reached. Nudging host:service by 9 seconds...
>>
>> Is there any way to suppress these messages from being logged? I don't see an option for logging these in the config file documentation.
>
> I put those messages in.
>
> Firstly, 40 doesn't necessarily mean there are 40 concurrent service checks running, as they may have finished but not been reaped yet (to decrement the counter).
>
> Secondly, if you are getting these messages, then either (1) this limit is too low - increase it and keep an eye on the load on your nagios server; or (2) you've got too many checks running - reduce frequencies/numbers or set up a slave server.
>
> The trouble with the way the nudging works is that it hides the fact that you have latency issues (as the check is rescheduled to a future time). This means nagiostats will not include the additional latency time here. If someone has a better way of working this out, I'm all ears.

We could use something like pnp4nagios does, and issue a check to make sure load is below a certain threshold before firing off new checks. There's a (reasonably) portable way of getting the number of online CPUs, so we could even make an educated guess at how many checks we can run to saturate the CPUs while still not running too many checks.

Of course, some checks are more heavy-duty than others. As a first stab at maintaining reasonable load, we should probably ignore that. At a later point, we might want to introduce the probable load increase of running a given check and nudge checks into the future when we're in danger of load / num_cpus > 0.9 or some other suitable number.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
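For readers who want to see the load / num_cpus idea in concrete form, here is a rough sketch in Python. It is not Nagios code and not how Nagios 3.2.3 schedules checks; the function names, the choice of the 1-minute load average, and the default threshold of 0.9 (taken from the message above) are all assumptions made for illustration.

```python
import os


def should_nudge(load_avg: float, num_cpus: int, threshold: float = 0.9) -> bool:
    """Return True when load per CPU is above the threshold.

    Hypothetical helper: a True result means "push new checks into the
    future instead of starting them now".
    """
    if num_cpus < 1:
        raise ValueError("num_cpus must be at least 1")
    return load_avg / num_cpus > threshold


def current_load_per_cpu() -> float:
    """Read the 1-minute load average and divide it by the CPU count.

    os.getloadavg() is only available on Unix-like systems, and
    os.cpu_count() may return None, in which case one CPU is assumed.
    """
    one_minute, _five, _fifteen = os.getloadavg()
    return one_minute / (os.cpu_count() or 1)
```

A scheduler loop could call should_nudge(os.getloadavg()[0], os.cpu_count() or 1) before launching each batch of checks. As the message notes, this treats all checks as equally expensive, which is only a first approximation.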
Re: [Nagios-users] Suppress Max concurrent service checks messages.
On 11/12/2010 06:40 PM, Paul M. Dubuc wrote:
> Andreas,
>
> I know it's doing things wrong, but there's not much I can do about it right now. Since I know what the problem is that these messages are trying to tell me, I'd just like to keep them from flooding the logs so I can see what else is happening more easily. That's all.

You could always run Nagios in the foreground and redirect the log through a grep -v filter, restarting it at midnight every night and rotating logs manually. It's not difficult. Just cumbersome.

So long as you're aware that whatever you conclude from your tests will be more than just a little off wrt what you wanted to determine, you'll almost certainly do alright though.

--
Andreas Ericsson andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
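The filtering step suggested above can also be sketched as a small Python stand-in for grep -v. This is only an illustration: the marker string is copied from the log message quoted earlier in the thread, the function names are hypothetical, and how the Nagios log stream gets piped into the filter is left open, as it is in the message.

```python
import sys
from typing import Iterable, Iterator, TextIO

# Substring taken from the log message quoted earlier in this thread.
NUDGE_MARKER = "Max concurrent service checks"


def drop_nudge_messages(lines: Iterable[str], marker: str = NUDGE_MARKER) -> Iterator[str]:
    """Yield every line that does not contain the marker (like grep -v)."""
    for line in lines:
        if marker not in line:
            yield line


def filter_stream(src: TextIO, dst: TextIO) -> None:
    """Copy src to dst line by line, skipping the nudge messages."""
    for line in drop_nudge_messages(src):
        dst.write(line)
        dst.flush()  # keep the filtered log current while Nagios is running
```

Calling filter_stream(sys.stdin, sys.stdout) turns this into a pipe filter. The nightly restart and manual log rotation mentioned above would still have to be handled separately.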