[Nagios-users] Nagios check with multiple results and graphing

2010-11-12 Thread Danilo Godec
 Hi,

I would like to graph the CPU usage of a Xen host using NagiosGrapher. I
created a custom check that utilizes 'xentop' and I can have that script
show the 'total' CPU usage, but also CPU usage of each VM - like that:

 XEN_DOMU_PERF - domU1 CpuUsage:.35% MemUsage:9.4%
 XEN_DOMU_PERF - domU2 CpuUsage:.02% MemUsage:6.3%
 XEN_DOMU_PERF - domU3 CpuUsage:.25% MemUsage:1.6%
 XEN_DOMU_PERF - Domain-0 CpuUsage:4.35% MemUsage:3.1%
 XEN_DOMU_PERF - domU4 CpuUsage:1.30% MemUsage:6.3%
 XEN_DOMU_PERF - domU5 CpuUsage:32.72% MemUsage:6.3%
 XEN_DOMU_PERF - domU6 CpuUsage:.20% MemUsage:6.3%
 XEN_DOMU_PERF - domU7 CpuUsage:.25% MemUsage:3.1%
 XEN_DOMU_PERF - domU8 CpuUsage:.05% MemUsage:4.7%
 XEN_DOMU_PERF - domU9 CpuUsage:.02% MemUsage:6.3%
 XEN_DOMU_PERF - domU10 CpuUsage:13.97% MemUsage:37.5%
 XEN_PERFORMANCE OK - CPUs:4 CpuUsage:53.50% MemTotal:16378MB
 MemUsage:90.9%

The 'problem' is, that DomU's are 'dynamic' - they get moved around Xen
hosts, new VM's are created, some are shutdown etc...

Is there any way to create such a 'dynamic' graph without the need to
constantly update Nagios and NagiosGrapher configuration?

  Danilo


--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] upgrade nagios 1.4.1 to nagios 3.2.3

2010-11-12 Thread Sunny Jaisinghani
Hi,

 

Can someone describe steps to upgrade nagios 1.4.1 to nagios 3.2.3.

Currently i am monitoring over 500 hosts and 3000 services using nagios
1.4.1

I am also using nsca and ocsp for nagios failover.

 

Nagios 1.4.1 is installed using rpms

 

nagios-plugins-1.3.1-10.rhel3.dag

nagios-plugins-nrpe-2.0-3.rhel3.dag

nagios-nrpe-2.0-3.rhel3.dag

nagios-nsca-2.4-2.rhel3.dag

nagios-1.4.1-1.el3.rf

 

I wish to install the newer version using rpm. These are the rpms i
found

 

nagios-3.2.3-2.el5.rf.i386.rpm

nagios-devel-3.2.3-2.el5.rf.i386.rpm

 

Thanks

Sunny

 

 

 

 

 

--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] check_by_ssh help

2010-11-12 Thread Brian O'Mahony
I have an appliance that I cannot install anything on. However I can connect 
via ssh. I can run remote commands with check_by_ssh, eg

check_by_ssh -H hmc -C lsled -m ratbat -r sa -t phys -l hscroot -s 
/home/nagios/.ssh/id_rsa

This checks my HMC for the status of the attention LED on an AIX box.

However, whatever the option is it comes back as OK in nagios. Is there a way 
to use check_by_ssh to check for a specific word?

Also, the list active/deactive systems command puts each system on a separate 
line, which I would rather all be in one continuous line.

It is not possible to write anything to the remote machine.

Regards

B


The information in this email is confidential and may be legally privileged.
It is intended solely for the addressee. Access to this email by anyone else
is unauthorized. If you are not the intended recipient, any disclosure,
copying, distribution or any action taken or omitted to be taken in reliance
on it, is prohibited and may be unlawful. If you are not the intended
addressee please contact the sender and dispose of this e-mail. Thank you.
--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Notifications on passive service checks

2010-11-12 Thread Andreas Ericsson
On 11/11/2010 11:27 PM, Hall, JC wrote:
 Is it accurate that Nagios will only send 1 notification for a
 passive service check?
 
 IE, the notification_interval definition for a passively checked
 service won't instruct Nagios to re-send a notification such as with
 actively checked services?
 

To be honest, I haven't got the faintest idea. An educated guess is
that it will re-send the notification if it receives another passive
check-result and enough time has passed though, or that it simply
re-sends the notification when enough time has passed.

If you try and find out, let me know either way and I'll amend the
docs.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] upgrade nagios 1.4.1 to nagios 3.2.3

2010-11-12 Thread Andreas Ericsson
On 11/12/2010 10:02 AM, Sunny Jaisinghani wrote:
 Hi,
 
 
 
 Can someone describe steps to upgrade nagios 1.4.1 to nagios 3.2.3.
 
 Currently i am monitoring over 500 hosts and 3000 services using nagios
 1.4.1
 
 I am also using nsca and ocsp for nagios failover.
 
 
 
 Nagios 1.4.1 is installed using rpms
 
 
 
 nagios-plugins-1.3.1-10.rhel3.dag
 
 nagios-plugins-nrpe-2.0-3.rhel3.dag
 
 nagios-nrpe-2.0-3.rhel3.dag
 
 nagios-nsca-2.4-2.rhel3.dag
 
 nagios-1.4.1-1.el3.rf
 
 
 
 I wish to install the newer version using rpm. These are the rpms i
 found
 
 
 
 nagios-3.2.3-2.el5.rf.i386.rpm
 
 nagios-devel-3.2.3-2.el5.rf.i386.rpm
 

RPM packages quite often contain trigger-scripts that handle
everything automagically. That might not be the case if you go
from rhel3 to rhel5 packages though.

Make backups before you try anything.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] debugging eventhandler via nrpe

2010-11-12 Thread Assaf Flatto
  Hello all

I am trying to implement an event handler on a remote machine , and 
having a problem with the way the status arguments are transferred over 
the NRPE channel .

My config is as such :

define service{
snip
  max_check_attempts  3
  event_handler   
check_nrpe_nonssl!solr-restart!$SERVICESTATE$ $SERVICESTATETYPE$ 
$SERVICEATTEMPT$!-t 200

snip
}

On the remote server the nrpe was compiled with --allow-command-args  
,and in  the nrpe.cfg i have the following parameters ?

dont_blame_nrpe=1
debug=1
command_timeout=300
# Event Handler
command[solr-restart]=/usr/local/nagios/libexec/eventhandlers/restart-solr.sh


the event handler script is :
#!/bin/bash
#
# Event handler script for restarting the web server on the local machine
#
# Note: This script will only restart the web server if the service is
# retried 3 times (in a soft state) or if the web service somehow
# manages to fall into a hard error state.
#

case $1 in
OK)
  # The service just came back up, so don't do anything...
  ;;
WARNING)
  ;;
UNKNOWN)
  ;;
CRITICAL)
  case $2 in
  SOFT)

  case $3 in
  2)
  echo Too early - not restarting yet
  exit 0
  ;;
 esac

  case $3 in
  3)
  echo Too early - not restarting yet
  exit 0
  ;;
  esac
  ;;
  HARD)
cd /usr/local/nagios/libexec/eventhandlers/
curl -s -v -u *:*** --request PUT -d @solr7down.xml http://LB
sleep 5
sudo /etc/init.d/jboss stop

sleep 60
sudo /etc/init.d/jboss start
sleep 15
curl -s -v -u *:** --request PUT -d @solr7up.xml http://LB
sleep 3
echo  Event handler restarted the solr service
  ;;
  esac
  ;;
esac
echo Event handler restarted the solr service
exit 0

I can see in the syslog that the script is initiated :
Nov 11 18:15:44 gbc1-solr-07 nrpe[29687]: Host address is in allowed_hosts
Nov 11 18:15:44 gbc1-solr-07 nrpe[29687]: Handling the connection...
Nov 11 18:15:44 gbc1-solr-07 nrpe[29687]: Host is asking for command 
'solr-restart' to be run...
Nov 11 18:15:44 gbc1-solr-07 nrpe[29687]: Running command: 
/usr/local/nagios/libexec/eventhandlers/restart-solr.sh
Nov 11 18:15:44 gbc1-solr-07 nrpe[29687]: Command completed with return 
code 0 and output:

But the event handler is not preforming the tasks it is meant to .


when i initiate the command manually

~/libexec/check_nrpe -H host -n -c solr-restart -a  Critical HARD 3 
-t 200
it is running fine .
I know it is an issue with the transfer of the arguments to the nrpe - 
but i am missing something to make sure they are parsed and sent properly .

Anyone can point me to the sign in front of me i am blindingly missing ?

Thanks


-- 
Never,Ever Cut A Deal With a Dragon


--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Notifications on passive service checks

2010-11-12 Thread Assaf Flatto
  Actually , the fact that the check is passive or active is some what 
irrelevant in this context.

The place that will determine the notification policy is you central 
nagios , and what you defined in your service/host notifications directive .
if you also define an escalation for that service - that will kick off 
the notifications .




On 12/11/10 12:00, Andreas Ericsson wrote:
 On 11/11/2010 11:27 PM, Hall, JC wrote:
 Is it accurate that Nagios will only send 1 notification for a
 passive service check?

 IE, the notification_interval definition for a passively checked
 service won't instruct Nagios to re-send a notification such as with
 actively checked services?

 To be honest, I haven't got the faintest idea. An educated guess is
 that it will re-send the notification if it receives another passive
 check-result and enough time has passed though, or that it simply
 re-sends the notification when enough time has passed.

 If you try and find out, let me know either way and I'll amend the
 docs.



-- 
Never,Ever Cut A Deal With a Dragon


Next year I will be doing the London to Paris bike ride to
raise money for the DogTrust (www.dogstrust.co.uk) .
Please Sponsor me at http://www.justgiving.com/Assaf-Flatto


--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Suppress Max concurrent service checks messages.

2010-11-12 Thread Paul M. Dubuc
We're running Nagios 3.2.3 with concurrent service checks set to 40.  We can't 
go much higher than this due to resource constraints outside of Nagios but 
we're running 329 services at 5 minute intervals (this is a load test of 
sorts not production load ... yet).  Average execution time/latency is 36/11 
seconds so we're seeing quite a few messages like this in the Nagios log file:

(Informational Message) [11-11-2010 14:55:57] Max concurrent service checks 
(40) has been reached. Nudging host:service by 9 seconds...

Is there any way to suppress these messages from being logged?  I don't see an 
option for logging these in the config file documentation.

Thanks,
Paul Dubuc

--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Suppress Max concurrent service checks messages.

2010-11-12 Thread Andreas Ericsson
On 11/12/2010 04:30 PM, Paul M. Dubuc wrote:
 We're running Nagios 3.2.3 with concurrent service checks set to 40.  We can't
 go much higher than this due to resource constraints outside of Nagios but
 we're running 329 services at 5 minute intervals (this is a load test of
 sorts not production load ... yet).  Average execution time/latency is 36/11
 seconds so we're seeing quite a few messages like this in the Nagios log file:
 

If you're doing a load test on a system that clearly doesn't handle
production load and thus forces you to run with less than optimal settings,
you're doing things wrong.

 (Informational Message) [11-11-2010 14:55:57] Max concurrent service checks
 (40) has been reached. Nudginghost:service  by 9 seconds...
 
 Is there any way to suppress these messages from being logged?  I don't see an
 option for logging these in the config file documentation.
 

Not really, no. See my previous comment though. It's equally valid now,
even though about 12 seconds have passed since I wrote it.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Suppress Max concurrent service checks messages.

2010-11-12 Thread Ton Voon

On 12 Nov 2010, at 15:30, Paul M. Dubuc wrote:

 We're running Nagios 3.2.3 with concurrent service checks set to  
 40.  We can't
 go much higher than this due to resource constraints outside of  
 Nagios but
 we're running 329 services at 5 minute intervals (this is a load  
 test of
 sorts not production load ... yet).  Average execution time/latency  
 is 36/11
 seconds so we're seeing quite a few messages like this in the Nagios  
 log file:

 (Informational Message) [11-11-2010 14:55:57] Max concurrent service  
 checks
 (40) has been reached. Nudging host:service by 9 seconds...

 Is there any way to suppress these messages from being logged?  I  
 don't see an
 option for logging these in the config file documentation.

I put those messages in.

Firstly, 40 doesn't necessarily mean there are 40 concurrent service  
checks running as they may have finished but not been reaped yet (to  
decrement the counter).

Secondly, if you are getting these messages, then either (1) this  
limit is too low - increase and keep an eye of the load on your nagios  
server; (2) you've got too many checks running - reduce frequencies/ 
numbers or setup a slave server.

The trouble with the way the nudging works is that it hides the fact  
that you have latency issues (as the check is rescheduled to a future  
time). This means nagiostats will not include the additional latency  
time here.

If someone has a better way of working this out, I'm all ears.

Ton


--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] check_http requests - SSL Not available

2010-11-12 Thread Kevin Davison
I've been digging at this one for a while and not getting any closer to 
resolution.

Has anyone else encountered problems with getting check_http to  work with the 
-S/--ssl option?

Nagios 3.2.1 running on a CentOS box.
I have openssl and openssl-devel installed

Package openssl-0.9.8e-12.el5_4.6.x86_64 already installed and latest version
Package openssl-0.9.8e-12.el5_4.6.i686 already installed and latest version
Nothing to do

Package openssl-devel-0.9.8e-12.el5_4.6.x86_64 already installed and latest 
version
Package openssl-devel-0.9.8e-12.el5_4.6.i386 already installed and latest 
version
Nothing to do

I recompiled my plugins and got confirmation of openssl
--with-apt-get-command:
   --with-ping6-command: /bin/ping6 -n -U -w %d -c %d %s
   --with-ping-command: /bin/ping -n -U -w %d -c %d %s
   --with-ipv6: yes
   --with-mysql: no
   --with-openssl: yes
   --with-gnutls: no
   --enable-extra-opts: no
   --with-perl: /usr/bin/perl
   --enable-perl-modules: no
   --with-cgiurl: /nagios/cgi-bin
   --with-trusted-path: /bin:/sbin:/usr/bin:/usr/sbin
   --enable-libtap: no

For some reason, after this, I'm still not able to get a response to -ssl 
checks using check_http.

[r...@nagios nagios-plugins-1.4.14]# /usr/local/nagios/libexec/check_http --ssl 
-H siteaddress
check_http: Invalid option - SSL is not available



Kevin Davison
Network Administrator
Innosphere SDG Ltd.
147 Wyndham St. N., Ste 306
Guelph, ON, N1H 4E9
(519) 766-9726 X223
Email: kdavi...@innosphere.camailto:kdavi...@innosphere.ca
Website: www.innosphere.ca

--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] check_http requests - SSL Not available

2010-11-12 Thread Assaf Flatto

 On 12/11/10 17:15, Kevin Davison wrote:


I've been digging at this one for a while and not getting any closer 
to resolution.


Has anyone else encountered problems with getting check_http to  work 
with the --S/--ssl option?


Nagios 3.2.1 running on a CentOS box.

I have openssl and openssl-devel installed

Package openssl-0.9.8e-12.el5_4.6.x86_64 already installed and latest 
version


Package openssl-0.9.8e-12.el5_4.6.i686 already installed and latest 
version


Nothing to do

Package openssl-devel-0.9.8e-12.el5_4.6.x86_64 already installed and 
latest version


Package openssl-devel-0.9.8e-12.el5_4.6.i386 already installed and 
latest version


Nothing to do

I recompiled my plugins and got confirmation of openssl

--with-apt-get-command:

   --with-ping6-command: /bin/ping6 -n -U -w %d -c %d %s

   --with-ping-command: /bin/ping -n -U -w %d -c %d %s

   --with-ipv6: yes

   --with-mysql: no

   --with-openssl: yes

   --with-gnutls: no

   --enable-extra-opts: no

   --with-perl: /usr/bin/perl

   --enable-perl-modules: no

   --with-cgiurl: /nagios/cgi-bin

   --with-trusted-path: /bin:/sbin:/usr/bin:/usr/sbin

   --enable-libtap: no

For some reason, after this, I'm still not able to get a response to 
--ssl checks using check_http.


[r...@nagios nagios-plugins-1.4.14]# 
/usr/local/nagios/libexec/check_http --ssl -H siteaddress


check_http: Invalid option - SSL is not available

Kevin Davison

Network Administrator



I am able to run the check_http  with the -S option

 ~/libexec/check_http -H 10.0.X.XX  -S
HTTP WARNING: HTTP/1.1 401 Authorization Required - 1726 bytes in 0.036 
second response time |time=0.035854s;;;0.00 size=1726B;;;0


openssl might be installed , but do you have the devel package - it will 
be required for the compilation to work .



--
Never,Ever Cut A Deal With a Dragon


Next year I will be doing the London to Paris bike ride to
raise money for the DogTrust (www.dogstrust.co.uk) .
Please Sponsor me at http://www.justgiving.com/Assaf-Flatto

--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Suppress Max concurrent service checks messages.

2010-11-12 Thread Paul M. Dubuc
Ton Voon wrote:

 On 12 Nov 2010, at 15:30, Paul M. Dubuc wrote:

 We're running Nagios 3.2.3 with concurrent service checks set to
 40.  We can't
 go much higher than this due to resource constraints outside of
 Nagios but
 we're running 329 services at 5 minute intervals (this is a load
 test of
 sorts not production load ... yet).  Average execution time/latency
 is 36/11
 seconds so we're seeing quite a few messages like this in the Nagios
 log file:

 (Informational Message) [11-11-2010 14:55:57] Max concurrent service
 checks
 (40) has been reached. Nudginghost:service  by 9 seconds...

 Is there any way to suppress these messages from being logged?  I
 don't see an
 option for logging these in the config file documentation.

 I put those messages in.

 Firstly, 40 doesn't necessarily mean there are 40 concurrent service
 checks running as they may have finished but not been reaped yet (to
 decrement the counter).

 Secondly, if you are getting these messages, then either (1) this
 limit is too low - increase and keep an eye of the load on your nagios
 server; (2) you've got too many checks running - reduce frequencies/
 numbers or setup a slave server.

 The trouble with the way the nudging works is that it hides the fact
 that you have latency issues (as the check is rescheduled to a future
 time). This means nagiostats will not include the additional latency
 time here.

 If someone has a better way of working this out, I'm all ears.

 Ton

Thanks, Ton.  This is helpful information and advice.  The services we're 
running require web browsers to run which are a cpu and memory intensive 
resource that, temporarily, we need to manage on the Nagios server.  In 
production we shouldn't have these limitations, but for now I just wanted to 
keep all these messages from flooding the log.

Andreas, I know it's doing things wrong, but there's not much I can do about 
it right now.  Since I know what the problem is that these messages are trying 
to tell me.  I'd just like to keep them from flooding the logs so I can see 
what else is happening more easily.  That's all.

Thanks,
Paul Dubuc

--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Suppress Max concurrent service checks messages.

2010-11-12 Thread Paul M. Dubuc
Ton Voon wrote:
...

 The trouble with the way the nudging works is that it hides the fact
 that you have latency issues (as the check is rescheduled to a future
 time). This means nagiostats will not include the additional latency
 time here.

 If someone has a better way of working this out, I'm all ears.

Would it cause other problems if the total nudging time for a service were 
included in its latency time?

--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] different notification_intervals by contact

2010-11-12 Thread Frost, Mark {PBC}

From: Duncan Berriman [mailto:dun...@dcl.co.uk]
Sent: Wednesday, November 10, 2010 1:00 PM
To: 'Nagios Users List'
Subject: Re: [Nagios-users] different notification_intervals by contact

Escalations are a little pesky to get working correctly.

Here is an example.

...

Thanks, Duncan.

I've decided to take a somewhat different approach.  Ultimately, what they want 
is for the pager to occur at 4x the frequency of the e-mail (15 minutes versus 
1 hour).

So this doesn't wind up being all that hard if I make a contact that calls a 
simple shell script.  That shell script then looks at the NOTIFICATIONNUMBER to 
(in this case) determine if it's a multiple of 4 and if so, sends the alert.  
In fact, I'm going to make this so that's going to take an argument to 
determine what number to perform 'modulo' on.  So in theory this could be 
reused if someone wanted to have something run every other notification number, 
every 6th, etc, indefinitely.

The downside as I see it is that Nagios won't quite have an accurate 
representation of who got what notifications.  From Nagios' perspective, it 
sent an alert to the mailing list, but really, the script acts as a gateway to 
determine if a message was actually sent.  So the Notifications for the 
host/service as shown in the UI will not be quite correct.  But I think they 
can live with that.

Mark
--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] check_http requests - SSL Not available

2010-11-12 Thread Kevin Davison
Yes. openssl-devel is installed.

Package openssl-devel-0.9.8e-12.el5_4.6.x86_64 already installed and latest 
version
Package openssl-devel-0.9.8e-12.el5_4.6.i386 already installed and latest 
version



From: Assaf Flatto [mailto:nag...@flatto.net]
Sent: November-12-10 12:27 PM
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] check_http requests - SSL Not available

On 12/11/10 17:15, Kevin Davison wrote:
I've been digging at this one for a while and not getting any closer to 
resolution.

Has anyone else encountered problems with getting check_http to  work with the 
-S/--ssl option?

Nagios 3.2.1 running on a CentOS box.
I have openssl and openssl-devel installed


Package openssl-0.9.8e-12.el5_4.6.x86_64 already installed and latest version
Package openssl-0.9.8e-12.el5_4.6.i686 already installed and latest version
Nothing to do

Package openssl-devel-0.9.8e-12.el5_4.6.x86_64 already installed and latest 
version
Package openssl-devel-0.9.8e-12.el5_4.6.i386 already installed and latest 
version
Nothing to do

I recompiled my plugins and got confirmation of openssl
--with-apt-get-command:
   --with-ping6-command: /bin/ping6 -n -U -w %d -c %d %s
   --with-ping-command: /bin/ping -n -U -w %d -c %d %s
   --with-ipv6: yes
   --with-mysql: no
   --with-openssl: yes
   --with-gnutls: no
   --enable-extra-opts: no
   --with-perl: /usr/bin/perl
   --enable-perl-modules: no
   --with-cgiurl: /nagios/cgi-bin
   --with-trusted-path: /bin:/sbin:/usr/bin:/usr/sbin
   --enable-libtap: no

For some reason, after this, I'm still not able to get a response to -ssl 
checks using check_http.

[r...@nagios nagios-plugins-1.4.14]# /usr/local/nagios/libexec/check_http --ssl 
-H siteaddress
check_http: Invalid option - SSL is not available



Kevin Davison
Network Administrator

I am able to run the check_http  with the -S option

 ~/libexec/check_http -H 10.0.X.XX  -S
HTTP WARNING: HTTP/1.1 401 Authorization Required - 1726 bytes in 0.036 second 
response time |time=0.035854s;;;0.00 size=1726B;;;0

openssl might be installed , but do you have the devel package - it will be 
required for the compilation to work .




--

Never,Ever Cut A Deal With a Dragon





Next year I will be doing the London to Paris bike ride to

raise money for the DogTrust (www.dogstrust.co.ukhttp://www.dogstrust.co.uk) .

Please Sponsor me at http://www.justgiving.com/Assaf-Flatto
--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] check_openmanage -- question about battery check

2010-11-12 Thread Joel Brooks
hey gang,

i'm using check_openmanage (
http://folk.uio.no/trondham/software/check_openmanage.html) to monitor some
dell servers.

i'm using version 3.5.6 in production, but have also tested with 3.6.1
(latest, released Nov. 2, 2010).

the problem i'm having is that the check is reporting battery charging
WARNINGS even though I'm blacklisting that check.

===
r...@nagios:/opt/plugins# perl ./check_openmanage-3.6.1 -H server1 -C public
-e -s -i -b bat_charge
WARNING: [ABCDEFG] Cache Battery 0 in controller 0 is Learning (Active)
[probably harmless]
WARNING: [ABCDEFG] Cache Battery 0 in controller 1 is Learning (Active)
[probably harmless]
-- SYSTEM: PowerEdge R710, SN: ABCDEFG
r...@nagios:/opt/plugins#
==

is there a way to ignore these that works?  am i doing something wrong?

thanks,

J
--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] check_openmanage -- question about battery check

2010-11-12 Thread C. Bensend

 the problem i'm having is that the check is reporting battery charging
 WARNINGS even though I'm blacklisting that check.

 ===
 r...@nagios:/opt/plugins# perl ./check_openmanage-3.6.1 -H server1 -C
 public
 -e -s -i -b bat_charge

No, you're not...  Not quite, anyway.  :)  Re-visit the
documentation for blacklisting - you need to specify *which*
battery you're blacklisting.  This is the case for all
blacklist directives.

Hint:  I use '-b bat_charge=ALL' in my service definition.

Benny


-- 
No matter how many shorts we have in the system, my guards will
be instructed to treat every surveillance camera malfunction as a
full-scale emergency.
   -- Peter Anspach's Evil Overlord List, #67



--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Suppress Max concurrent service checks messages.

2010-11-12 Thread Andreas Ericsson
On 11/12/2010 06:03 PM, Ton Voon wrote:
 
 On 12 Nov 2010, at 15:30, Paul M. Dubuc wrote:
 
 We're running Nagios 3.2.3 with concurrent service checks set to
 40.  We can't
 go much higher than this due to resource constraints outside of
 Nagios but
 we're running 329 services at 5 minute intervals (this is a load
 test of
 sorts not production load ... yet).  Average execution time/latency
 is 36/11
 seconds so we're seeing quite a few messages like this in the Nagios
 log file:

 (Informational Message) [11-11-2010 14:55:57] Max concurrent service
 checks
 (40) has been reached. Nudginghost:service  by 9 seconds...

 Is there any way to suppress these messages from being logged?  I
 don't see an
 option for logging these in the config file documentation.
 
 I put those messages in.
 
 Firstly, 40 doesn't necessarily mean there are 40 concurrent service
 checks running as they may have finished but not been reaped yet (to
 decrement the counter).
 
 Secondly, if you are getting these messages, then either (1) this
 limit is too low - increase and keep an eye of the load on your nagios
 server; (2) you've got too many checks running - reduce frequencies/
 numbers or setup a slave server.
 
 The trouble with the way the nudging works is that it hides the fact
 that you have latency issues (as the check is rescheduled to a future
 time). This means nagiostats will not include the additional latency
 time here.
 
 If someone has a better way of working this out, I'm all ears.
 

We could use something like pnp4nagios does, and issue a check to make
sure load is below a certain threshold before firing off new checks.
There's a (reasonably) portable way of getting the number of online
CPU's, so we could even make an educated guess at how many checks we
can run to saturate the CPU's while still not running too many checks.

Ofcourse, some checks are more heavy-duty than others. As a first stab
at maintaining reasonable load, we should probably ignore that. At a
later point, we might want to introduce probably load increase of
running this check and nudge checks into the future when we're in
danger of load / num_cpus  0.9 or some other suitable number.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Suppress Max concurrent service checks messages.

2010-11-12 Thread Andreas Ericsson
On 11/12/2010 06:40 PM, Paul M. Dubuc wrote:
 
 Andreas, I know it's doing things wrong, but there's not much I can do about
 it right now.  Since I know what the problem is that these messages are trying
 to tell me.  I'd just like to keep them from flooding the logs so I can see
 what else is happening more easily.  That's all.
 

You could always run Nagios in the foreground and redirect the log through a
grep -v filter, restarting it on midnight every night and rotating logs
manually. It's not difficult. Just cumbersome.

So long as you're aware that whatever you conclude from your tests will be
more than just a little off wrt what you wanted to determine, you'll almost
certainly do alright though.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
Centralized Desktop Delivery: Dell and VMware Reference Architecture
Simplifying enterprise desktop deployment and management using
Dell EqualLogic storage and VMware View: A highly scalable, end-to-end
client virtualization framework. Read more!
http://p.sf.net/sfu/dell-eql-dev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null