[Nagios-users] Dynamic host, hostgroup and service addition/subtraction

2009-12-23 Thread Lee Azzarello
I have an application that requires new cluster nodes be created and
destroyed to handle load spikes.  I would like this additional
capacity to be represented in Nagios during its lifetime. Currently
my installation requires manually adding host definitions and
dependent hostgroup and service configurations. Is there a system in
Nagios 3 to change the host definitions without editing a
configuration file, redeploying the configs and restarting the nagios
service?

-- 
___
Lee Azzarello
drop.io staff hacker

--
This SF.Net email is sponsored by the Verizon Developer Community
Take advantage of Verizon's best-in-class app development support
A streamlined, 14 day to market process makes app distribution fast and easy
Join now and get one step closer to millions of Verizon customers
http://p.sf.net/sfu/verizon-dev2dev 
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Layer 2 connectivity awareness?

2009-03-11 Thread Lee Azzarello
If your switch can do SNMP you can monitor the switchport the trunk is
connected to.

-lee

On Wed, Mar 11, 2009 at 11:37 AM, Victor Lee duren...@gmail.com wrote:
 In the course of my learning (just starting out) how to work with Nagios I
 have found that it isn't aware of layer 2 connections. In other words, if I
 have 2 switches with a trunk between them Nagios doesn't seem to provide me
 a way to either note that manually on the map nor does it seem to be aware
 of that type of connectivity. The same applies to servers, there doesn't
 seem to be a way to identify which switch a server is connected to. Now I
 admit this might be due to my limited knowledge of the product so I ask the
 members here. Am I correct in this assessment?



Re: [Nagios-users] Layer 2 connectivity awareness?

2009-03-11 Thread Lee Azzarello
On Wed, Mar 11, 2009 at 1:07 PM, RijilV rij...@riji.lv wrote:
 2009/3/11 Victor Lee duren...@gmail.com

 In the course of my learning (just starting out) how to work with Nagios I
 have found that it isn't aware of layer 2 connections. In other words, if I
 have 2 switches with a trunk between them Nagios doesn't seem to provide me
 a way to either note that manually on the map nor does it seem to be aware
 of that type of connectivity. The same applies to servers, there doesn't
 seem to be a way to identify which switch a server is connected to. Now I
 admit this might be due to my limited knowledge of the product so I ask the
 members here. Am I correct in this assessment?




 I think most people give up on the nagios map pretty quickly, or at the most
 set up a few parent/child hosts and leave it at that.

Agreed. Parent-child relationships are useful for getting a bigger
picture of points of failure. For example, if a web proxy has 5 app
servers behind it, 6 hosts will go down if that proxy fails.
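
As a rough illustration of that arithmetic, here is a minimal Python sketch. The host names and the PARENTS mapping are hypothetical. In Nagios itself the relationship is declared with the "parents" directive in each host definition, and hosts behind a failed parent are reported as UNREACHABLE rather than DOWN.

```python
# Hypothetical topology: five app servers reachable only through one proxy.
PARENTS = {"proxy": [], **{f"app{i}": ["proxy"] for i in range(1, 6)}}


def affected_hosts(failed, parents):
    """Return the failed host plus every host whose parents are all affected."""
    affected = {failed}
    changed = True
    while changed:
        changed = False
        for host, host_parents in parents.items():
            if host in affected or not host_parents:
                continue
            if all(parent in affected for parent in host_parents):
                affected.add(host)
                changed = True
    return affected


print(len(affected_hosts("proxy", PARENTS)))  # 6
```

Losing a single app server affects only that server, which is the asymmetry the parent definitions are meant to capture.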

-lee



Re: [Nagios-users] Bulk Disable of Event Handlers?

2009-03-06 Thread Lee Azzarello
Sorry for the misinformation. I would expect that if a service is
scheduled to be down, a script which is designed to make sure it's
still up would be disabled for that period. Or is it that event
handlers are too generic and there could be a situation where you
would want event handlers to continue for a down host?
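
For the command line script asked about in the original question, one possible approach is the external command interface: Nagios 3 documents per-service DISABLE_SVC_EVENT_HANDLER and ENABLE_SVC_EVENT_HANDLER commands. The Python sketch below only formats those command lines. The host and service names are hypothetical, and the list of group members is assumed to be supplied by hand.

```python
import time


def event_handler_commands(services, enable=False, now=None):
    """Format one external command line per (host, service) pair."""
    verb = "ENABLE" if enable else "DISABLE"
    stamp = int(time.time() if now is None else now)
    return [
        f"[{stamp}] {verb}_SVC_EVENT_HANDLER;{host};{service}"
        for host, service in services
    ]


# Hypothetical members of a web service group.
web_services = [("web1", "HTTP"), ("web2", "HTTP")]
for line in event_handler_commands(web_services, now=1236297600):
    print(line)
```

Each line would then be appended, with a trailing newline, to the file named by the command_file setting in nagios.cfg. The location of that file varies between installations.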

-lee

On Thu, Mar 5, 2009 at 6:32 PM, Kyle O'Donnell kyleodonn...@gmail.com wrote:
 downtime does not stop event handlers.

 On Thu, Mar 5, 2009 at 6:18 PM, Jonah Horowitz jhorow...@looksmart.net
 wrote:

 Does scheduled downtime stop event handlers?  I didn't think it did.

 Jonah

 On 3/5/09 2:03 PM, Lee Azzarello l...@dropio.com wrote:

  You can put the services in a service group and schedule downtime for
  all services in that group.
 
  -lee
 
  On Thu, Mar 5, 2009 at 4:23 PM, Jonah Horowitz jhorow...@looksmart.net
  wrote:
  I finally took the plunge and set up event handlers on a set of our
  webservers.  It's great because it restarts the web service automatically
  when there is an issue.  The problem is, we can't take them down for
  maintenance without Nagios restarting them.  I know I can disable the event
  handlers across the entire nagios system via the "Tactical" view.  Is there
  any way to do it across a service group through the web interface?
  
  Either through the web or through a command line script would be great.
  
  If not, I'll probably write/post my own.
 
  Thanks,
  --
  Jonah Horowitz · Monitoring Manager · jhorow...@looksmart.net
  W: 415-348-7694 · F: 415-348-7033 · M: 415-513-7202
  LookSmart - Premium and Performance Advertising Solutions
  625 Second Street, San Francisco, CA 94107
 
 
 


 --
 Jonah Horowitz · Monitoring Manager · jhorow...@looksmart.net
 W: 415-348-7694 · F: 415-348-7033 · M: 415-513-7202
 LookSmart - Premium and Performance Advertising Solutions
 625 Second Street, San Francisco, CA 94107





Re: [Nagios-users] Nagios/NRPE relationship

2009-03-05 Thread Lee Azzarello
It helps me to build up an NRPE service check by hand. First, execute
the plugin manually on the remote host and confirm it works. Next, copy
that command line into the NRPE configuration. Then disconnect from
that host, connect to the host where Nagios is running, and manually
execute the check_nrpe plugin with the IP/hostname of the remote host
to confirm remote execution works. Finally, go into the service
configuration and lock it down. I prefer not to pass arguments via
NRPE, so the service check is always just the name of the NRPE command
defined on the remote host.
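
That last point can be checked mechanically. Here is a minimal Python sketch, assuming the command[name]=... syntax from the nrpe.cfg quoted below and a check_command of the form check_nrpe!name. The helper names and the sample strings are illustrative only, and the pattern only recognises the lowercase "command[" form.

```python
import re

# Matches nrpe.cfg lines such as:
#   command[check_users]=/usr/local/nagios/libexec/check_users -w 5 -c 10
COMMAND_RE = re.compile(r"^command\[(?P<name>[^\]]+)\]=(?P<line>.+)$")


def nrpe_commands(cfg_text):
    """Map each NRPE command name to the command line it runs."""
    commands = {}
    for raw in cfg_text.splitlines():
        match = COMMAND_RE.match(raw.strip())
        if match:
            commands[match.group("name")] = match.group("line")
    return commands


def remote_command(check_command):
    """Return the NRPE command name from 'check_nrpe!name', else None."""
    plugin, _, name = check_command.partition("!")
    return name if plugin == "check_nrpe" and name else None


SAMPLE_CFG = """
command[check_users]=/usr/local/nagios/libexec/check_users -w 5 -c 10
command[check_smtp]=/usr/local/nagios/libexec/check_smtp -H localhost
"""

defined = nrpe_commands(SAMPLE_CFG)
for check in ("check_nrpe!check_smtp", "check_nrpe!check_ssh"):
    name = remote_command(check)
    status = "defined" if name in defined else "missing on remote host"
    print(check, "->", status)
```

In the question quoted below, the remote file would define check_smtp while the service calls check_nrpe!check_ssh, which is the kind of mismatch this comparison flags.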

There is also SNMP, though that's a real can of worms if you haven't
worked with Net-SNMP before.

-lee

On Wed, Mar 4, 2009 at 6:26 PM, Martyn mar...@chetnet.co.uk wrote:
 Hi all, hope you do not think this is a stupid question; however I have to
 ask it anyway for a better understanding.

 I have my Nagios Server with NRPE plugins installed and I have a Linux
 Remote Server that I want to monitor, I know that you need NRPE on the
 remote Linux box but after that it goes a bit glazed, this is the bit I need
 to understand.

 Here is the standard configuration NRPE installed on my remote Linux box:

 command[check_users]=/usr/local/nagios/libexec/check_users -w 5 -c 10
 command[check_load]=/usr/local/nagios/libexec/check_load -w 15,10,5 -c 30,25,20
 command[check_disk1]=/usr/local/nagios/libexec/check_disk -w 20 -c 10 -p /dev/sda1
 command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 -s Z
 command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 150 -c 200

 So my question is, if I need to monitor SMTP on the remote box I would have
 to ask the remote administrator to add the following command to their
 nrpe.cfg:

 Command[check_smtp]=/usr/local/nagios/libexec/check_smtp -H localhost

 Do they have to use the -H localhost directive or is there something
 different they can use, the localhost just does not seem right looking at
 the other examples above.

 Then on my Nagios Server, under the .cfg box I want to monitor I define the
 command:

 define service{
     use  generic-service
     host_name   linux-server
     service_description   SSH
     check_command  check_nrpe!check_ssh
     }

 Hope it's not such a stupid question that it does not warrant a reply.

 Rgs
 Martyn



Re: [Nagios-users] Bulk Disable of Event Handlers?

2009-03-05 Thread Lee Azzarello
You can put the services in a service group and schedule downtime for
all services in that group.

-lee

On Thu, Mar 5, 2009 at 4:23 PM, Jonah Horowitz jhorow...@looksmart.net wrote:
 I finally took the plunge and set up event handlers on a set of our
 webservers.  It’s great because it restarts the web service automatically
 when there is an issue.  The problem is, we can’t take them down for
 maintenance without Nagios restarting them.  I know I can disable the event
 handlers across the entire nagios system via the “Tactical” view.  Is there
 any way to do it across a service group through the web interface?

 Either through the web or through a command line script would be great.

 If not, I’ll probably write/post my own.

 Thanks,
 --
 Jonah Horowitz · Monitoring Manager · jhorow...@looksmart.net
 W: 415-348-7694 · F: 415-348-7033 · M: 415-513-7202
 LookSmart - Premium and Performance Advertising Solutions
 625 Second Street, San Francisco, CA 94107




Re: [Nagios-users] Email to Voice

2009-03-04 Thread Lee Azzarello
On Tue, Mar 3, 2009 at 5:57 PM, Martyn mar...@chetnet.co.uk wrote:
 Although I'm new and still setting up how I want to monitor my servers and
 routers, it got me thinking about texts that I receive whilst I'm in bed.
 I'm so used to getting a text statement from my bank that I now sleep
 through them, so text alerts are not enough to make me lift my head off my
 pillow if there was a fault.

 Does anybody know of a company that offers an email-to-voice service? This
 way I could get Nagios to send an email to a server, which in turn calls my
 mobile and plays me an alert of some kind.

Twilio.com has a REST API to make phone calls. You can do it all via a
script with an HTTP client library.
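
A minimal Python sketch of that idea, using only the standard library. The endpoint path and the To/From/Url form fields follow Twilio's later public REST documentation as I understand it, so treat them as assumptions and check the current docs. The account SID, token, phone numbers and TwiML URL are placeholders. The request is built but not sent.

```python
import base64
import urllib.parse
import urllib.request


def build_call_request(account_sid, auth_token, to_number, from_number, twiml_url):
    """Build (but do not send) a POST asking Twilio to place a call."""
    endpoint = (
        "https://api.twilio.com/2010-04-01/Accounts/"
        f"{account_sid}/Calls.json"
    )
    body = urllib.parse.urlencode(
        {"To": to_number, "From": from_number, "Url": twiml_url}
    ).encode("ascii")
    credentials = f"{account_sid}:{auth_token}".encode("ascii")
    request = urllib.request.Request(endpoint, data=body, method="POST")
    request.add_header(
        "Authorization",
        "Basic " + base64.b64encode(credentials).decode("ascii"),
    )
    return request


# Placeholder values only; real ones come from your own account.
req = build_call_request(
    "ACxxxxxxxx", "secret", "+15550100", "+15550101",
    "https://example.com/alert.xml",
)
print(req.get_method(), req.full_url)
# urllib.request.urlopen(req) would actually place the call.
```

A Nagios notification command could run a script like this in place of sending mail, with the TwiML URL pointing at whatever message should be read out.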

-lee



Re: [Nagios-users] Need to monitor Solaris 10 and am new to Nagios

2009-03-03 Thread Lee Azzarello
Open the web panel of Nagios and click on the documentation link.
It's the second entry in the general category in the left frame.

-lee

On Fri, Feb 27, 2009 at 12:08 AM, syed jafar syed_ja...@hotmail.com wrote:


 I am new to Nagios and would like to monitor Solaris 10 systems (CPU, Disk,
 ...etc). Nagios server/host is running Redhat LINUX. Could someone let me
 know step by step process what to do. I have been told that it's kinda
 difficult to monitor a Solaris 10 system. If this is not possible, is there a
 PERL or Shell script available to monitor the disk and CPU utilization.



 Thanks,

 Syed Jafar



Re: [Nagios-users] Service linked to hostgroup / notifications

2009-03-03 Thread Lee Azzarello
On Fri, Feb 27, 2009 at 3:20 PM, Neu, Timothy t...@sjm.com wrote:
 As I understand it, running NTP on a VM is a complete waste of
 resources, as there is no real per-VM clock to keep in sync. (i.e.,
 you're updating ether)  VMWare fakes a clock whenever a process tries
 to access one.

 I know there are some kernel boot parameters that can be added to
 improve time accuracy inside a VM.  Not sure if I found them on Redhat's
 site or VMWare's...   That and VMWare tools time synchronization is the
 best I've been able to come up with.

I'm running 40 hosts in Amazon EC2. They are Xen virtual machines and
all of their clocks are synced via NTP. I have never had a problem
with time shift or CPU performance.

-lee



Re: [Nagios-users] Problems with check_snmp add-on

2009-02-26 Thread Lee Azzarello
On Thu, Feb 26, 2009 at 11:02 AM, Eric Chatham echat...@broadvox.net wrote:
 Hello,



 I am trying to run the check_snmp add-on to retrieve the product name of an
 Adtran.  I have the community string and the MIBs loaded.  When I run the
 following command, I get “No data received from host.”  What am I doing
 wrong?  Thank you for your assistance.



 [root@ ~ libexec]# snmptranslate -m +ADTRAN-MIB -IR -On adProdName

 .1.3.6.1.4.1.664.3.1.1
 [r...@~  libexec]# ./check_snmp -H ip address -C string read-only -o
 .1.3.6.1.4.1.664.3.1.1

 SNMP problem - No data received from host

 CMD: /usr/local/bin/snmpget -t 1 -r 5 -m '' -v 1 [authpriv] ip address
 .1.3.6.1.4.1.664.3.1.1

What happens when you run snmpget manually? Does it return the value expected?

-lee



Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Lee Azzarello
On Thu, Feb 26, 2009 at 5:21 AM, Klaus Umbach
treibh...@sozial-inkompetent.de wrote:
 Hi,

 I have a problem with unstable lines and/or machines under stress.

 Sometimes they just don't reply on nrpe, but are still working, so I get a
 lot of false alarms. Is it possible, to tell check_nrpe to exit with a
 warning instead of critical?

I believe that a state of socket timeout will always be flagged as
critical, though you can increase the max_check_attempts so it won't
throw a notification so soon if you know of false positives.
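
To see why a higher max_check_attempts hides isolated blips, here is a simplified Python model. It is not Nagios's actual scheduler: it just counts consecutive non-OK results and treats the problem as hard (the point where notifications normally go out) once the count reaches max_check_attempts. The history list is made up.

```python
def first_hard_failure(results, max_check_attempts):
    """Return the index of the check that turns the problem hard, or None.

    results is a sequence of booleans, True meaning the check returned OK.
    """
    consecutive_failures = 0
    for index, ok in enumerate(results):
        if ok:
            consecutive_failures = 0  # a recovery resets the attempt counter
            continue
        consecutive_failures += 1
        if consecutive_failures >= max_check_attempts:
            return index
    return None


# A flaky line: two isolated timeouts, then a real three-check outage.
history = [True, False, True, True, False, True, False, False, False]
print(first_hard_failure(history, max_check_attempts=1))  # 1
print(first_hard_failure(history, max_check_attempts=3))  # 8
```

With one attempt the first blip already counts. With three, only the sustained outage at the end does.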

Of course figuring out the cause of your false positives is probably
the better solution.

-lee



Re: [Nagios-users] SELinux problem for SNMP

2009-02-26 Thread Lee Azzarello
I believe your question would be better served on a list related to SELinux.

In my experience SELinux is overkill for anything but the most
paranoid security situations. Without a complete understanding of your
entire security landscape, you'll just end up fighting with your own
systems because SELinux is protecting them from you.

-lee

On Thu, Feb 26, 2009 at 9:45 AM, Satish Patel sat...@linuxbug.org wrote:

 Hi,

 I have a RHEL 5 box with SELinux in enforcing mode. When my Nagios box
 uses SNMP to get the CPU load, it works fine, but when it tries to get
 disk and memory information, the check fails with no response. The same
 plugin works with all my other Linux clients (Ubuntu, Debian, Red Hat),
 so the problem is not the plugin. I found that the problem is related to
 SELinux, and I don't want to disable it. What other options are there,
 and how can I disable the SNMP policy in SELinux to make my Nagios happy?


 Regards,

 Satish Patel








Re: [Nagios-users] Procs Warning

2009-02-26 Thread Lee Azzarello
On Thu, Feb 26, 2009 at 12:14 PM, Martyn mar...@chetnet.co.uk wrote:
 Not being very familiar with Ubuntu and similar systems I want to try and
 find out how many processors to monitor, I just started to monitor a fresh
 install of Ubuntu server to see what results I get back, at the minute I'm
 getting a Warning telling me 153 processors.

Wow! That's quite some horsepower.

Unfortunately I think you might be confusing a processor as in a CPU
core and a process as in a running program. 153 processes is by no
means an insane number for a heavily loaded server. As in any
monitoring situation, you must take some time and define normal
before you can decide on the warning/critical thresholds for your
application.
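
The difference is easy to see on the box itself. A small Python sketch, assuming a Linux host where running processes show up as numeric directories under /proc (the function names are mine):

```python
import os


def cpu_count():
    """Number of logical CPUs the OS reports (the 'processors')."""
    return os.cpu_count() or 1


def process_count(proc_dir="/proc"):
    """Number of running processes, counted as numeric entries in /proc.

    Linux-specific assumption; returns None where /proc is not available.
    """
    if not os.path.isdir(proc_dir):
        return None
    return sum(1 for entry in os.listdir(proc_dir) if entry.isdigit())


print("CPUs:", cpu_count())
print("processes:", process_count())
```

Sampling the second number over a few days of normal operation gives a baseline from which to pick warning and critical thresholds.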

-lee



Re: [Nagios-users] Problems with check_snmp add-on

2009-02-26 Thread Lee Azzarello
On Thu, Feb 26, 2009 at 12:58 PM, Eric Chatham echat...@broadvox.net wrote:
 [root@ ~ libexec]# snmpget -v1 -c string read-only ip host 
 .1.3.6.1.4.1.664.3.1.1
 Timeout: No Response from ip host.
 [root@ ~ libexec]#

 I took an ethereal capture as well.  After looking at it, it looks like the 
 destination host may not be accepting SNMP traffic.  Is UDP 161 default for 
 SNMP?

 [root@ ~ libexec]# tethereal ip host ip host
 Running as user root and group root. This could be dangerous.
 Capturing on eth0
  0.00 source ip - ip host SNMP get-request 
 SNMPv2-SMI::enterprises.664.3.1.1
  0.049775 ip host - source ip ICMP Destination unreachable (Port 
 unreachable)
  1.000719 source ip - ip host SNMP get-request 
 SNMPv2-SMI::enterprises.664.3.1.1

I've never worked with this equipment before so I can't give any more advice.

Looks like it's time to dive into the Adtran's manual. Fun!

-lee



Re: [Nagios-users] Service linked to hostgroup / notifications

2009-02-26 Thread Lee Azzarello
On Thu, Feb 26, 2009 at 1:17 PM, David Fontaine dav.fonta...@gmail.com wrote:
 Hello,

 I am monitoring a few dozen servers and I'm really satisfied with nagios.
 This is a great tool.

 However, there's one (actually more than one) thing that I can't figure out:

 I define a service :

 define service{
     use generic-service
     hostgroup_name  linux-servers
     service_description Time
     check_command   check_time!public
     }

 This service applies to a host group so that I don't have to define the
 service for each server. The command doesn't matter but for completeness
 here's what it does : it compares the time on a remote server (snmp) to the
 local time and report the difference.

Going out on a limb here... have you considered using the Network Time
Protocol to keep all your hosts' clocks in sync with an Internet-scale
network of federated time servers available to you for free?

I'm using NTP on a network of 40 hosts and I have never worried about
their clocks getting out of sync.

-lee



Re: [Nagios-users] newbie question

2009-02-26 Thread Lee Azzarello
On Thu, Feb 26, 2009 at 3:08 PM, White, Richard L rlwh...@illinois.edu wrote:
 List,



 I just finished installing Nagios 2.10 on my RedHat Linux system.

Old version. Consider upgrading to version 3.


 I’m new at web stuff but after some digging was able to log into the
 application.



 I have not changed any of the default cfg files except for the contacts one
 listed in the quickstart directions and pointing the authentication to the
 correct password file.

 It looks like I should be monitoring my localhost, but I do not see hosts or
 services displayed in the viewer.

 Do they only show up when they are bad?



 On most screens I get the message similar to this one:



 “It appears as though you do not have permissions to view information for
 any of the hosts you requested…

 If you believe this is an error, check the HTTP server authentication
 requirements for access this CGI and check the authorization options in your
 CGI configuration file”



 Is this because the files it is looking for do not exist yet?

 Do I have a permissions problem?

Yes, that is most likely.

Start here:
http://nagios.sourceforge.net/docs/3_0/config.html

Documentation in this arena for your current version is not very
explicit. Consider upgrading.

-lee



Re: [Nagios-users] Problems with check_snmp add-on

2009-02-26 Thread Lee Azzarello
On Thu, Feb 26, 2009 at 4:55 PM, Eric Chatham echat...@broadvox.net wrote:
 I got the Adtran to work with snmpwalk and get.  I would like to monitor
 ifOperStatus.1, which is the status of interface 1.  I created this command
 and service:

 # 'check_snmp_interface' command definition
 define command {
        command_name    check_snmp_interface
        command_line    $USER1$/check_snmp -H $HOSTADDRESS$ -C $ARG1$ -o $ARG2$ -c $ARG3$
        }

 define service{
        host_name                       reach_media
        service_description             INT1
        check_command                   check_snmp_interface!ip host!com string ro!ifOperStatus.1!1:1
        max_check_attempts              3
        normal_check_interval           1
        check_period                    24x7
        }

 When I add the service into Nagios, I keep getting check_snmp: Invalid 
 critical threshold - ifOperStatus.1

Double check your argument order between the command and service definitions.
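
To see why the order matters, here is a rough Python sketch of how the
"!"-separated arguments get substituted into $ARG1$, $ARG2$ and so on. It
only imitates the substitution and is not Nagios code. The address
192.0.2.10 and the community string "public" are placeholders I made up,
and expand_check_command is a hypothetical helper name.

```python
def expand_check_command(check_command, command_line, host_address):
    """Split check_command on '!' and substitute $ARGn$ into command_line."""
    name, *args = check_command.split("!")
    expanded = command_line.replace("$HOSTADDRESS$", host_address)
    for position, value in enumerate(args, start=1):
        expanded = expanded.replace(f"$ARG{position}$", value)
    return name, expanded


COMMAND_LINE = "check_snmp -H $HOSTADDRESS$ -C $ARG1$ -o $ARG2$ -c $ARG3$"

# As posted: four arguments, but the command line only uses $ARG1$..$ARG3$,
# so the OID lands in the -c (critical threshold) slot.
_, posted = expand_check_command(
    "check_snmp_interface!ip host!com string ro!ifOperStatus.1!1:1",
    COMMAND_LINE,
    "192.0.2.10",  # placeholder address, not from the thread
)
print(posted)

# One possible fix (an assumption): drop the host argument, since
# $HOSTADDRESS$ already supplies it, so community, OID and threshold line up.
_, fixed = expand_check_command(
    "check_snmp_interface!public!ifOperStatus.1!1:1",
    COMMAND_LINE,
    "192.0.2.10",
)
print(fixed)
```

The first line printed ends in "-c ifOperStatus.1", which matches the
"Invalid critical threshold" complaint.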

-lee



Re: [Nagios-users] Multiple IP address SSH/HTTP check

2009-02-25 Thread Lee Azzarello
Treat them as two different hosts, or pick a single address to depend on.

-lee

On Wed, Feb 25, 2009 at 11:13 AM, Satish Patel sat...@linuxbug.org wrote:

 I have a client which has multiple IP addresses and is running Apache and SSH
 on it, but I don't know how to check services on multiple IP addresses.

 I have defined both interface IP addresses in the hosts.cfg file

 like this:

 address 192.168.1.1,172.16.1.1

 Regards,

 Satish Patel








Re: [Nagios-users] lazy nagios und cygwin...

2009-02-24 Thread Lee Azzarello
This is gonna be hard to debug without any debugging output.

I concur, it is by no means normal for nagios to just stop doing
anything without user intervention. For comparison, I'm running
Nagios 3.0.6 from Debian with 34 hosts and 107 services. It has been
running continually for 3 months, which is when I first brought it
online.

-lee

On Tue, Feb 24, 2009 at 9:42 AM, Jakob Curdes j...@info-systems.de wrote:
 Hello,

 we are running nagios 3.0.5 under cygwin quite successfully in a
 relatively simple setup (most check results come via NRPE from NSCA++
 clients on Windows machines). Everything is well, but sometimes an
 awkward thing happens: Nagios just stops doing anything.

 - It continues to run: the processes are visible in the Windows process
 table
 - It ceases to execute checks: checks just stop being executed
 - It ceases to write anything to a logfile
 - It also ceases to write anything to a debug file with Debug=16 (last
 entry is nothing suspicious)

 After killing the nagios processes and restarting the windows service
 (nagios is registered as a windows service using the cygrunsrv facility)
 everything is back to normal.
 What I find most annoying is that even in the CGI output everything
 seems normal - just that the last checks are several hours overdue, but
 no alarm or other signal is shown.

 We are not sure if this is cygwin/windows related or if it is a problem
 which also happens in the normal unix environment.
 Comments would be very welcome. I will now turn up debugging further in
 the hope that then we see something interesting.

 Regards,
 Jakob Curdes



Re: [Nagios-users] 1st post to the group, basic q ... I think

2009-02-24 Thread Lee Azzarello
On Mon, Feb 23, 2009 at 11:10 PM, Lance Raymond
lraym...@weatherflow.com wrote:

 I tried a few things after the _v1 but Nagios kept failing on restart, so I
 think I just need to figure out how to push the variables (such as host IP,
 community name) to the check_command, right?  Since changing the service to
 use the new command, Nagios is now red with the following error: Return code
 of 255 is out of bounds

There are good docs on how to pass arguments to a check command.
Here's my pattern to bring new service checks online without mucking
about with the running nagios process:

1) Find the plugin on the host running nagios
2) Run the plugin manually as the nagios user with the arguments you
would like. Inspect the output. Is it what you expect?
3) If it is what you expect, compose the check command and service
definition exactly as your manual run, only use macro substitution as
per the documentation
4) Save your configs and check them before loading the new objects.
nagios -v nagios.cfg will automate this check and inform you that
everything is okay before you go live.
5) If everything's okay, reload the configuration and check the web
interface for the new service check status.
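
Steps 4 and 5 can be scripted. Below is a minimal Python sketch, assuming
the nagios binary is on the PATH and that you supply your own reload
command (init scripts differ between distributions, so I don't hardcode
one). verify_then_reload is a hypothetical helper name, and the "run"
parameter exists only so the function can be exercised without a live
Nagios.

```python
import subprocess


def verify_then_reload(config_path, reload_command, run=subprocess.run):
    """Run `nagios -v` on config_path and reload only if it exits with 0."""
    check = run(["nagios", "-v", config_path])
    if check.returncode != 0:
        # The pre-flight check failed: leave the running process alone.
        return False
    run(reload_command)
    return True


# Example call (the path and reload command are assumptions, adjust to taste):
# verify_then_reload("/etc/nagios3/nagios.cfg", ["/etc/init.d/nagios3", "reload"])
```

The point of the design is that a broken config never reaches the running
daemon; the reload only happens after a clean verification.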

-lee



Re: [Nagios-users] Understanding check_cluster

2009-02-24 Thread Lee Azzarello
Here's my config. It's functional:

define command{
  command_name  check-cluster-health
  command_line  /usr/lib/nagios/plugins/check_cluster --service -l $ARG1$ -w $ARG2$ -c $ARG3$ -d $ARG4$
}

define service{
  service_description   check-cluster-health
  host  app-proxy
  check_command check-cluster-health!App Thread Health!0!1!$SERVICESTATEID:app-1:mongrel-count$,$SERVICESTATEID:app-2:mongrel-count$,$SERVICESTATEID:app-3:mongrel-count$,$SERVICESTATEID:app-4:mongrel-count$
  use   serviceClusterTemplate
}

define service{
  service_description   mongrel-count
  hostgroup app-servers,manager-servers
  check_command check_nrpe_1arg!check_mongrel_count
  notifications_enabled 0
  use   serviceClusterTemplate
}
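
For what the 0 and 1 thresholds do, here is a small Python sketch of the
decision as I read it from the command-line runs quoted further down in
this message. It is an assumption inferred from those outputs, not the
plugin's source: count the members whose state ID is not 0 and alert when
the count is greater than the threshold. cluster_state is a hypothetical
name.

```python
def cluster_state(state_ids, warning, critical):
    """Return 'OK', 'WARNING' or 'CRITICAL' for a list of state IDs.

    Assumption: a member counts as a problem when its state ID is not 0,
    and a threshold is crossed when the problem count is greater than it.
    """
    not_ok = sum(1 for state in state_ids if state != 0)
    if not_ok > critical:
        return "CRITICAL"
    if not_ok > warning:
        return "WARNING"
    return "OK"


# The three --data runs quoted in this thread, with --warning=0 --critical=1
print(cluster_state([0, 0, 2, 1], warning=0, critical=1))  # CRITICAL
print(cluster_state([0, 0, 0, 0], warning=0, critical=1))  # OK
print(cluster_state([0, 0, 0, 1], warning=0, critical=1))  # WARNING
```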

-lee

On Tue, Feb 24, 2009 at 5:18 PM, Chris Beattie cbeat...@geninfo.com wrote:
 I need some help understanding the check_cluster plugin, please.  I’m using
 version 1.4.13 of the plugins on Nagios 3.10, all compiled from source on
 64-bit CentOS 5.2.  We use VMWare ESX clusters, and I’d like the hosts in
 Nagios that happen to be virtual machines to have one parent instead of a
 list of parents comprising every ESX host in the cluster.  Recently, an ESX
 host was moved from one cluster to another, so I had to change a lot of
 parents.  If there’s a better way to represent VMs and their hosts, I’m open
 to suggestions too.

 I don’t have any problem running it as the Nagios user from the command line
 and feeding it states, like so:

 ./check_cluster --host --data=0,0,2,1 --warning=0 --critical=1

 CLUSTER CRITICAL: Host cluster: 2 up, 1 down, 1 unreachable

 ./check_cluster --host --data=0,0,0,0 --warning=0 --critical=1

 CLUSTER OK: Host cluster: 4 up, 0 down, 0 unreachable

 ./check_cluster --host --data=0,0,0,1 --warning=0 --critical=1

 CLUSTER WARNING: Host cluster: 3 up, 1 down, 0 unreachable



 Adding --verbose just says “check_cluster - Warning: start=0 end=0;
 Critical: start=0 end=1” first.



 However, if I try anything with the $HOSTSTATEID$ macro, everything is
 always OK, even if I just make up host names:

 ./check_cluster --host --data=$HOSTSTATEID:duck$,$HOSTSTATEID:cow$,$HOSTSTATEID:chicken$ --warning=0 --critical=1

 CLUSTER OK: Host cluster: 3 up, 0 down, 0 unreachable



 I thought maybe macros work better when executed by Nagios, so I added a
 check_host_cluster command and a host with that as its check_command.

 define command {
     command_name    check_host_cluster
     command_line    $USER1$/check_cluster --host --label=$HOSTNAME$ --warning=$ARG1$ --critical=$ARG2$ --data=$ARG3$
 }

 define host {
     use linux-server
     host_name   ProductionCluster1
     alias   Production Cluster 1
     address 127.0.0.1
     parents gisesx1,gisesx3,gisesx4
     check_command   check_host_cluster!1!2!$HOSTSTATEID:foo1$,$HOSTSTATEID:foo3$,$HOSTSTATEID:foo4$
     hostgroups  nogsupport
 }



 The check_interval for the linux-server template is set to 3.  I made the
 assumption that it didn’t matter what I set the address to since I’m only
 interested in the state of other hosts, and it’s not being referenced in the
 check_command.

 It shows up in the host information web page as being up, but I don’t have
 any hosts named foo:

 Host Status: UP (for 0d 3h 41m 9s+)
 Status Information: CLUSTER OK: ProductionCluster1: 3 up, 0 down, 0 unreachable



 I had better luck with check_icmp, but it looks like it goes straight to
 CRITICAL if one host is down.


Re: [Nagios-users] NRPE Problem

2009-02-12 Thread Lee Azzarello
You can also try typing your arguments at the far end in the nrpe
command file. I'm not the biggest fan of passing arguments via nrpe.
I've found it simpler to deploy the same nrpe command file to a
cluster and have nagios just execute the check_nrpe_1arg command.

-lee

On Thu, Feb 12, 2009 at 3:39 AM, LEVEAU Stanislas
stanislas.lev...@ac-caen.fr wrote:
 hi

 in your define command, i think it's

 define command{
 command_name check_nrpe_manual_disk
 command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c check_manual_disk -a $SERVICESTATE$ $SERVICESTATETYPE$ $SERVICEATTEMPT$
 }

 and not  -a $ARG1$ $ARG2$ $ARG3$

 maybe

 regards
 Stan

 Thomas Guyot-Sionnest wrote:

 On 11/02/09 02:43 PM, Vasiliy Boulytchev wrote:


 Gents,

   I am trying to have Nagios specify which partition to check via
 nrpe... and am getting the following error:

 DISK CRITICAL - 10 is not accessible: No such file or directory

 Here is my configuration:

 define command{
 command_name check_nrpe_manual_disk
 command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c check_manual_disk -a $ARG1$ $ARG2$ $ARG3$
 }

 command[check_manual_disk]=/usr/local/nagios-plugins/libexec/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$

 define service{
 use local-service
 host_name localhost
 service_description Manual Disk /dev/sda1
 check_command check_nrpe_manual_disk!20!10!/
 }


 It looks like an extra argument is passed somehow, so check_disk ends up
 checking the path 10 (your critical threshold), although your
 config looks ok. Are you sure NRPE and Nagios have been started with the
 most recent config, and there's not more than one instance of Nagios and
 NRPE running?

 Have you tried running the check through the command line (check_nrpe)?

 --
 Thomas



 --
 LEVEAU Stanislas
 Rectorat de Caen
 SIAC
 168, rue Caponière
 B.P. 6184
 14061 CAEN Cedex

 Service Informatique de l'académie de Caen
 Département Systèmes  Réseaux

 Tel : 02.31.30.17.86






[Nagios-users] Could not complete SSL handshake

2009-02-12 Thread Lee Azzarello
Here's a mystery for the books. I was alerted this morning of a socket
timeout while nagios attempted to connect the NRPE server on a remote
host. I go in and manually check that host and sure enough:

Feb 12 16:02:59 conversion-10 nrpe[6886]: Error: Could not complete SSL handshake. 5
Feb 12 16:36:03 conversion-10 nrpe[7270]: Error: Could not complete SSL handshake. 5

Weird, but sort of understandable. Just to make sure it's down, from
the host where Nagios is running:

control-1:~# telnet conversion-10.internal 5666
Trying 10.254.163.50...
Connected to conversion-10.internal.
Escape character is '^]'.

Huh? I can connect via telnet. NRPE is not down.

Then I visually check other services on the remote host through the web
interface, two of which are also NRPE service checks. They are not
generating the SSL handshake error, no socket timeout, status OK, same
host. Wacky. Well, last thing to try is to execute the NRPE check
manually from the host where Nagios is running:

control-1:~# /usr/lib/nagios/plugins/check_nrpe -H conversion-10.internal -c check_tmpdir_links
check_tmpdir_links OK - result:2823  |links$=2823

Woah, dude!? Uhhh, why is this singular service check telling me it's
having a socket timeout ONLY when run from Nagios but not from an
interactive shell? I give up...

2 hours pass, then I am alerted of the following event:

[1234458138] SERVICE ALERT: conversion-10;tmpdir-links;OK;HARD;1;check_tmpdir_links OK - result:2846

WTF? It fixed itself? Scary. The only pattern I can see is that the
timed-out service is checked infrequently. I configured it to be
checked every 30 minutes, while the others are checked far more often,
every 5 minutes. Maybe I just got unlucky and hit some high
network latency? I don't know.

-lee



Re: [Nagios-users] graphing trends across hosts or services instead of a timeseries

2009-02-12 Thread Lee Azzarello
Nagios itself does have some trending tools in version 3, though they
are not very comprehensive. Are you looking for something beyond their
scope?

-lee

On Thu, Feb 12, 2009 at 3:09 AM, Rahul Nabar rpna...@gmail.com wrote:
 One other thing that I haven't figured out yet with PNP-NAGIOS is this: How
 does one get trending across services or hosts? That is, it is easy to see time
 series graphs of ping times, load averages, disk usage etc., but sometimes
 what seems more relevant is a chart across services for a given snapshot in
 time. Say, to identify a hot node, or a node with unusually high load
 averages.

 Is there a way to do this? Or am I tinkering with the wrong tool?

 --
 Rahul



Re: [Nagios-users] Ack/Downtime on multiple hosts/services

2009-02-12 Thread Lee Azzarello
I would think of this as more of a policy question than a technical
one. If a large number of hosts and services are going down often
enough for you to need a batch processing script to tell Nagios about
the downtime, perhaps your monitoring plan could be altered?
You can define a check time period that only watches for critical
states when the hosts are expected to be online. You can also change
which services are checked for the group of hosts that have frequent
maintenance periods.
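
If a script is still wanted after that, the command-pipe approach Thomas
mentions below might look roughly like this in Python. It builds one
SCHEDULE_HOST_DOWNTIME external command per host (fixed downtime, no
trigger). The host names, author and comment in the example call are made
up, and the location of the command file depends on your install, so it is
passed in rather than assumed. Both function names are hypothetical.

```python
def downtime_commands(hosts, now, start, duration_seconds, author, comment):
    """Build one SCHEDULE_HOST_DOWNTIME external command line per host."""
    end = start + duration_seconds
    return [
        # Fields: host;start;end;fixed;trigger_id;duration;author;comment
        f"[{now}] SCHEDULE_HOST_DOWNTIME;{host};{start};{end};1;0;"
        f"{duration_seconds};{author};{comment}"
        for host in hosts
    ]


def send_commands(lines, command_file):
    """Write the command lines to the Nagios external command file."""
    with open(command_file, "w") as pipe:
        for line in lines:
            pipe.write(line + "\n")


# Hypothetical example: one hour of downtime for two made-up hosts.
example = downtime_commands(
    ["web-1", "web-2"],
    now=1234458138,
    start=1234458200,
    duration_seconds=3600,
    author="lee",
    comment="maintenance",
)
print("\n".join(example))
```

Only downtime_commands is exercised here; send_commands needs a running
Nagios with external commands enabled to do anything useful.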

-lee

On Thu, Feb 12, 2009 at 12:30 AM, Thomas Guyot-Sionnest derm...@aei.ca wrote:

 On 11/02/09 12:11 PM, Jason Frisvold wrote:
 Greetings,

   I'm relatively new to Nagios and still trying to wrap my head around
 it.  One thing that I have not been able to find is a way to ack or
 place into downtime multiple items at once.  Is there a way to do this?
  If so, how?  If not, is there some other way I'm supposed to handle
 these sorts of situations?

 If you put a host into downtime, it suppresses all service notifications.
 You can also use commands like "Disable all notifications for this host".
 There are also a few commands you can run for the whole hostgroup or
 servicegroup.

 Other than that I used to write quick and dirty scripts to send
 mass commands to the command pipe in the past - nothing really re-usable
 though, see the libexec/eventhandlers directory if you need some
 inspiration.

 --
 Thomas


Re: [Nagios-users] graphing trends across hosts or services instead of a timeseries

2009-02-12 Thread Lee Azzarello
In Nagios version 3, you click on Reporting-Trends and use the menus
to generate a picture.

The limitation is that you can only see one picture at a time, for a
particular host or service. So you could not, for example, have an
automatically generated dashboard of trends for a particular host or
service group. Though there's nothing stopping you from taking
screenshots and making your own.
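
For the cross-host snapshot asked about below, an alternative to
screenshots is to pull the latest values out of the status file and rank
them yourself. This is only a sketch under assumptions: it expects
"hoststatus { ... }" blocks with host_name= and performance_data= lines,
and a host check that reports rta=...ms in its perfdata. The sample text
and its numbers are invented for illustration, and host_rta is a
hypothetical name.

```python
import re


def host_rta(status_text):
    """Map host_name to round-trip average (ms) from hoststatus blocks."""
    results = {}
    for block in re.findall(r"hoststatus \{(.*?)\}", status_text, re.S):
        fields = dict(
            line.strip().split("=", 1)
            for line in block.splitlines()
            if "=" in line
        )
        match = re.search(r"rta=([0-9.]+)ms", fields.get("performance_data", ""))
        if match:
            results[fields["host_name"]] = float(match.group(1))
    return results


# Invented sample data, shaped like the assumed status file format.
SAMPLE = """
hoststatus {
    host_name=app-1
    performance_data=rta=0.412000ms;3000.000000;5000.000000;0.000000 pl=0%;80;100;0
    }
hoststatus {
    host_name=app-2
    performance_data=rta=7.905000ms;3000.000000;5000.000000;0.000000 pl=0%;80;100;0
    }
"""

# Slowest hosts first, which is the "hot node" view.
ranked = sorted(host_rta(SAMPLE).items(), key=lambda item: item[1], reverse=True)
for host, rta in ranked:
    print(f"{host}\t{rta:.3f} ms")
```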

-lee

On Thu, Feb 12, 2009 at 2:43 PM, Rahul Nabar rpna...@gmail.com wrote:


 On Thu, Feb 12, 2009 at 11:46 AM, Lee Azzarello l...@dropio.com wrote:

 Nagios itself does have some trending tools in version 3, though they
 are not very comprehensive. Are you looking for something beyond their
 scope?

 Thanks Lee. I am not aware of the scope of the inbuilt trending tools. Maybe
 that's a good place to start. How does one use those? Say, how can one
 obtain a graph of ping times across all hosts in a suitable format?

 That might make it easy to identify problem machines.

 --
 Rahul




Re: [Nagios-users] Which books are best for Nagios 3

2009-02-04 Thread Lee Azzarello
On Wed, Feb 4, 2009 at 12:01 PM, Paul Weaver paul.wea...@bbc.co.uk wrote:
 On Jan 19, 2009, at 11:25 AM, James Miller wrote:

  Recently I upgraded us from 2.9 to 3.0.6 and I'm looking for
  recommendations for a book or two on 3.0 to take my knowledge to the
  next level.  I've found the following books available and I'm
  wondering which ones are the best:

  Any suggestions would be _greatly_ appreciated!

 5. This list. Try to discover the answer to as many questions
 as you can yourself by reading the docs and source and by
 trial. Try to understand the why of an answer you discover or
 provided by someone else. I consider it a much more
 interesting and varied resource than the few standard
 examples a book might have; plus it's free =).

 Only once you reach a certain level. There are no doubt many lurkers
 who have trouble getting started. I know I did. I got a copy of
 Building a Monitoring Infrastructure with Nagios (Feb07) and found
 that invaluable, but we're still on nagios version 2

Agreed. Dave Josephsen's book is excellent. I'm currently on the
visualization chapter. It's very comprehensive and practical. He's
opinionated enough to make reading chapters interesting.

-lee
