[Nagios-users] [UPDATE] remote check_by path question

2009-10-27 Thread lance raymond
Sorry, after a 10 or so minutes (not sure on the latency) but the error did
change.  It now shows the following;
Warning threshold must be float or float triplet!I am hitting a meeting now
but at least there is progress and something else I can google unless
someone knows of this already and has a simple fix.  Will check back after
the meeting regardless, thanks as always.

-- Forwarded message --
From: lance raymond lance.raym...@gmail.com
Date: Tue, Oct 27, 2009 at 11:01 AM
Subject: remote check_by path question
To: Nagios Users nagios-users@lists.sourceforge.net


I now want to add some remote checks, and it looks like a simple path issue,
yet I can't seem to verify.  The groundwork server, and all webservers are
running Ubuntu-Server 8.10.  SSH key's are setup with the user nagios, and I
can remotely ssh in to the 1st webserver no problem.  I am / was getting the
following error;
Remote command execution failed: sh: libexec/check_load: not found  and
sure enough nothing was on that remote webserver.  So I installed both the
aptitude packages;
nagios-nrpe-plugin  - Nagios Remote Plugin Executor Plugin
nagios-nrpe-server  - Nagios Remote Plugin Executor Server

Not sure if I need the server or not, but it's installed and the check_load
and others are put in the default;
/usr/lib/nagios/plugins/
Looking at the error, I simply made a symlink under /home/nagios/libexec but
still getting the same error.  Below is just a copy/paste of the ssh in and
test.

nag...@groundwork:/usr/local/groundwork/nagios/etc$ ssh 192.168.50.101
ssh: /usr/local/groundwork/common/lib/libcrypto.so.0.9.8: no version
information available (required by ssh)
Linux ws1 2.6.27-7-server #1 SMP Tue Nov 4 20:16:57 UTC 2008 x86_64

Last login: Tue Oct 27 10:57:55 2009 from 192.168.50.6
$ libexec/check_load -w 5,4,3 -c 10,8,6
OK - load average: 0.74, 1.43, 1.66|load1=0.740;5.000;10.000;0;
load5=1.430;4.000;8.000;0; load15=1.660;3.000;6.000;0;

So things do seem to work, just not sure.  The checkcommand shows variable
USER22, I could probably hardcode the full path, but not sure if that's the
right thing to do.

Thanks as always.
--
Come build with us! The BlackBerry(R) Developer Conference in SF, CA
is the only developer event you need to attend this year. Jumpstart your
developing skills, take BlackBerry mobile applications to market and stay 
ahead of the curve. Join us from November 9 - 12, 2009. Register now!
http://p.sf.net/sfu/devconference___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] remote check_by path question

2009-10-27 Thread lance raymond
I now want to add some remote checks, and it looks like a simple path issue,
yet I can't seem to verify.  The groundwork server, and all webservers are
running Ubuntu-Server 8.10.  SSH key's are setup with the user nagios, and I
can remotely ssh in to the 1st webserver no problem.  I am / was getting the
following error;
Remote command execution failed: sh: libexec/check_load: not found  and
sure enough nothing was on that remote webserver.  So I installed both the
aptitude packages;
nagios-nrpe-plugin  - Nagios Remote Plugin Executor Plugin
nagios-nrpe-server  - Nagios Remote Plugin Executor Server

Not sure if I need the server or not, but it's installed and the check_load
and others are put in the default;
/usr/lib/nagios/plugins/
Looking at the error, I simply made a symlink under /home/nagios/libexec but
still getting the same error.  Below is just a copy/paste of the ssh in and
test.

nag...@groundwork:/usr/local/groundwork/nagios/etc$ ssh 192.168.50.101
ssh: /usr/local/groundwork/common/lib/libcrypto.so.0.9.8: no version
information available (required by ssh)
Linux ws1 2.6.27-7-server #1 SMP Tue Nov 4 20:16:57 UTC 2008 x86_64

Last login: Tue Oct 27 10:57:55 2009 from 192.168.50.6
$ libexec/check_load -w 5,4,3 -c 10,8,6
OK - load average: 0.74, 1.43, 1.66|load1=0.740;5.000;10.000;0;
load5=1.430;4.000;8.000;0; load15=1.660;3.000;6.000;0;

So things do seem to work, just not sure.  The checkcommand shows variable
USER22, I could probably hardcode the full path, but not sure if that's the
right thing to do.

Thanks as always.
--
Come build with us! The BlackBerry(R) Developer Conference in SF, CA
is the only developer event you need to attend this year. Jumpstart your
developing skills, take BlackBerry mobile applications to market and stay 
ahead of the curve. Join us from November 9 - 12, 2009. Register now!
http://p.sf.net/sfu/devconference___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Checking load via snmp

2009-10-23 Thread lance raymond
The server and all the devices I will be polling are all Dell PowerEdge
servers, the 1750, 1850 and 1950 models.  The plugin I am using is what cam
in the install.  I will look at some options as well as the exchange as
there may be others, just saw that and figured it was a basic check I am
sure so would use the stock one.

Thanks

 (Sorry Andy for the direct response)

 On Thu, Oct 22, 2009 at 7:36 PM, Andy Ford a...@andyford.org wrote:

 What kind of device are you polling? It could be that it provides load and
 percent CPU used via SNMP and the your plugin is using the percent CPU
 OID(s).

 On Thu, Oct 22, 2009 at 2:20 PM, lance raymond 
 lance.raym...@gmail.comwrote:

 I see a nice little check_snmp_load plugin that comes with
 groundwork/nagios, but not sure if it's correct.  When I run from the
 command line on the nagio server, I get the following;

 ./check_snmp_load.pl -H 192.168.50.101 -C community -w 50 0 -c 60   I
 get the following reply;
 4 CPU, average load 14.8  50 : OK

 at the same time, a 'w' shows;
 r...@ws1:~# w
  15:18:31 up 21 days, 17:51,  3 users,  load average: 1.12, 1.75, 1.72

 So I am trying to see if I am missing something as to why the load is
 showing at 14, where it should be one.

 Not sure if anyone is using this over the check_by_ssh, and has feedback,
 but I would like to just get the immediate server load every x amount of
 minutes, not worried about the 5 minute average, etc.

 Thanks


 --
 Come build with us! The BlackBerry(R) Developer Conference in SF, CA
 is the only developer event you need to attend this year. Jumpstart your
 developing skills, take BlackBerry mobile applications to market and stay
 ahead of the curve. Join us from November 9 - 12, 2009. Register now!
 http://p.sf.net/sfu/devconference
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when
 reporting any issue.
 ::: Messages without supporting info will risk being sent to /dev/null




 --
 http://www.andrewford.org



--
Come build with us! The BlackBerry(R) Developer Conference in SF, CA
is the only developer event you need to attend this year. Jumpstart your
developing skills, take BlackBerry mobile applications to market and stay 
ahead of the curve. Join us from November 9 - 12, 2009. Register now!
http://p.sf.net/sfu/devconference___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] 1st post, so naturally the 1st question by the nagios newb re: socket error

2009-10-22 Thread lance raymond
I haven't installed anything on the webservers (yet), but I do wish to via
snmp do load, cpu, etc. or if there is a better way like a client daemon,
feel free to let me know. I am still reading/understanding what each is used
for (service.cfg, commands.cfg, etc.) but I have the following all for that
check;

services.cfg
define service {
service_description check_s1
use generic-service
hostgroup_name  s1_WebServers
check_command   check_s1!3!5
}

check_commands.cfg
define command {
command_namecheck_s1
command_line$USER1$/check_http -H
www.site1.com-w $ARG1$ -c $ARG2$
}

command.cfg has;
command[check_s1]=/usr/local/groundwork/nagios/libexec/check_http -H
www.site1.com -I $HOSTADDRESS$

As for where the local command was run, yes it was local to the groundwork
(nagios) server from the following location;

r...@groundwork:/usr/local/groundwork/nagios/libexec/check_http -H
www.site1.com -I 192.168.50.101
HTTP OK HTTP/1.1 200 OK - 14351 bytes in 0.249 seconds
|time=0.249456s;;;0.00 size=14351B;;;0

I am not sure what the NRPE plugin is but just looking at the options, it
looks like it's run on the client as there is no spot for IP, when run local
it simply says connection refused by host for the site1.com domain.

Thanks.

On Thu, Oct 22, 2009 at 4:59 AM, Martin Melin mme...@gmail.com wrote:

 Hi,
 Where did you define the check_s1 command? On which host? It looks like an
 NRPE config so is this on one of the web servers or on your Nagios host?

 Where did you try running the command from the shell? On the Nagios host or
 the web server?

 Please show the command definition from Nagios for check_s1 as well.

 Make sure that you've edited the correct NRPE config file, reloaded nrped
 if you're not using inetd. To check that
 the check works from the command line, run the check_nrpe command from 
 Nagios, not check_http from Nagios or the webserver.

 Regards,
 Martin Melin

 On Thu, Oct 22, 2009 at 3:32 AM, lance raymond lance.raym...@gmail.comwrote:

 Hi all, well I am running GroundWork open source and really happy with the
 layout, etc. and now comes the nagios things.  My goal is to setup things
 neatly, organized and learn as I go rather than asking stupid q's along the
 way.  So with that, the 1st task;

 I have 8 webservers all host numerous sites, a few host only select.  So I
 have created some hostgroups, adding the servers that belong to each, and
 wonderfully it works.  I can select a server, see the group, click the
 group, see the servers (did I mention I'm good)  :)

 Anyway, I want to setup 2 seperate checks one for each website, then
 associate them with the groups, so for example;
 ws1 (site1, site2)
 ws2 (site1)
 ws3 (site1, site2)
 ws4 (site2)

 Pretty simple.  So I copied the check_http command where I saw it used and
 have the following;

 command[check_http]=/usr/local/groundwork/nagios/libexec/check_http -H
 $HOSTADDRESS$ -I $HOSTADDRESS$
 command[check_s1]=/usr/local/groundwork/nagios/libexec/check_http -H
 www.site1.com -I $HOSTADDRESS$
 command[check_s2]=/usr/local/groundwork/nagios/libexec/check_http -H
 www.site2.com -I $HOSTADDRESS$

 The services.cfg has the following (will only show 1 for space)

 define service {
 service_description  check_s1
 usegeneric-service
 hostgroup_name s1_WebServers
 check_commandcheck_s1!3!5
 }

 Now after the above setup and a restart, I have happy to see the check_s1
 listed but it's showing critical with the following error;
 CRITICAL - Socket timeout after 10 seconds

 At the command line check it does work;
  ./check_http -H www.site1.com -I 192.168.50.101
 HTTP OK HTTP/1.1 200 OK - 14398 bytes in 0.090 seconds
 |time=0.089949s;;;0.00 size=14398B;;;0

 I am looking through where people are increasing the timeout, but the
 response time is not even a second, so there must be something else I am
 missing.  So I am looking forward to learning more, welcome any ideas,
 suggestions, or anything else I should know about using nagios under
 groundwork.

 This is running under ubuntu-8.10 server, Program Version:3.0.6, let me
 know what else I can or need to provide on this.



 --
 Come build with us! The BlackBerry(R) Developer Conference in SF, CA
 is the only developer event you need to attend this year. Jumpstart your
 developing skills, take BlackBerry mobile applications to market and stay
 ahead of the curve. Join us from November 9 - 12, 2009. Register now!
 http://p.sf.net/sfu/devconference
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when

Re: [Nagios-users] 1st post, so naturally the 1st question by the nagios newb re: socket error

2009-10-22 Thread lance raymond
[SOLVED].  That did it.  Can't believe I even wrote the command line and
included it, but I just followed the default.  But thanks.

Looking forward to all the learning from both playing and the group.


On Thu, Oct 22, 2009 at 10:17 AM, Ortner, Gerald gerald.ort...@gespag.atwrote:





 Try



 define command {
 command_namecheck_s1
 command_line$USER1$/check_http -H
 www.site1.com -I $HOSTADDRESS$ -w $ARG1$ -c $ARG2$
 }



 *Von:* lance raymond [mailto:lance.raym...@gmail.com]
 *Gesendet:* Donnerstag, 22. Oktober 2009 15:17
 *An:* nagios-users@lists.sourceforge.net
 *Betreff:* Re: [Nagios-users] 1st post,so naturally the 1st question by
 the nagios newb re: socket error



 I haven't installed anything on the webservers (yet), but I do wish to via
 snmp do load, cpu, etc. or if there is a better way like a client daemon,
 feel free to let me know. I am still reading/understanding what each is used
 for (service.cfg, commands.cfg, etc.) but I have the following all for that
 check;

 services.cfg
 define service {
 service_description check_s1
 use generic-service
 hostgroup_name  s1_WebServers
 check_command   check_s1!3!5
 }

 check_commands.cfg
 define command {
 command_namecheck_s1
 command_line$USER1$/check_http -H
 www.site1.com -w $ARG1$ -c $ARG2$
 }

 command.cfg has;
 command[check_s1]=/usr/local/groundwork/nagios/libexec/check_http -H
 www.site1.com -I $HOSTADDRESS$

 As for where the local command was run, yes it was local to the groundwork
 (nagios) server from the following location;

 r...@groundwork:/usr/local/groundwork/nagios/libexec/check_http -H
 www.site1.com -I 192.168.50.101
 HTTP OK HTTP/1.1 200 OK - 14351 bytes in 0.249 seconds
 |time=0.249456s;;;0.00 size=14351B;;;0

 I am not sure what the NRPE plugin is but just looking at the options, it
 looks like it's run on the client as there is no spot for IP, when run local
 it simply says connection refused by host for the site1.com domain.

 Thanks.

 On Thu, Oct 22, 2009 at 4:59 AM, Martin Melin mme...@gmail.com wrote:

 Hi,



 Where did you define the check_s1 command? On which host? It looks like an
 NRPE config so is this on one of the web servers or on your Nagios host?



 Where did you try running the command from the shell? On the Nagios host or
 the web server?



 Please show the command definition from Nagios for check_s1 as well.



 Make sure that you've edited the correct NRPE config file, reloaded nrped
 if you're not using inetd. To check that
 the check works from the command line, run the check_nrpe command from 
 Nagios, not check_http from Nagios or the webserver.



 Regards,

 Martin Melin



 On Thu, Oct 22, 2009 at 3:32 AM, lance raymond lance.raym...@gmail.com
 wrote:

  Hi all, well I am running GroundWork open source and really happy with
 the layout, etc. and now comes the nagios things.  My goal is to setup
 things neatly, organized and learn as I go rather than asking stupid q's
 along the way.  So with that, the 1st task;

 I have 8 webservers all host numerous sites, a few host only select.  So I
 have created some hostgroups, adding the servers that belong to each, and
 wonderfully it works.  I can select a server, see the group, click the
 group, see the servers (did I mention I'm good)  :)

 Anyway, I want to setup 2 seperate checks one for each website, then
 associate them with the groups, so for example;
 ws1 (site1, site2)
 ws2 (site1)
 ws3 (site1, site2)
 ws4 (site2)

 Pretty simple.  So I copied the check_http command where I saw it used and
 have the following;

 command[check_http]=/usr/local/groundwork/nagios/libexec/check_http -H
 $HOSTADDRESS$ -I $HOSTADDRESS$
 command[check_s1]=/usr/local/groundwork/nagios/libexec/check_http -H
 www.site1.com -I $HOSTADDRESS$
 command[check_s2]=/usr/local/groundwork/nagios/libexec/check_http -H
 www.site2.com -I $HOSTADDRESS$

 The services.cfg has the following (will only show 1 for space)

 define service {
 service_description  check_s1
 usegeneric-service
 hostgroup_name s1_WebServers
 check_commandcheck_s1!3!5
 }

 Now after the above setup and a restart, I have happy to see the check_s1
 listed but it's showing critical with the following error;
 CRITICAL - Socket timeout after 10 seconds

 At the command line check it does work;
  ./check_http -H www.site1.com -I 192.168.50.101
 HTTP OK HTTP/1.1 200 OK - 14398 bytes in 0.090 seconds
 |time=0.089949s;;;0.00 size=14398B;;;0

 I am looking through where people are increasing the timeout, but the
 response time is not even a second, so there must be something else I am
 missing.  So I am looking forward to learning more, welcome any ideas,
 suggestions, or anything else I should know about using nagios

[Nagios-users] 1st post, so naturally the 1st question by the nagios newb re: socket error

2009-10-21 Thread lance raymond
Hi all, well I am running GroundWork open source and really happy with the
layout, etc. and now comes the nagios things.  My goal is to setup things
neatly, organized and learn as I go rather than asking stupid q's along the
way.  So with that, the 1st task;

I have 8 webservers all host numerous sites, a few host only select.  So I
have created some hostgroups, adding the servers that belong to each, and
wonderfully it works.  I can select a server, see the group, click the
group, see the servers (did I mention I'm good)  :)

Anyway, I want to setup 2 seperate checks one for each website, then
associate them with the groups, so for example;
ws1 (site1, site2)
ws2 (site1)
ws3 (site1, site2)
ws4 (site2)

Pretty simple.  So I copied the check_http command where I saw it used and
have the following;

command[check_http]=/usr/local/groundwork/nagios/libexec/check_http -H
$HOSTADDRESS$ -I $HOSTADDRESS$
command[check_s1]=/usr/local/groundwork/nagios/libexec/check_http -H
www.site1.com -I $HOSTADDRESS$
command[check_s2]=/usr/local/groundwork/nagios/libexec/check_http -H
www.site2.com -I $HOSTADDRESS$

The services.cfg has the following (will only show 1 for space)

define service {
service_description  check_s1
usegeneric-service
hostgroup_name s1_WebServers
check_commandcheck_s1!3!5
}

Now after the above setup and a restart, I have happy to see the check_s1
listed but it's showing critical with the following error;
CRITICAL - Socket timeout after 10 seconds

At the command line check it does work;
 ./check_http -H www.site1.com -I 192.168.50.101
HTTP OK HTTP/1.1 200 OK - 14398 bytes in 0.090 seconds
|time=0.089949s;;;0.00 size=14398B;;;0

I am looking through where people are increasing the timeout, but the
response time is not even a second, so there must be something else I am
missing.  So I am looking forward to learning more, welcome any ideas,
suggestions, or anything else I should know about using nagios under
groundwork.

This is running under ubuntu-8.10 server, Program Version:3.0.6, let me know
what else I can or need to provide on this.
--
Come build with us! The BlackBerry(R) Developer Conference in SF, CA
is the only developer event you need to attend this year. Jumpstart your
developing skills, take BlackBerry mobile applications to market and stay 
ahead of the curve. Join us from November 9 - 12, 2009. Register now!
http://p.sf.net/sfu/devconference___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] nagios to graph ethernet traffic.. can it ?

2009-04-28 Thread lance raymond
 
ok, I now have (thanks to the group) my server up and currently just  
watching one box... many more to be added.  We also run cacti for  
ethernet traffic, etc. and wondering can this be done as well.
 
I am looking on the nagios exchange site and don't see anything that can  
take that type of data, then graph it.
 
If it is possible, what requirements do I need (and should I have  
already).  I still need to install nagios graph and thought I would just  
start with my 1st simple machine, then add some ethernet traffic in/out  
to him and once he is checking everything I can simply add other hosts...
 
Thanks as always..


--
Register Now  Save for Velocity, the Web Performance  Operations 
Conference from O'Reilly Media. Velocity features a full day of 
expert-led, hands-on workshops and two days of sessions from industry 
leaders in dedicated Performance  Operations tracks. Use code vel09scf 
and Save an extra 15% before 5/3. http://p.sf.net/sfu/velocityconf
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] check_snmp_storage plugin - 126 return code

2009-04-20 Thread lance raymond
Not sure what's wrong, seems simple enough.  I am using the following as 
a guide on this plugin 
(http://nagios.manubulon.com/index_commands.html#storage)
 From the command line on the nagios server I can get it to work;

perl check_snmp_storage.pl -H x.x.x.x -C community -m / -w 90 -c 95 -r
/: 67%used(99233MB/147706MB) (90%) : OK


I have the following 2 things setup.. in commands.cfg;
define command{
   command_name check_snmp_storage
   command_line $USER1$/check_snmp_storage.pl -H $HOSTADDRESS$ -m $ARG1$ 
-w $ARG2$ -c $ARG3$ $ARG4$
}


And the server has the following;
define service{
use local-service
host_name cs0
service_description   Root / partition
check_command   check_snmp_storage!/!90!95!-r
}



--
Stay on top of everything new and different, both inside and 
around Java (TM) technology - register by April 22, and save
$200 on the JavaOne (SM) conference, June 2-5, 2009, San Francisco.
300 plus technical and hands-on sessions. Register today. 
Use priority code J9JMT32. http://p.sf.net/sfu/p
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Still some basic 101 items

2009-04-14 Thread Lance Raymond
I'm not sure I should say sorry for the basic questions as this is what the 
group is for, but seeing some of the in depth questions... talk about an 
inferiority complex.   Well, I now can see how to debug the start (thanks 
Marc) and figured my one problem.  But back to a basic few things.  I am 
looking to have nagios monitor 40 or so servers.  10 or so webservers, a few 
database, all have ssh, etc. so as I am trying to lay this out in my head, 
for an example, is this the right thinking.

Each box has an unique name in a server folder, /server/servername.cfg  
(that is what I have started and kind of working).
The webservers will have the following;
the define host {
host_name
address
use (I think this is my problem)
}

Now the above, the use I think is the actual hostgroup I wish it to belong 
to (ex, webserver, linux-server, etc.)  I am thinking I will have group 
temapltes, webserver will have an http test, linux-server can have an ssh 
check, load, users, etc. Now if the above is making sense, the use will have 
multiple, ex,
use   webserver, linux-server(I assume this is possible)  That would 
take care of defining the default services to check for those particular 
box's.

I guess I am looking for both confirmation and suggestions on some high 
level down setups.  I don't want each host having 10 define_service items.  
I would rather say he is a webserver, and linux box.  That by nature will 
check x, y and z.  The problem so far is I have 4 servers, and even though I 
have an address in each, the use is still the default so ALL the box's 
details, users, swap usage, etc. is referenceing the localhost and not that 
individual box.

That is where I am stuck, the doc's are really not to helpful, but I am 
trying to understand more the seperate files.  How the use works, when to 
use a hostgroup, etc.  I will gladly provide cfg files or if someone has 
some basic things (aside from the install templates) or some help on the 
above it's appreciated.

Lr
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] initial setups, hostgroup help requested

2009-04-13 Thread lance raymond




still getting things in order, and having a problem with adding my 1st
hostgroup. Side note, is there any kind of logging, or debugging to
see where a failure occurs? Ex. I add a new config file with the
following basic line in the main conf;
 cfg_file=/usr/local/nagios/etc/hostgroups.cfg
The file has the following;
 define hostgroup{
 hostgroup_name webservers
   alias webservers
   contact_groups sysadmins
   members cs0
   }

The alias I just made up, the contact group sysadmin's exists and the
member cs0 is one of my webservers. In the actual server.cfg file for
that server I have a line;
 hostgroups  webservers

I don't know if that needs to be there as I am defining his as a member
in the hostgroups.cfg file.

Keeping things like this I start nagios and simply get;
Starting nagios:CONFIG ERROR! Start aborted. Check your Nagios
configuration.
(which is why I am looking as the nagios.log says nothing helpful.

Now if I comment out the hostgroup line in the main conf, things start,
but I would like to use hostgroups for the obvious reasons. I am sure
these and the new ones will be pretty basic as the needs are simple,
and if I can debug where it stopping would be even easier, but again,
this will be a1 or 2 sentence reply I bet!

Tnx





--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] check_local_disk basic question

2009-03-05 Thread Lance Raymond
Still new to it, but so far things are good.  I have added a remote 
host, monitoring load via snmp successfully, but my old alarm keeps 
tripping due to disk space.

df -h shows;
Filesystem   Size  Used Avail Use% Mounted on
/fs here 660G  597G   64G  91% /

so it seems 91% is in use, nagios is showing (nice and green);
DISK OK - free space: / 27867 MB (92% inode=99%):

The config for that machine (servername.cfg) shows;
check_command   check_local_disk!20%!10%!/


So to me that says when 20% left warn me, 10% critical alarm, right?

Thanks


--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] gui config application

2009-02-27 Thread Lance Raymond
Googling around there are a few out there, mostly ng1, ng2, and I am 
wondering if there is something nice and clean out there for nagios 3.

I don't mind the cli, but sometimes a point/click is just so much easier.

New to the list, so if it's been discussed, I apologize.


--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] 1st post to the group, basic q ... I think

2009-02-24 Thread Lance Raymond




ok, I am closer, but still unclear on the check_command and the define
service part (I will try to read more 2nt) but I can run in manually as
nagios from the cli;

./check_snmp_load.pl -H 1.1.1.1 -C community -w 3,3,2 -c 4,4,3 -T netsl
Load : 4.37 4.27 4.20 : 4.37  4 : CRITICAL 4.27  4 : CRITICAL
4.20  3 : CRITICAL

so that works. Now in the command.cfg I have the following;

define command{
 command_name check_snmp_load_v1
 command_line $USER1$/check_snmp_load.pl -H $HOSTADDRESS$ $USER7$ -T
$ARG1$ -w $ARG2$ -c $ARG3$ $ARG4$ -C community
}

and the server.cfg I have;

define service{
 use local-service ; Name of
service$
 host_name bigbird
 service_description Current Load via SNMP
 check_command
check_snmp_load_v1!netsl!3,3,2!4,4,3
}

I have played numerous times, that is the latest, I am trying to
understand where the $ARG variables are defined (and how). I assume
the $HOSTADDRESS is just by the server.cfg file, but not sure on
$USER7$ as well as push the other $ARG$ one's, becuase as you see from
the cli hand entering the info it works!

Can't wait till all this basic stuff is behind me!


Lee Azzarello wrote:

  On Mon, Feb 23, 2009 at 11:10 PM, Lance Raymond
lraym...@weatherflow.com wrote:
  
  
I tried a few things after the _v1 but nagios kept failing on restart, so I
think I just need to figure out howto push the variables (such as hostIP ,
community name) to the check_command, right? Since changing the service to
use the new command, nagios is now red with the following error;Return code
of 255 is out of bounds

  
  
There are good docs on how to pass arguments to a check command.
Here's my pattern to bring new service checks online without mucking
about with the running nagios process:

1) Find the plugin on the host running nagios
2) Run the plugin manually as the nagios user with the arguments you
would like. Inspect the output. Is it what you expect?
3) If it is what you expect, compose the check command and service
definition exactly as your manual run, only use macro substitution as
per the documentation
4) Save your configs and check them before loading the new objects.
nagios -v nagios.cfg will automate this check and inform you that
everything is okay before you "go live".
5) If everything's okay, reload the configuration and check the web
interface for the new service check status.

-lee

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null
  






--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] 1st post to the group, basic q ... I think

2009-02-24 Thread Lance Raymond




Sorry for the quick update, but don't want people to waste time, or
they may change based on my update.

Well, reading the http://nagios.manubulon.com/index_commands.html#load
page more, I am trying to understand a bit more. So my commands.cfg
file now looks like this;

define command{
 command_name check_snmp_load_v1
 command_line $USER1$/check_snmp_load.pl -H $HOSTADDRESS$ -C
community -T netsl -w 3,3,2 -c 4,4,3 -C community

}

and the server.cfg file looks like this;

define service{
 use local-service ; Name of
service$
 host_name bigbird
 service_description Current Load via SNMP
 check_command
check_snmp_load_v1!netsl!3,3,2!4,4,3
}

All seems to be working now, I need to understand the alarm levels a
bit (that's easy reading) as well as figure if I even need the !netsl,
etc. on the check_command line.

But thanks for the push's in the right direction


Lee Azzarello wrote:

  On Mon, Feb 23, 2009 at 11:10 PM, Lance Raymond
lraym...@weatherflow.com wrote:
  
  
I tried a few things after the _v1 but nagios kept failing on restart, so I
think I just need to figure out howto push the variables (such as hostIP ,
community name) to the check_command, right? Since changing the service to
use the new command, nagios is now red with the following error;Return code
of 255 is out of bounds

  
  
There are good docs on how to pass arguments to a check command.
Here's my pattern to bring new service checks online without mucking
about with the running nagios process:

1) Find the plugin on the host running nagios
2) Run the plugin manually as the nagios user with the arguments you
would like. Inspect the output. Is it what you expect?
3) If it is what you expect, compose the check command and service
definition exactly as your manual run, only use macro substitution as
per the documentation
4) Save your configs and check them before loading the new objects.
nagios -v nagios.cfg will automate this check and inform you that
everything is okay before you "go live".
5) If everything's okay, reload the configuration and check the web
interface for the new service check status.

-lee

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null
  






--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] 1st post to the group, basic q ... I think

2009-02-23 Thread Lance Raymond
so far the install, etc. went easy there are some good doc's there.

I am still  understanding the groups, etc. but so far I have the server 
and 2 remote hosts setup.  The load check isn't really to good, so I 
downloaded the perl one using snmp and from the command line works 
great, see this example;

[r...@ws6 libexec]# perl ./check_snmp_load.pl -H ipadd -C community -w 
3,3,2 -c 4,4,3 -T netsl
Load : 5.55 5.04 4.87 : 5.55  4 : CRITICAL 5.04  4 : CRITICAL 4.87  3 
: CRITICAL

I added the following to the commands.cfg file;

define command{
   command_name check_snmp_load_v1
   command_line $USER1$/check_snmp_load.pl -H $HOSTADDRESS$ $USER7$ -T 
$ARG1$ -w $ARG2$ -c $ARG3$ $ARG4$
}

on the target machine I put;

define service{
use local-service ; Name of 
service template to use
host_name   servername
service_description Current Load via SNMP
check_command   
check_local_load!5.0,4.0,3.0!10.0,6.0,4.0
}

I restarted nagios, saw the new line in the details, but the values are 
showing as;
OK - load average: 0.10, 0.06, 0.01

I did have a shell opened testing the local script at the same time and 
the load is still around 5.

I will have other questions, etc. as I read/play more, but just wanted 
to get this part done, then look at thresholds, etc. but if anyone is 
using this script and notice something obvious, please feel free to let 
me know.

Thanks.



--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] 1st post to the group, basic q ... I think

2009-02-23 Thread Lance Raymond
ah, after all the install stuff to fall with something like that was just 
bad!  Well your right, but I am still having a problem.  I read alot on 
hostgroups, etc. and starting writing out the map of what and how so to then 
go back to this, man, I am ready for bed!  But here is the scoop / problem.

The command file is untouched.  There is nothing in the services file as I 
am only testing on one host, so it's in that servername.cfg file.  Once 
things work I can move it, but that box now has the following;


define command{
   command_name check_snmp_load_v1
   command_line $USER1$/check_snmp_load.pl -H $HOSTADDRESS$ $USER7$ -T 
$ARG1$ -w $ARG2$ -c $$
}
define host{
use linux-server; Name of host template to 
use
; This host definition will 
inherit all variables that are defined
; in (or inherited by) the 
linux-server host template definition.
host_name  db1
alias   db1
address   public ip here
}
define service{
use local-service ; Name of 
service template to use
host_name   bigbird
service_description Current Load via SNMP
check_command   check_snmp_load_v1
 }

I tried a few things after the _v1 but nagios kept failing on restart, so I 
think I just need to figure out howto push the variables (such as hostIP , 
community name) to the check_command, right?  Since changing the service to 
use the new command, nagios is now red with the following error; Return code 
of 255 is out of bounds

I'm getting closer, as I said some good reading 2nt, but not for this, so 
hopefully someone has a quick answer and I promise to have more intelligent 
questions after  ;)
--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null