[Nagios-users] [UPDATE] remote check_by path question
Sorry, after 10 or so minutes (not sure about the latency) the error did change. It now shows the following:

    Warning threshold must be float or float triplet!

I am heading into a meeting now, but at least there is progress and something else I can google, unless someone already knows of this and has a simple fix. Will check back after the meeting regardless. Thanks as always.

-- Forwarded message --
From: lance raymond lance.raym...@gmail.com
Date: Tue, Oct 27, 2009 at 11:01 AM
Subject: remote check_by path question
To: Nagios Users nagios-users@lists.sourceforge.net

[...]

Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] remote check_by path question
I now want to add some remote checks, and it looks like a simple path issue, yet I can't seem to verify it. The GroundWork server and all the webservers are running Ubuntu Server 8.10. SSH keys are set up for the user nagios, and I can ssh in to the 1st webserver with no problem. I am / was getting the following error:

    Remote command execution failed: sh: libexec/check_load: not found

and sure enough nothing was on that remote webserver. So I installed both of these packages with aptitude:

    nagios-nrpe-plugin - Nagios Remote Plugin Executor Plugin
    nagios-nrpe-server - Nagios Remote Plugin Executor Server

Not sure if I need the server or not, but it's installed, and check_load and the others are put in the default location:

    /usr/lib/nagios/plugins/

Looking at the error, I simply made a symlink under /home/nagios/libexec, but I am still getting the same error. Below is just a copy/paste of the ssh login and test.

    nag...@groundwork:/usr/local/groundwork/nagios/etc$ ssh 192.168.50.101
    ssh: /usr/local/groundwork/common/lib/libcrypto.so.0.9.8: no version information available (required by ssh)
    Linux ws1 2.6.27-7-server #1 SMP Tue Nov 4 20:16:57 UTC 2008 x86_64
    Last login: Tue Oct 27 10:57:55 2009 from 192.168.50.6
    $ libexec/check_load -w 5,4,3 -c 10,8,6
    OK - load average: 0.74, 1.43, 1.66|load1=0.740;5.000;10.000;0; load5=1.430;4.000;8.000;0; load15=1.660;3.000;6.000;0;

So things do seem to work when run by hand; I am just not sure why the check fails. The check command shows the variable USER22. I could probably hardcode the full path, but I am not sure if that's the right thing to do. Thanks as always.
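A relative path such as libexec/check_load in a check_by_ssh command is looked up relative to the directory the remote shell starts in (normally the remote user's home directory), so one common way around the problem is to give the remote plugin path in full. The sketch below assumes the plugins live in /usr/lib/nagios/plugins/ on the webserver, as described above. The command name check_load_by_ssh, the host name ws1, the service description and the choice of passing the thresholds as $ARG1$ and $ARG2$ are hypothetical, and GroundWork's own definition (the one using $USER22$) may look different. If the "float or float triplet" message from the update comes from check_load receiving an empty or malformed -w value, which is only a guess, passing both thresholds explicitly like this would also rule that out.

```cfg
# Hypothetical command: run check_load on the remote host over SSH,
# giving the full plugin path on the remote side.
define command {
    command_name    check_load_by_ssh
    command_line    $USER1$/check_by_ssh -H $HOSTADDRESS$ -C "/usr/lib/nagios/plugins/check_load -w $ARG1$ -c $ARG2$"
}

# Hypothetical service: $ARG1$ is the warning triplet, $ARG2$ the critical triplet.
define service {
    use                     generic-service
    host_name               ws1
    service_description     Load via SSH
    check_command           check_load_by_ssh!5,4,3!10,8,6
}
```

Hardcoding the remote path in one command definition is a reasonable choice when every remote host uses the same plugin directory; a $USERn$ macro only helps if the path needs to change in one place later.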
Re: [Nagios-users] Checking load via snmp
The server and all the devices I will be polling are Dell PowerEdge servers: the 1750, 1850 and 1950 models. The plugin I am using is the one that came with the install. I will look at some other options, including the Nagios Exchange, as there may be others; I just saw this one, figured load was a basic check, and so used the stock plugin. Thanks. (Sorry, Andy, for the direct response.)

On Thu, Oct 22, 2009 at 7:36 PM, Andy Ford a...@andyford.org wrote:

What kind of device are you polling? It could be that it provides both load and percent CPU used via SNMP, and that your plugin is using the percent CPU OID(s).

On Thu, Oct 22, 2009 at 2:20 PM, lance raymond lance.raym...@gmail.com wrote:

I see a nice little check_snmp_load plugin that comes with GroundWork/Nagios, but I am not sure if it's correct. When I run it from the command line on the nagios server:

    ./check_snmp_load.pl -H 192.168.50.101 -C community -w 50 0 -c 60

I get the following reply:

    4 CPU, average load 14.8 50 : OK

At the same time, a 'w' on the webserver shows:

    r...@ws1:~# w
    15:18:31 up 21 days, 17:51, 3 users, load average: 1.12, 1.75, 1.72

So I am trying to see if I am missing something as to why the load is showing as 14 when it should be about one. Not sure if anyone is using this rather than check_by_ssh and has feedback, but I would just like to get the immediate server load every x minutes; I am not worried about the 5 minute average, etc. Thanks

--
http://www.andrewford.org
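Andy's guess is consistent with how the manubulon check_snmp_load.pl script is usually described: without a -T option it reports an average CPU percentage across the processors, while -T netsl asks for the 1, 5 and 15 minute load averages published by Net-SNMP, which are the figures 'w' shows. A minimal sketch of a command definition along those lines follows. It assumes the webservers run Net-SNMP; the command name check_snmp_load_avg and the argument order are hypothetical, so check the plugin's --help output before relying on it.

```cfg
# Hypothetical command: ask Net-SNMP for load averages rather than CPU percent.
# $ARG1$ = SNMP community, $ARG2$ = warning triplet, $ARG3$ = critical triplet.
define command {
    command_name    check_snmp_load_avg
    command_line    $USER1$/check_snmp_load.pl -H $HOSTADDRESS$ -C $ARG1$ -T netsl -w $ARG2$ -c $ARG3$
}
```

A service would then call it as, for example, check_snmp_load_avg!community!3,3,2!4,4,3, with thresholds given as load values rather than percentages.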
Re: [Nagios-users] 1st post, so naturally the 1st question by the nagios newb re: socket error
I haven't installed anything on the webservers (yet), but I do wish to check load, CPU, etc. via SNMP, or if there is a better way, like a client daemon, feel free to let me know. I am still reading up on what each file is used for (services.cfg, commands.cfg, etc.), but this is everything I have for that check.

services.cfg:

    define service {
        service_description     check_s1
        use                     generic-service
        hostgroup_name          s1_WebServers
        check_command           check_s1!3!5
    }

check_commands.cfg:

    define command {
        command_name    check_s1
        command_line    $USER1$/check_http -H www.site1.com -w $ARG1$ -c $ARG2$
    }

command.cfg has:

    command[check_s1]=/usr/local/groundwork/nagios/libexec/check_http -H www.site1.com -I $HOSTADDRESS$

As for where the command was run: yes, it was run locally on the GroundWork (Nagios) server, from the following location:

    r...@groundwork:/usr/local/groundwork/nagios/libexec/check_http -H www.site1.com -I 192.168.50.101
    HTTP OK HTTP/1.1 200 OK - 14351 bytes in 0.249 seconds |time=0.249456s;;;0.00 size=14351B;;;0

I am not sure what the NRPE plugin is, but just looking at its options, it looks like it is meant to run on the client, as there is no place for an IP. When run locally it simply says connection refused by host for the site1.com domain. Thanks.

On Thu, Oct 22, 2009 at 4:59 AM, Martin Melin mme...@gmail.com wrote:

Hi,

Where did you define the check_s1 command? On which host? It looks like an NRPE config, so is this on one of the web servers or on your Nagios host? Where did you try running the command from the shell? On the Nagios host or the web server? Please show the command definition from Nagios for check_s1 as well.

Make sure that you've edited the correct NRPE config file and reloaded nrped if you're not using inetd. To check that the check works from the command line, run the check_nrpe command from the Nagios host, not check_http from the Nagios host or the webserver.

Regards,
Martin Melin

On Thu, Oct 22, 2009 at 3:32 AM, lance raymond lance.raym...@gmail.com wrote:

[...]
Re: [Nagios-users] 1st post, so naturally the 1st question by the nagios newb re: socket error
[SOLVED]. That did it. Can't believe I even wrote the command line and included it, but I just followed the default. But thanks. Looking forward to all the learning, both from playing and from the group.

On Thu, Oct 22, 2009 at 10:17 AM, Ortner, Gerald gerald.ort...@gespag.at wrote:

Try

    define command {
        command_name    check_s1
        command_line    $USER1$/check_http -H www.site1.com -I $HOSTADDRESS$ -w $ARG1$ -c $ARG2$
    }

From: lance raymond [mailto:lance.raym...@gmail.com]
Sent: Thursday, 22 October 2009 15:17
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] 1st post, so naturally the 1st question by the nagios newb re: socket error

[...]
[Nagios-users] 1st post, so naturally the 1st question by the nagios newb re: socket error
Hi all,

Well, I am running GroundWork Open Source and am really happy with the layout, etc., and now comes the Nagios side of things. My goal is to set things up neatly and organized, and to learn as I go rather than asking stupid questions along the way. So with that, the 1st task.

I have 8 webservers; most host numerous sites, a few host only select ones. So I have created some hostgroups, adding the servers that belong to each, and wonderfully it works. I can select a server, see the group, click the group, see the servers (did I mention I'm good) :)

Anyway, I want to set up 2 separate checks, one for each website, then associate them with the groups, so for example:

    ws1 (site1, site2)
    ws2 (site1)
    ws3 (site1, site2)
    ws4 (site2)

Pretty simple. So I copied the check_http command where I saw it used and have the following:

    command[check_http]=/usr/local/groundwork/nagios/libexec/check_http -H $HOSTADDRESS$ -I $HOSTADDRESS$
    command[check_s1]=/usr/local/groundwork/nagios/libexec/check_http -H www.site1.com -I $HOSTADDRESS$
    command[check_s2]=/usr/local/groundwork/nagios/libexec/check_http -H www.site2.com -I $HOSTADDRESS$

The services.cfg has the following (will only show 1 to save space):

    define service {
        service_description     check_s1
        use                     generic-service
        hostgroup_name          s1_WebServers
        check_command           check_s1!3!5
    }

Now after the above setup and a restart, I was happy to see check_s1 listed, but it's showing critical with the following error:

    CRITICAL - Socket timeout after 10 seconds

At the command line the check does work:

    ./check_http -H www.site1.com -I 192.168.50.101
    HTTP OK HTTP/1.1 200 OK - 14398 bytes in 0.090 seconds |time=0.089949s;;;0.00 size=14398B;;;0

I have been looking at posts where people increase the timeout, but the response time is not even a second, so there must be something else I am missing.

So I am looking forward to learning more, and welcome any ideas, suggestions, or anything else I should know about using Nagios under GroundWork. This is running under ubuntu-8.10 server, Program Version: 3.0.6. Let me know what else I can or need to provide on this.
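One observation on the setup above: the command[...]= lines are in the format NRPE uses in its own config file, whereas checks run directly by Nagios are defined with define command blocks. As a rough sketch of how the per-site checks could be written that way, a single command can take the site name as an argument instead of needing one command per site. The command name check_vhost, the argument order and the s2_WebServers hostgroup are hypothetical; -H, -I, -w and -c are standard check_http options.

```cfg
# Hypothetical generic command:
# $ARG1$ = virtual host name, $ARG2$ = warning seconds, $ARG3$ = critical seconds.
define command {
    command_name    check_vhost
    command_line    $USER1$/check_http -H $ARG1$ -I $HOSTADDRESS$ -w $ARG2$ -c $ARG3$
}

define service {
    use                     generic-service
    hostgroup_name          s1_WebServers
    service_description     check_s1
    check_command           check_vhost!www.site1.com!3!5
}

define service {
    use                     generic-service
    hostgroup_name          s2_WebServers   ; assumed to exist alongside s1_WebServers
    service_description     check_s2
    check_command           check_vhost!www.site2.com!3!5
}
```

With -I $HOSTADDRESS$ in place, each webserver in the hostgroup is contacted on its own address while the Host header still names the site, which matches what the manual test did.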
[Nagios-users] nagios to graph ethernet traffic.. can it ?
OK, I now have (thanks to the group) my server up, currently watching just one box, with many more to be added. We also run Cacti for ethernet traffic, etc., and I am wondering whether Nagios can do this as well. I have looked on the Nagios Exchange site and don't see anything that can take that type of data and then graph it. If it is possible, what do I need (and what should I already have)? I still need to install nagios graph, and thought I would start with my 1st simple machine, then add some ethernet traffic in/out for it, and once it is checking everything I can simply add other hosts. Thanks as always.
[Nagios-users] check_snmp_storage plugin - 126 return code
Not sure what's wrong; it seems simple enough. I am using the following as a guide for this plugin (http://nagios.manubulon.com/index_commands.html#storage). From the command line on the nagios server I can get it to work:

    perl check_snmp_storage.pl -H x.x.x.x -C community -m / -w 90 -c 95 -r
    /: 67%used(99233MB/147706MB) (90%) : OK

I have the following 2 things set up. In commands.cfg:

    define command{
        command_name    check_snmp_storage
        command_line    $USER1$/check_snmp_storage.pl -H $HOSTADDRESS$ -m $ARG1$ -w $ARG2$ -c $ARG3$ $ARG4$
    }

And the server has the following:

    define service{
        use                     local-service
        host_name               cs0
        service_description     Root / partition
        check_command           check_snmp_storage!/!90!95!-r
    }
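Two differences stand out between the manual run and the definitions above, and either might explain the failure, though neither is confirmed in the thread. First, the manual run passes -C community and the command_line does not. Second, the manual run starts the script through perl, whereas Nagios executes the file directly; Nagios typically reports a return code of 126 when a plugin is found but cannot be executed, for example when the execute bit is missing. A hedged sketch that addresses both follows. The /usr/bin/perl path and the idea of passing the community as $ARG1$ are assumptions; making the script executable would be an alternative to calling perl explicitly.

```cfg
# Sketch only: the perl path and the argument order are assumptions.
# $ARG1$ = SNMP community, $ARG2$ = mount point, $ARG3$ = warning %,
# $ARG4$ = critical %, $ARG5$ = extra options such as -r.
define command {
    command_name    check_snmp_storage
    command_line    /usr/bin/perl $USER1$/check_snmp_storage.pl -H $HOSTADDRESS$ -C $ARG1$ -m $ARG2$ -w $ARG3$ -c $ARG4$ $ARG5$
}

define service {
    use                     local-service
    host_name               cs0
    service_description     Root / partition
    check_command           check_snmp_storage!community!/!90!95!-r
}
```

Running the expanded command line by hand as the nagios user, without the perl prefix, is a quick way to see whether the file itself is executable.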
[Nagios-users] Still some basic 101 items
I'm not sure I should say sorry for the basic questions, as this is what the group is for, but seeing some of the in-depth questions... talk about an inferiority complex. Well, I can now see how to debug the startup (thanks Marc) and figured out my one problem. But back to a few basic things.

I am looking to have nagios monitor 40 or so servers: 10 or so webservers, a few database servers, all with ssh, etc. So as I try to lay this out in my head, here is an example; is this the right thinking?

Each box has a unique name in a server folder, /server/servername.cfg (that is what I have started, and it is kind of working). The webservers will have the following:

    define host {
        host_name
        address
        use         (I think this is my problem)
    }

Now, I think the use above is the actual hostgroup I wish the host to belong to (e.g. webserver, linux-server, etc.). I am thinking I will have group templates: webserver will have an http test, and linux-server can have an ssh check, load, users, etc. If the above makes sense, the use will have multiple entries, e.g. use webserver, linux-server (I assume this is possible). That would take care of defining the default services to check for those particular boxes.

I guess I am looking for both confirmation and suggestions on a top-down setup. I don't want each host to have 10 define service items. I would rather say it is a webserver and a linux box, which by nature will check x, y and z.

The problem so far is that I have 4 servers, and even though I have an address in each, the use is still the default, so ALL the boxes' details (users, swap usage, etc.) are referencing the localhost and not the individual box. That is where I am stuck. The docs are really not too helpful, but I am trying to better understand the separate files, how the use works, when to use a hostgroup, etc. I will gladly provide cfg files, and if someone has some basic examples (aside from the install templates) or some help on the above, it's appreciated.

Lr
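To make the layout question concrete: in Nagios 3 the use directive names a template to inherit settings from, not a hostgroup, and the stock local-service checks (disk, users, swap and so on) run plugins on the Nagios server itself, which would explain why every host shows the localhost's numbers. One common pattern is to put hosts into hostgroups and attach each service to a hostgroup once. The sketch below assumes the linux-server and generic-service templates and the check_http and check_ssh commands from the sample configuration are present; the host name web01, the address 192.0.2.10 and the hostgroup names are placeholders.

```cfg
define hostgroup {
    hostgroup_name  linux-servers
    alias           Linux servers
}

define hostgroup {
    hostgroup_name  webservers
    alias           Web servers
}

# One short definition per box; membership decides which checks it gets.
define host {
    use         linux-server                 ; template to inherit from
    host_name   web01                        ; placeholder name
    address     192.0.2.10                   ; placeholder address
    hostgroups  linux-servers,webservers
}

# Defined once, applied to every member of the hostgroup.
define service {
    use                     generic-service
    hostgroup_name          webservers
    service_description     HTTP
    check_command           check_http
}

define service {
    use                     generic-service
    hostgroup_name          linux-servers
    service_description     SSH
    check_command           check_ssh
}
```

Checks such as load, users or swap on a remote box still need something that runs on or queries that box (SNMP, NRPE or check_by_ssh), since the local plugins only ever see the machine they run on.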
[Nagios-users] initial setups, hostgroup help requested
Still getting things in order, and having a problem with adding my 1st hostgroup. Side note: is there any kind of logging or debugging to see where a failure occurs? For example, I add a new config file with the following basic line in the main conf:

    cfg_file=/usr/local/nagios/etc/hostgroups.cfg

The file has the following:

    define hostgroup{
        hostgroup_name  webservers
        alias           webservers
        contact_groups  sysadmins
        members         cs0
    }

I just made up the alias; the contact group sysadmins exists, and the member cs0 is one of my webservers. In the actual server.cfg file for that server I have the line:

    hostgroups webservers

I don't know if that needs to be there, as I am already defining it as a member in the hostgroups.cfg file. Keeping things like this, I start nagios and simply get:

    Starting nagios:CONFIG ERROR! Start aborted. Check your Nagios configuration.

(which is why I am asking, as the nagios.log says nothing helpful). Now if I comment out the hostgroup line in the main conf, things start, but I would like to use hostgroups for the obvious reasons. I am sure these questions and the new ones will be pretty basic, as the needs are simple, and if I could debug where it is stopping it would be even easier. But again, this will be a 1 or 2 sentence reply, I bet! Tnx
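A likely culprit, offered as a guess rather than a diagnosis: contact_groups does not appear among the hostgroup directives documented for Nagios 3 (contacts are set on hosts and services instead), and an unknown directive is enough to abort startup. On the debugging side, running Nagios in verification mode against the main config, for example nagios -v /usr/local/nagios/etc/nagios.cfg (path assumed from the cfg_file line above), normally prints the offending file and line. A minimal hostgroup without that directive might look like this:

```cfg
define hostgroup {
    hostgroup_name  webservers
    alias           Web servers
    members         cs0     ; list members here, or set "hostgroups webservers" on the host; one of the two is enough
}
```

The contact group would then go on the host or service definitions (or their templates) through their own contact_groups directive.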
[Nagios-users] check_local_disk basic question
Still new to it, but so far things are good. I have added a remote host and am monitoring load via SNMP successfully, but my old alarm keeps tripping due to disk space. df -h shows:

    Filesystem  Size  Used  Avail  Use%  Mounted on
    /fs here    660G  597G  64G    91%   /

so it seems 91% is in use, yet nagios is showing (nice and green):

    DISK OK - free space: / 27867 MB (92% inode=99%):

The config for that machine (servername.cfg) shows:

    check_command check_local_disk!20%!10%!/

So to me that says: warn me when 20% is left, critical alarm at 10%, right? Thanks
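That reading of the thresholds matches the check_disk plugin: -w 20% warns when less than 20% is free, and -c 10% goes critical below 10% free. The catch is probably elsewhere. In the sample configuration, check_local_disk is defined roughly as below, and it runs check_disk on the Nagios server, so the 92% free figure would describe the Nagios server's own / filesystem and not the remote machine whose df output shows 91% used (27867 MB at 92% free implies a filesystem of about 30 GB, not 660 GB). The definition is reproduced from memory of the sample commands.cfg, so compare it with the local file.

```cfg
# As in the sample commands.cfg (from memory; verify locally).
# $ARG1$ = warning (free space), $ARG2$ = critical (free space), $ARG3$ = path.
define command {
    command_name    check_local_disk
    command_line    $USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$
}
```

Checking the remote filesystem needs something that runs on or queries the remote host, such as check_by_ssh, NRPE or an SNMP storage check.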
[Nagios-users] gui config application
Googling around, there are a few out there, mostly ng1, ng2, and I am wondering if there is something nice and clean out there for nagios 3. I don't mind the CLI, but sometimes point and click is just so much easier. I am new to the list, so if this has been discussed, I apologize.
Re: [Nagios-users] 1st post to the group, basic q ... I think
OK, I am closer, but still unclear on the check_command and the define service part (I will try to read more tonight), but I can run it manually as nagios from the CLI:

    ./check_snmp_load.pl -H 1.1.1.1 -C community -w 3,3,2 -c 4,4,3 -T netsl
    Load : 4.37 4.27 4.20 : 4.37 4 : CRITICAL 4.27 4 : CRITICAL 4.20 3 : CRITICAL

so that works. Now in the command.cfg I have the following:

    define command{
        command_name    check_snmp_load_v1
        command_line    $USER1$/check_snmp_load.pl -H $HOSTADDRESS$ $USER7$ -T $ARG1$ -w $ARG2$ -c $ARG3$ $ARG4$ -C community
    }

and in the server.cfg I have:

    define service{
        use                     local-service         ; Name of service$
        host_name               bigbird
        service_description     Current Load via SNMP
        check_command           check_snmp_load_v1!netsl!3,3,2!4,4,3
    }

I have played with this numerous times; that is the latest version. I am trying to understand where the $ARG variables are defined (and how). I assume $HOSTADDRESS$ simply comes from the server.cfg file, but I am not sure about $USER7$, or how to push the other $ARG$ ones, because as you can see, entering the info by hand from the CLI works! Can't wait till all this basic stuff is behind me!

Lee Azzarello wrote:

On Mon, Feb 23, 2009 at 11:10 PM, Lance Raymond lraym...@weatherflow.com wrote:

I tried a few things after the _v1, but nagios kept failing on restart, so I think I just need to figure out how to push the variables (such as host IP, community name) to the check_command, right? Since changing the service to use the new command, nagios is now red with the following error:

    Return code of 255 is out of bounds

There are good docs on how to pass arguments to a check command. Here's my pattern to bring new service checks online without mucking about with the running nagios process:

1) Find the plugin on the host running nagios.
2) Run the plugin manually as the nagios user with the arguments you would like. Inspect the output. Is it what you expect?
3) If it is what you expect, compose the check command and service definition exactly as your manual run, only use macro substitution as per the documentation.
4) Save your configs and check them before loading the new objects. nagios -v nagios.cfg will automate this check and inform you that everything is okay before you "go live".
5) If everything's okay, reload the configuration and check the web interface for the new service check status.

-lee
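On where the $ARGn$ values come from: they are not defined anywhere separately. Nagios splits the check_command value on '!'; the first field is the command name and the remaining fields become $ARG1$, $ARG2$ and so on. $HOSTADDRESS$ is the address of the host the service is attached to, and $USERn$ macros are set in the resource file (resource.cfg in a stock install). The sketch below annotates the definitions from the message under that reading. $USER7$ is left out because its value is not shown in the thread; whether it contributed to the return code of 255 is unknown.

```cfg
# check_command    check_snmp_load_v1!netsl!3,3,2!4,4,3
#
#   check_snmp_load_v1   -> command_name
#   netsl                -> $ARG1$
#   3,3,2                -> $ARG2$
#   4,4,3                -> $ARG3$
#   (nothing)            -> $ARG4$ expands to an empty string

define command {
    command_name    check_snmp_load_v1
    command_line    $USER1$/check_snmp_load.pl -H $HOSTADDRESS$ -C community -T $ARG1$ -w $ARG2$ -c $ARG3$
}
```

If bigbird's address is the 1.1.1.1 used in the manual test, this expands to the same arguments that worked by hand: -H 1.1.1.1 -C community -T netsl -w 3,3,2 -c 4,4,3.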
Re: [Nagios-users] 1st post to the group, basic q ... I think
Sorry for the quick update, but I don't want people to waste time on this, and their answers may change based on my update. Well, after reading the http://nagios.manubulon.com/index_commands.html#load page more, I am starting to understand a bit more. So my commands.cfg file now looks like this:

    define command{
        command_name    check_snmp_load_v1
        command_line    $USER1$/check_snmp_load.pl -H $HOSTADDRESS$ -C community -T netsl -w 3,3,2 -c 4,4,3
    }

and the server.cfg file looks like this:

    define service{
        use                     local-service   ; Name of service$
        host_name               bigbird
        service_description     Current Load via SNMP
        check_command           check_snmp_load_v1!netsl!3,3,2!4,4,3
    }

All seems to be working now. I need to understand the alarm levels a bit (that's easy reading), as well as figure out if I even need the !netsl, etc. on the check_command line. But thanks for the pushes in the right direction.
[Nagios-users] 1st post to the group, basic q ... I think
So far the install, etc. went easily; there are some good docs there. I am still getting to grips with the groups, etc., but so far I have the server and 2 remote hosts set up. The load check isn't really too good, so I downloaded the Perl one that uses SNMP, and from the command line it works great. See this example:

    [r...@ws6 libexec]# perl ./check_snmp_load.pl -H ipadd -C community -w 3,3,2 -c 4,4,3 -T netsl
    Load : 5.55 5.04 4.87 : 5.55 4 : CRITICAL 5.04 4 : CRITICAL 4.87 3 : CRITICAL

I added the following to the commands.cfg file:

    define command{
        command_name    check_snmp_load_v1
        command_line    $USER1$/check_snmp_load.pl -H $HOSTADDRESS$ $USER7$ -T $ARG1$ -w $ARG2$ -c $ARG3$ $ARG4$
    }

On the target machine I put:

    define service{
        use                     local-service   ; Name of service template to use
        host_name               servername
        service_description     Current Load via SNMP
        check_command           check_local_load!5.0,4.0,3.0!10.0,6.0,4.0
    }

I restarted nagios and saw the new line in the details, but the values are showing as:

    OK - load average: 0.10, 0.06, 0.01

I did have a shell open testing the local script at the same time, and the load is still around 5. I will have other questions, etc. as I read and play more, but I just wanted to get this part done and then look at thresholds, etc. If anyone is using this script and notices something obvious, please feel free to let me know. Thanks.
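One thing that stands out in the post above: the new service's check_command names check_local_load, not check_snmp_load_v1, which would explain why the result is the local "OK - load average" line rather than the SNMP output. Here is a minimal Python sketch that lists which command each service definition actually refers to. The helper service_commands is hypothetical (not a Nagios tool), and it ignores templates, includes and anything beyond plain define service blocks.

```python
import re

def service_commands(config_text):
    """Map each service_description to the command name its check_command uses."""
    found = {}
    # Grab the body of every "define service{ ... }" block.
    for body in re.findall(r"define\s+service\s*\{(.*?)\}", config_text, re.S):
        fields = {}
        for raw in body.splitlines():
            line = raw.split(";", 1)[0].strip()  # drop "; comment" text
            parts = line.split(None, 1)          # directive, then its value
            if len(parts) == 2:
                fields[parts[0]] = parts[1].strip()
        if "check_command" in fields:
            # The command name is everything before the first "!".
            name = fields["check_command"].split("!", 1)[0]
            found[fields.get("service_description", "?")] = name
    return found

sample = """
define service{
    use                     local-service   ; Name of service template to use
    host_name               servername
    service_description     Current Load via SNMP
    check_command           check_local_load!5.0,4.0,3.0!10.0,6.0,4.0
}
"""
print(service_commands(sample))
```

Run against the service definition from the post, this reports check_local_load for "Current Load via SNMP", so the new check_snmp_load_v1 command is never called by that service.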
Re: [Nagios-users] 1st post to the group, basic q ... I think
Ah, after all the install stuff, to fall down on something like that was just bad! Well, you're right, but I am still having a problem. I read a lot on hostgroups, etc. and started writing out a map of what goes where and how, only to then come back to this. Man, I am ready for bed! But here is the scoop / problem.

The command file is untouched. There is nothing in the services file as I am only testing on one host, so it's in that servername.cfg file. Once things work I can move it, but that box now has the following:

    define command{
        command_name    check_snmp_load_v1
        command_line    $USER1$/check_snmp_load.pl -H $HOSTADDRESS$ $USER7$ -T $ARG1$ -w $ARG2$ -c $$
    }

    define host{
        use             linux-server    ; Name of host template to use
                                        ; This host definition will inherit all variables that are defined
                                        ; in (or inherited by) the linux-server host template definition.
        host_name       db1
        alias           db1
        address         public ip here
    }

    define service{
        use                     local-service   ; Name of service template to use
        host_name               bigbird
        service_description     Current Load via SNMP
        check_command           check_snmp_load_v1
    }

I tried a few things after the _v1 but nagios kept failing on restart, so I think I just need to figure out how to push the variables (such as host IP, community name) to the check_command, right?

Since changing the service to use the new command, nagios is now red with the following error:

    Return code of 255 is out of bounds

I'm getting closer. As I said, I have some good reading lined up for tonight, but not on this, so hopefully someone has a quick answer, and I promise to have more intelligent questions after ;)
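Some background on that error: Nagios expects a plugin to exit with 0 (OK), 1 (WARNING), 2 (CRITICAL) or 3 (UNKNOWN), and reports any other exit code as out of bounds. With no arguments passed from the service and the command line ending in "-c $$", the plugin is probably being started with incomplete options and bailing out, though that is a guess from the config shown. The Python sketch below shows one way to look at the exit code of a command by hand. The helper run_plugin is hypothetical, and the "plugin" it runs is a stand-in one-liner, not check_snmp_load.pl.

```python
import subprocess
import sys

# Standard plugin exit codes; anything else counts as out of bounds.
STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}

def run_plugin(argv):
    """Run a plugin command and return (state, exit_code, first_output_line)."""
    result = subprocess.run(argv, capture_output=True, text=True)
    state = STATES.get(result.returncode, "OUT OF BOUNDS")
    lines = (result.stdout or result.stderr).strip().splitlines()
    return state, result.returncode, lines[0] if lines else ""

# Stand-in for a real plugin: prints a message and exits with 255.
fake_plugin = [sys.executable, "-c",
               "import sys; print('usage error'); sys.exit(255)"]
print(run_plugin(fake_plugin))
```

To try it on a real check, replace fake_plugin with the fully expanded command line as a list of arguments, run as the nagios user, and compare the exit code with the table above.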