[Nagios-users] Problem connecting to nsclient++
Hi, I have an odd problem connecting to a Windows host running nsclient++ We have a server, running Nagios 3.2.0 under RedHat Fedora 12, which connects over a VPN to another network. We are already monitoring other devices over the VPN with no problem (phone systems, wireless access points etc). I have to add a Windows box, running windows 7, to the monitoring. I have installed nsclient++ on it using a configuration that works with other Windows machines. However, the nagios server is unable to connect to the nsclient - the connection times out. Trying to telnet in on the nsclient port also gives a timeout. To make things worse, I can connect (via telnet) from other machines on the nagios server network, so I know that nsclient is running correctly. I am also able to connect from other networks without any issue. Running nsclient++ -test I can see all other machines connecting in, and giving results as expected. However, when I try connect from the main nagios server there is nothing, not even an unauthorized access message (yes, I've checked that as well). The server can ping the windows machine OK, and the windows firewall is turned off. The windows machine can also ping back to the nagios server, so there is a link there. Any thoughts/ideas would be appreciated. Thanks Pete Dewell -- Pete Dewell | Stuff Done p...@stuff-done.co.uk ** The information contained in this message, including any attachment, is confidential and may be privileged or otherwise protected from disclosure. The information is intended only for the person or entity to which it is addressed. If you are not the intended recipient, please contact the sender and delete this message from your system. Any review, re-transmission, distribution or other use of this information by persons or entities other than the intended recipient is prohibited. * -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] [Nagios-devel] RFC/RFP Nagios command workers
On 06/28/2011 05:13 PM, Matthieu Kermagoret wrote: Hi list, First of all, sorry for the delayed response, last month was pretty crazy at work :-p On Mon, May 23, 2011 at 12:38 PM, Andreas Ericssona...@op5.se wrote: On 05/23/2011 11:37 AM, Matthieu Kermagoret wrote: Because shipping an official module that does it would mean not only supporting the old complexity, but also the new one. Having a single default system for running checks would definitely be preferrable to supporting multiple ones. I agree with you when you say that a single system is better than two. However I fear that the worker system would need very more code than a simpler system (and less code usually means less bugs) and that the worker system would destabilize Nagios. Quite the opposite, really. The amount of backflips we're doing right now to make sure the core is threadsafe is huge, so it's likely this patch will even reduce the LoC count in Nagios. For years it's been Nagios' development team's policy not to include features that could be written as modules. I liked it that way. Everything can be written as modules. The worker process thing will have the nice sideeffect that modules can register sockets that core Nagios will listen to events from, with a special callback when there's data available on the socket. This reduces complexity of a lot of modules by a fair bit. With worker-processes instead of multiple threads it's also trivial to write modules with regards to thread-safety, and potential leaks in worker modules (such as embedded perl) can be ignored, since we can just kill the worker process and spawn a new one once it's done some arbitrary number of checks. This is how Apache handles leaky modules and we could do far worse than using the world's most popular webserver as an example. There's also another thing. Mozilla Firefox has been accused of feature stagnation in the core since they let addon writers handle adding new features, and far from everybody uses modules. Google Chrome has taken a fair share of users from Firefox lately, partly because it implements some of the more popular modules directly in-core. Nagios has also been accused of feature stagnation, even though broker module development has flourished in recent years (nagios with modules is nothing like the old nagios without them), so it makes sense to add certain selected module capabilities to the core. 1) Remove the multiple fork system to execute a command. The Nagios Core process forks directly the process that will exec the command (more or less sh's parsing of command line, don't really know if this could/should be integreted in the Core). This really can't be done without using multiple threads since the core can't wait() for input and children while at the same time issuing select() calls to multiplex the new output of currently running checks. What about a signal handler on SIGCHLD that would wait() terminated process and a select() on pipe FDs connected to child processes, with a timeout to kill non-responding checks ? Highly impractical for shortlived children and with so many pipes to listen to. It would mean we'd be iterating over the entire childstack several hundred times per second just to read new output. We're forced to do that, since pipes can't contain an infinite amount of data. The child's write() call will fail when the pipe is full and the children won't exit while waiting to write. Doing so many select() calls means the scheduler will suffer greatly, along with modules that wish to run code in the main thread every now and then. With sockets, we can let each worker handle a smaller number of checks at the time, and since they have no scheduling responsibilities the master process is free to just await new input. 2) The root process and the subprocess are connected with a pipe() so that the command output can be fetched by reading the pipe. Nagios will maintain a list of currently running commands. Pipes are limited in that they only guarantee 512 bytes of atomic writes and reads. TCP sockets don't have this problem. There's also It is my understanding of Posix that the core standard defines a 512-byte minimal limit for atomic I/O operations but I cannot find any section enforcing atomicity on I/O operations on TCP sockets, so pipes would be better indeed. Were you refering to the XSI Streams or could you point me to the appropriate section ? No. TCP sockets don't enforce atomicity beyond the 512 bytes already specified, but they do enforce ordering, which pipes don't. This is actually a real problem (although an unusual one) when several processes tries to write data to Nagios' command pipe and one of them writes more than the atomic limit on whatever system it's being written on. The fact that pipes use fixed-size buffers for pipes (requiring a full kernel recompile to change) and the fact that a program can change the size of its socket buffers with a simple
[Nagios-users] Antwort: RE: MRTG graphs problem - measurement units not taken into consideration
Ok, Here is what I got so far: 1. If I run the command: ./check_snmp_int.pl -H 192.168.2.1 -C public -2 -n GigabitEthernet0/24 -fY -kqB -g -w 60,60,100,100,10,10 -c 80,80,200,200,20,20 -d 5 , I get the following output: GigabitEthernet0/24:UP (16.9Kbps/26.2Kbps/0.0/0.0/0.0/0.0):1 UP: OK | 'GigabitEthernet0/24_in_bps'=16899;6;8;0;10 'GigabitEthernet0/24_out_bps'=26237;6;8;0;10 Before getting a correct output I run the command several times. The initial output is GigabitEthernet0/24:UP No usable data on file (102 rows) :(1 UP): UNKNOWN 2. Defined the following command in nagios: define command{ command_namecheck_snmp_int command_line$USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C $USER5$ -2 -n $ARG1$ -fY -kqB -g -w $ARG2$ -c $ARG3$ -d $ARG4$ } Where And the corresponding service check: define service{ use generic-service,pnpgraphed-service ; Inherit values from a template host_name test_sw service_description Port 24 Bandwidth Usage check_command check_snmp_int!GigabitEthernet0/24!60,60,100,100,10,10!80,80,200,200,20,20!10 normal_check_interval 2 ; Check the service every 2 units under normal conditions retry_check_interval1 ; Re-check the service every 1 unit until its final/hard state is determined } In nagios service status I get now: ERROR: Description table : No response from remote host 192.168.2.1. In resources I have defined $USER5$=Public Ideas, please? Thank you! Von:Radu Popa/Technomatic An: nagios-users@lists.sourceforge.net Datum: 06/28/2011 05:14 PM Betreff:RE: MRTG graphs problem - measurement units not taken into consideration I have tried the SNMP plugin a few minutes ago and here is what I get when running it from the command line: ./check_snmp_int.pl -H 192.168.2.1 -C public -n GigabitEthernet0/24 -k -w 200,400 -c 0,600 GigabitEthernet0/24:UP No usable data on file (1 rows) :(1 UP): UNKNOWN The interface name I took it from listing all available interfaces on the switch. Any ideas, please? Thanks! Von:Radu Popa/Technomatic An: nagios-users@lists.sourceforge.net Datum: 06/28/2011 03:31 PM Betreff:MRTG graphs problem - measurement units not taken into consideration Hello, I am using Nagios to monitor the bandwidth usage of a couple of switches/routers via the check_local_mrtgtraf command. The graphs I handle via pnp4nagios. The problem I encounter is the following: if the monitored bandwidth is at one point expressed in KB/s and then I bring a lot of traffic through the router the, unit is changed into MB/s. The graphed values seem not to take into consideration the change of the measurement unit meaning that the 50KB/s seems to be at the same height as the 50MB/s value in the graph. Can anyone please help? Thank you! -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Antwort: RE: MRTG graphs problem - measurement units not taken into consideration
Was a typo error of the community string, after putting public instead of Public everything looks cool. Still 1 question: If I try to query interface GigabitEthernet0/2 I get the response: GigabitEthernet0/21:DOWN, GigabitEthernet0/23:DOWN, GigabitEthernet0/24:UP No usable data on file (40 rows) , GigabitEthernet0/2:UP No usable data on file (1 rows) , GigabitEthernet0/22:DOWN, GigabitEthernet0/20:UP No usable data on file (1 rows) : 3 int NOK : CRITICAL So ti seems that it takes all interfaces starting with 0/2 string. Any way to limit this? Do I have to enter any escape character? Thank you! Von:Radu Popa/Technomatic An: nagios-users@lists.sourceforge.net Datum: 06/29/2011 01:29 PM Betreff:Antwort: RE: MRTG graphs problem - measurement units not taken into consideration Ok, Here is what I got so far: 1. If I run the command: ./check_snmp_int.pl -H 192.168.2.1 -C public -2 -n GigabitEthernet0/24 -fY -kqB -g -w 60,60,100,100,10,10 -c 80,80,200,200,20,20 -d 5 , I get the following output: GigabitEthernet0/24:UP (16.9Kbps/26.2Kbps/0.0/0.0/0.0/0.0):1 UP: OK | 'GigabitEthernet0/24_in_bps'=16899;6;8;0;10 'GigabitEthernet0/24_out_bps'=26237;6;8;0;10 Before getting a correct output I run the command several times. The initial output is GigabitEthernet0/24:UP No usable data on file (102 rows) :(1 UP): UNKNOWN 2. Defined the following command in nagios: define command{ command_namecheck_snmp_int command_line$USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C $USER5$ -2 -n $ARG1$ -fY -kqB -g -w $ARG2$ -c $ARG3$ -d $ARG4$ } Where And the corresponding service check: define service{ use generic-service,pnpgraphed-service ; Inherit values from a template host_name test_sw service_description Port 24 Bandwidth Usage check_command check_snmp_int!GigabitEthernet0/24!60,60,100,100,10,10!80,80,200,200,20,20!10 normal_check_interval 2 ; Check the service every 2 units under normal conditions retry_check_interval1 ; Re-check the service every 1 unit until its final/hard state is determined } In nagios service status I get now: ERROR: Description table : No response from remote host 192.168.2.1. In resources I have defined $USER5$=Public Ideas, please? Thank you! Von:Radu Popa/Technomatic An: nagios-users@lists.sourceforge.net Datum: 06/28/2011 05:14 PM Betreff:RE: MRTG graphs problem - measurement units not taken into consideration I have tried the SNMP plugin a few minutes ago and here is what I get when running it from the command line: ./check_snmp_int.pl -H 192.168.2.1 -C public -n GigabitEthernet0/24 -k -w 200,400 -c 0,600 GigabitEthernet0/24:UP No usable data on file (1 rows) :(1 UP): UNKNOWN The interface name I took it from listing all available interfaces on the switch. Any ideas, please? Thanks! Von:Radu Popa/Technomatic An: nagios-users@lists.sourceforge.net Datum: 06/28/2011 03:31 PM Betreff:MRTG graphs problem - measurement units not taken into consideration Hello, I am using Nagios to monitor the bandwidth usage of a couple of switches/routers via the check_local_mrtgtraf command. The graphs I handle via pnp4nagios. The problem I encounter is the following: if the monitored bandwidth is at one point expressed in KB/s and then I bring a lot of traffic through the router the, unit is changed into MB/s. The graphed values seem not to take into consideration the change of the measurement unit meaning that the 50KB/s seems to be at the same height as the 50MB/s value in the graph. Can anyone please help? Thank you! -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How to use check_esx3 Blacklists
On 29 June 2011 09:52, Mies, Christian christian.m...@it-novum.com wrote: Hi List, I try to monitor my Datastores by check_esx3 by Datacenter. I want to exclude each local_* Datastore. The following command is not working .. ./check_esx3.pl -D 10.1.82.34 -u user -p pw -l vmfs -x local_* What am I doing wrong? At first glance, my guess is your shell is interpreting the * as a file wildcard. You probably need to put quotes around it like so: ./check_esx3.pl -D 10.1.82.34 -u user -p pw -l vmfs -x local_* Alternatively, backslash-escape it like so: ./check_esx3.pl -D 10.1.82.34 -u user -p pw -l vmfs -x local_\* It's not clear whether check_esx3.pl supports wildcard notation though. Without actually looking at the code I would guess not, or if it does it might use regular expressions in which case the syntax would be: ./check_esx3.pl -D 10.1.82.34 -u user -p pw -l vmfs -x local_.* -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Antwort: RE: MRTG graphs problem - measurement units not taken into consideration
On 29 June 2011 11:45, radu.p...@technomatic.de wrote: Was a typo error of the community string, after putting public instead of Public everything looks cool. Still 1 question: If I try to query interface GigabitEthernet0/2 I get the response: GigabitEthernet0/21:DOWN, GigabitEthernet0/23:DOWN, GigabitEthernet0/24:UP No usable data on file (40 rows) , GigabitEthernet0/2:UP No usable data on file (1 rows) , GigabitEthernet0/22:DOWN, GigabitEthernet0/20:UP No usable data on file (1 rows) : 3 int NOK : CRITICAL So ti seems that it takes all interfaces starting with 0/2 string. Any way to limit this? Do I have to enter any escape character? Thank you! I think you need the -r option, which means Do not use regexp to match NAME in description OID. -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Printer error
Hi i have configured nagios for monitoring Printers but i'm getting the error as return code of 127 is out of bounds - plugin may be missing Please help me regarding this issue. -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Printer error
Hi all, Usually we get this problem after a nagios reload and not a restart. So just stop and start nagios? Yours faithfully, Wilfred de Bondt -- Systems Administrator Stachanov Group: phone: (+31)205091010 For more information, see website web: www.stachanov.com/ www.issuu.com/stachanov http://www.issuu.com/stachanov From: nag ios [mailto:nagiost...@gmail.com] Sent: woensdag 29 juni 2011 13:31 To: Nagios Users List Subject: [Nagios-users] Printer error Hi i have configured nagios for monitoring Printers but i'm getting the error as return code of 127 is out of bounds - plugin may be missing Please help me regarding this issue. -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Antwort: RE: MRTG graphs problem - measurement units not taken into consideration
After adding the -r option it works like a charm! Now in pnp4nagios I can see both values on the same graph: bytes in and bytes out. I see them as two thin lines. Any way to make the lines thicker and change their colour? Von:Radu Popa/Technomatic An: nagios-users@lists.sourceforge.net Datum: 06/29/2011 01:45 PM Betreff:Antwort: RE: MRTG graphs problem - measurement units not taken into consideration Was a typo error of the community string, after putting public instead of Public everything looks cool. Still 1 question: If I try to query interface GigabitEthernet0/2 I get the response: GigabitEthernet0/21:DOWN, GigabitEthernet0/23:DOWN, GigabitEthernet0/24:UP No usable data on file (40 rows) , GigabitEthernet0/2:UP No usable data on file (1 rows) , GigabitEthernet0/22:DOWN, GigabitEthernet0/20:UP No usable data on file (1 rows) : 3 int NOK : CRITICAL So ti seems that it takes all interfaces starting with 0/2 string. Any way to limit this? Do I have to enter any escape character? Thank you! Von:Radu Popa/Technomatic An: nagios-users@lists.sourceforge.net Datum: 06/29/2011 01:29 PM Betreff:Antwort: RE: MRTG graphs problem - measurement units not taken into consideration Ok, Here is what I got so far: 1. If I run the command: ./check_snmp_int.pl -H 192.168.2.1 -C public -2 -n GigabitEthernet0/24 -fY -kqB -g -w 60,60,100,100,10,10 -c 80,80,200,200,20,20 -d 5 , I get the following output: GigabitEthernet0/24:UP (16.9Kbps/26.2Kbps/0.0/0.0/0.0/0.0):1 UP: OK | 'GigabitEthernet0/24_in_bps'=16899;6;8;0;10 'GigabitEthernet0/24_out_bps'=26237;6;8;0;10 Before getting a correct output I run the command several times. The initial output is GigabitEthernet0/24:UP No usable data on file (102 rows) :(1 UP): UNKNOWN 2. Defined the following command in nagios: define command{ command_namecheck_snmp_int command_line$USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C $USER5$ -2 -n $ARG1$ -fY -kqB -g -w $ARG2$ -c $ARG3$ -d $ARG4$ } Where And the corresponding service check: define service{ use generic-service,pnpgraphed-service ; Inherit values from a template host_name test_sw service_description Port 24 Bandwidth Usage check_command check_snmp_int!GigabitEthernet0/24!60,60,100,100,10,10!80,80,200,200,20,20!10 normal_check_interval 2 ; Check the service every 2 units under normal conditions retry_check_interval1 ; Re-check the service every 1 unit until its final/hard state is determined } In nagios service status I get now: ERROR: Description table : No response from remote host 192.168.2.1. In resources I have defined $USER5$=Public Ideas, please? Thank you! Von:Radu Popa/Technomatic An: nagios-users@lists.sourceforge.net Datum: 06/28/2011 05:14 PM Betreff:RE: MRTG graphs problem - measurement units not taken into consideration I have tried the SNMP plugin a few minutes ago and here is what I get when running it from the command line: ./check_snmp_int.pl -H 192.168.2.1 -C public -n GigabitEthernet0/24 -k -w 200,400 -c 0,600 GigabitEthernet0/24:UP No usable data on file (1 rows) :(1 UP): UNKNOWN The interface name I took it from listing all available interfaces on the switch. Any ideas, please? Thanks! Von:Radu Popa/Technomatic An: nagios-users@lists.sourceforge.net Datum: 06/28/2011 03:31 PM Betreff:MRTG graphs problem - measurement units not taken into consideration Hello, I am using Nagios to monitor the bandwidth usage of a couple of switches/routers via the check_local_mrtgtraf command. The graphs I handle via pnp4nagios. The problem I encounter is the following: if the monitored bandwidth is at one point expressed in KB/s and then I bring a lot of traffic through the router the, unit is changed into MB/s. The graphed values seem not to take into consideration the change of the measurement unit meaning that the 50KB/s seems to be at the same height as the 50MB/s value in the graph. Can anyone please help? Thank you! -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] A question on status.cgi and resources
I am working on a pretty large distributed Nagios core deployment and have a question regarding resources. It appears that almost every morning @ about 7:20 am, my cpu usage goes from about 4% to about 25-40% . It will do this for a few hours and then go back to 4%. It appears to me that the issue is with wwwrun running status.cgi. I was wondering what would be going on during the morning at about the same time every day as opposed to 24hrs / day. I was looking for some kind of maintenance it could be doing or something but am not finding anything. I know there are no consoles open, 99.5 % of the checks the core server is processing are remote checks. I know my environment will require multiple core servers ( 700 remote sites, 15 clients / site , 20 +/- checks per client ) but have been working all along to tune the servers for the best efficiency. Are there any tuning parameters for status.cgi?? Thanks, Steve -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Antwort: RE: MRTG graphs problem - measurement units not taken into consideration
radu.p...@technomatic.de schrieb: After adding the -r option it works like a charm! Now in pnp4nagios I can see both values on the same graph: bytes in and bytes out. I see them as two thin lines. Any way to make the lines thicker and change their colour? it depends on the pnp4nagios template you use to create this kind of graphs. I assume its templates.dist/check_snmp_int.php http://docs.pnp4nagios.org/pnp-0.6/tpl You can change the Template to fit your needs. Every valid rrdtool option is allowed. http://oss.oetiker.ch/rrdtool/doc/rrdgraph_graph.en.html Joerg -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Antwort: RE: MRTG graphs problem - measurement units not taken into consideration
On 29 June 2011 13:23, radu.p...@technomatic.de wrote: After adding the -r option it works like a charm! Now in pnp4nagios I can see both values on the same graph: bytes in and bytes out. I see them as two thin lines. Any way to make the lines thicker and change their colour? Well done. Yes, you can create a custom template. Take a look under /usr/local/pnp4nagios/share/templates.dist and find the file named check_snmp_int.php . Copy it to the directory /usr/local/pnp4nagios/share/templates and then edit it to get the wider line. For example the line which reads: $def[1] .= LINE1:var1#003300:\in \ ; Change it to read: $def[1] .= LINE3:var1#003300:\in \ ; You will find documentation on how the LINE instruction works at: http://oss.oetiker.ch/rrdtool/doc/rrdgraph_graph.en.html and some general help on PNP templates at: http://docs.pnp4nagios.org/pnp-0.6/tpl -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Printer error
I wouldn't just restart Nagios and hope the problem goes away but try and find out why / where the problem lies. Have you checked by running the plugin itself as the nagios user to ensure it works properly from the command line ? Does the plugin exist in the location that you've configured Nagios to look for it ? Regards, Deborah From: Wilfred de Bondt [mailto:wilf...@stachanov.com] Sent: 29 June 2011 12:36 To: Nagios Users List Subject: Re: [Nagios-users] Printer error Hi all, Usually we get this problem after a nagios reload and not a restart. So just stop and start nagios? Yours faithfully, Wilfred de Bondt -- Systems Administrator Stachanov Group: phone: (+31)205091010 For more information, see website web: www.stachanov.com/http://www.stachanov.com/ www.issuu.com/stachanovhttp://www.issuu.com/stachanov From: nag ios [mailto:nagiost...@gmail.com] Sent: woensdag 29 juni 2011 13:31 To: Nagios Users List Subject: [Nagios-users] Printer error Hi i have configured nagios for monitoring Printers but i'm getting the error as return code of 127 is out of bounds - plugin may be missing Please help me regarding this issue. -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Getting re-notified while in a HARD state
I have a few existing and self-developed plugins that output details of the HARD state: CRITICAL: critical 1, warning 1 Detail 1 Detail 2 What I'd like to do is to be able to be re-notified if, while in the HARD state, the number and/or details change. For example, if the above would go to: CRITICAL: critical 2, warning 1 Detail 1 Detail 2 Detail 3 Anyone have an approach that works? The documentation doesn't indicate it's possible, but I'm sure others have encountered this before and perhaps they've worked through a solution. Kind regards, Frank -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Getting re-notified while in a HARD state
Frank Bulk wrote: I have a few existing and self-developed plugins that output details of the HARD state: CRITICAL: critical 1, warning 1 Detail 1 Detail 2 What I'd like to do is to be able to be re-notified if, while in the HARD state, the number and/or details change. For example, if the above would go to: CRITICAL: critical 2, warning 1 Detail 1 Detail 2 Detail 3 Anyone have an approach that works? The documentation doesn't indicate it's possible, but I'm sure others have encountered this before and perhaps they've worked through a solution. Kind regards, Frank I don't think there's a simple way to do this without having your notification command store the value of the $SERVICEOUTPUT$ macro for the host + service for comparison on the next try. Then you would have to set is_volatile on the service and have the notification command suppress the notification if the $SERVICEOUTPUT$ doesn't change. Another thing you can do is tell Nagios to log the hard state status when only the $SERVICEOUTPUT$ changes by setting the stalking_options in the service. Then, if you have something that watching the log file you can trigger notifications with that. If only this state stalking feature would have an option to send notifications in addition to logging you would be set. Hope this helps, Paul Dubuc -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Expanding Custom Variables
I am trying to monitor multiple volumes on a NetApp system. The format of the command requires a hostname:volumename format. I want to reduce my commands/service definitions to a minimum. My initial thought was to have a generic service definition, that gets more specific with a sub definition. When the command is run, it seems like it is not passing the custom variable, but rather leaving a single $ behind where the variable ought to be. Any ideas on 1) fixing my syntax? 2) Doing this somehow easier? NAF UNKNOWN - No Snapmirror with source or destination myhostname:$ found! define service{ name NA_SnapMirror check_command netapp_snapmirror use GenericService_Core normal_check_interval 1000 max_check_attempts 300 register 0 contact_groups CoreServers } define service { use NA_SnapMirror _navolume myvolumename service_description SnapMirror_groups hosts myhostname } define command{ command_name netapp_snapmirror command_line $USER1$/check_naf.py -H $HOSTADDRESS$ -C $USER8$ snapmirror,$HOSTNAME$:$_SERVICEnavolume$,$USER25$ } For more information about Lewis and Roca LLP, please go to www.lewisandroca.comhttp://www.lewisandroca.com/. Phoenix (602)262-5311 Minden (775)586-9500 Tucson (520)622-2090Albuquerque (505)764-5400 Las Vegas (702)949-8200 Silicon Valley (650)391-1380 Reno (775)823-2900 This message is intended only for the use of the individual or entity to which it is addressed. If the reader of this message is not the intended recipient, or the employee or agent responsible for delivering the message to the intended recipient, you are hereby notified that any dissemination, distribution or copying of this message is strictly prohibited. If you have received this communication in error, please notify us immediately by replying to the sender of this E-Mail by return E-Mail or by telephone. In accordance with Internal Revenue Service Circular 230, we advise you that if this email contains any tax advice, such tax advice was not intended or written to be used, and it cannot be used, by any taxpayer for the purpose of avoiding penalties that may be imposed on the taxpayer. -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Expanding Custom Variables
Ok - I solved it. It was an Upper/Lower case thing. my variable IN USE needed to be $_SERVICENAVOLUME$ instead of $_SERVICEnavolume$ because it automatically converts the variable name to uppercase. From: Stringham, Steven [mailto:sstri...@lrlaw.com] Sent: Wednesday, June 29, 2011 12:19 PM To: 'Nagios Users List' Subject: [Nagios-users] Expanding Custom Variables I am trying to monitor multiple volumes on a NetApp system. The format of the command requires a hostname:volumename format. I want to reduce my commands/service definitions to a minimum. My initial thought was to have a generic service definition, that gets more specific with a sub definition. When the command is run, it seems like it is not passing the custom variable, but rather leaving a single $ behind where the variable ought to be. Any ideas on 1) fixing my syntax? 2) Doing this somehow easier? NAF UNKNOWN - No Snapmirror with source or destination myhostname:$ found! define service{ name NA_SnapMirror check_command netapp_snapmirror use GenericService_Core normal_check_interval 1000 max_check_attempts 300 register 0 contact_groups CoreServers } define service { use NA_SnapMirror _navolume myvolumename service_description SnapMirror_groups hosts myhostname } define command{ command_name netapp_snapmirror command_line $USER1$/check_naf.py -H $HOSTADDRESS$ -C $USER8$ snapmirror,$HOSTNAME$:$_SERVICEnavolume$,$USER25$ } For more information about Lewis and Roca LLP, please go to www.lewisandroca.comhttp://www.lewisandroca.com/. Phoenix (602)262-5311 Minden (775)586-9500 Tucson (520)622-2090Albuquerque (505)764-5400 Las Vegas (702)949-8200 Silicon Valley (650)391-1380 Reno (775)823-2900 This message is intended only for the use of the individual or entity to which it is addressed. If the reader of this message is not the intended recipient, or the employee or agent responsible for delivering the message to the intended recipient, you are hereby notified that any dissemination, distribution or copying of this message is strictly prohibited. If you have received this communication in error, please notify us immediately by replying to the sender of this E-Mail by return E-Mail or by telephone. In accordance with Internal Revenue Service Circular 230, we advise you that if this email contains any tax advice, such tax advice was not intended or written to be used, and it cannot be used, by any taxpayer for the purpose of avoiding penalties that may be imposed on the taxpayer. -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Expanding Custom Variables
On 6/29/11 12:18 PM, Stringham, Steven wrote: I am trying to monitor multiple volumes on a NetApp system. The format of the command requires a hostname:volumename format. I want to reduce my commands/service definitions to a minimum. My initial thought was to have a generic service definition, that gets more specific with a sub definition. When the command is run, it seems like it is not passing the custom variable, but rather leaving a single $ behind where the variable ought to be. I'm not sure that custom macros are evaluated at the command level? Perhaps set your command_line to pull in the variable from the service: define service { name NA_SnapMirror check_command netapp_snapmirror!$_SERVICEnavolume$ use GenericService_Core normal_check_interval 1000 max_check_attempts 300 register 0 contact_groups CoreServers } define service { use NA_SnapMirror _navolume myvolumename service_description SnapMirror_groups hosts myhostname } define command { command_name netapp_snapmirror command_line $USER1$/check_naf.py -H $HOSTADDRESS$ -C $USER8$ snapmirror,$HOSTNAME$:$ARG1$,$USER25$ } ... Alternately, if you have enable_environment_macros=1 in nagios.cfg, you could instead put $NAGIOS__SERVICEnavolume and pass the reference to the script. One of the two should work for you. If not, then I'd recommend restarting in debug mode, debug_level=18 will get you debug information about both the configuration load process, and the service check execution, so you should be able to figure out the problem - just fire it up in a reduced config set, so you only have this in there and don't get spammed by normal operations. What version, btw? -- Mike Lindsey -- Mike Lindsey -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null