Re: [Nagios-users] distributed nagios ?

2010-12-14 Thread Andreas Ericsson
On 12/13/2010 09:34 PM, stan wrote: I have a Nagios instance that curently monitors about 70 machines. Now I have a addtional network coming on line that will hide behind a firewall using NAT. It seems to me that the way to deal with this is to install Nagios on one of the machines behind the

[Nagios-users] Notfications control and grouping

2010-12-14 Thread Ihab Samara
Hi List We've got a relatively big environment, where Nagios is monitoring about 100 hosts, and checking about 800 services. My question is about notifications. I am looking for a solution that will do this for us: 1) After setting dependency between several services, the notification is sent

[Nagios-users] JVM Monitoring

2010-12-14 Thread Marc-André Doll
Hi list, I have to monitor some JVM and I don't find plugins that fit exactly with what I want/imagine. I could use the check_jmx but I don't really want to install a JRE on my Nagios server. Currently, I'm monitoring Tomcat servers with check_jmx4perl and I'm quite happy with it. Is it

[Nagios-users] Nagiosgraper

2010-12-14 Thread Rikard Dahlberg
Heya! im currently trying to get a decent graphicsystem online to somewhat replace our Cacti. I know nagiosgraph doesn't really do that but it does fit our needs :) Im trying to re-write this config file to create nice shiney graphs for my monitored services, but so far no luck. This is the

[Nagios-users] Cisco Monitoring Problem - Interface Down state not recognize

2010-12-14 Thread Matias Damian
Hi, i have a problem when i want to monitoring a Cisco device (Switch 2960). Qhen i configured Nagios, never show the CRITICAL state, when the port / interface is down, because always receibe a message for SNMP, that it´s Ok. Doing a research, i see this. When i use the terminal of Linux Server

Re: [Nagios-users] check_logs.pl doesn't return output on RHEL 6

2010-12-14 Thread Bret Goodfellow
Thanks Hugo. Yes, I have sent an email to the author but haven’t heard back yet. I have also run the script by hand, the shell output is below: [r...@server ~]# cd /usr/lib64/nagios/plugins/ [r...@server plugins]# ./check_logs.pl -c /etc/nagios/check_logs_linux.cfg [r...@server plugins]#

Re: [Nagios-users] check_logs.pl doesn't return outpu t on RHEL 6

2010-12-14 Thread Hugo van der Kooij
On Tue, 14 Dec 2010 15:12:24 +, Bret Goodfellow wrote: [r...@server ~]# cd /usr/lib64/nagios/plugins/ [r...@server plugins]# ./check_logs.pl -c /etc/nagios/check_logs_linux.cfg [r...@server plugins]# I strongly suspect that this is a perl issue as RHEL 6 is using a newer perl

[Nagios-users] Return code of 127 is out of bounds - only on high cpu load though

2010-12-14 Thread Daniel Wittenberg
I noticed something odd the other day while stressing my servers. I noticed that when I overload it with too many hosts/checks, that I start getting active check failures with the standard 127 code. But, if I slowly reduce the number of hosts/checks, I’ll get to a point where it starts

Re: [Nagios-users] Return code of 127 is out of bounds - only on high cpu load though

2010-12-14 Thread Andreas Ericsson
On 12/14/2010 05:08 PM, Daniel Wittenberg wrote: I noticed something odd the other day while stressing my servers. I noticed that when I overload it with too many hosts/checks, that I start getting active check failures with the standard 127 code. But, if I slowly reduce the number of

Re: [Nagios-users] Return code of 127 is out of bounds - only on high cpu load though

2010-12-14 Thread Daniel Wittenberg
Yeah, the only two I'm testing with are check_nrpe and check_tcp, and it's all of them on every server that start failing. Any idea what kind of shared resources it might be starving? Dan -Original Message- From: Andreas Ericsson [mailto:a...@op5.se] Sent: Tuesday, December 14,

Re: [Nagios-users] JVM Monitoring

2010-12-14 Thread Jim Avery
On 14 December 2010 14:11, Marc-André Doll m...@b-care.net wrote: Hi list, I have to monitor some JVM and I don't find plugins that fit exactly with what I want/imagine. I could use the check_jmx but I don't really want to install a JRE on my Nagios server. Currently, I'm monitoring

Re: [Nagios-users] Return code of 127 is out of bounds - only on high cpu load though

2010-12-14 Thread Andreas Ericsson
On 12/14/2010 05:14 PM, Daniel Wittenberg wrote: Yeah, the only two I'm testing with are check_nrpe and check_tcp, and it's all of them on every server that start failing. Any idea what kind of shared resources it might be starving? Not those two, no. They should be fairly well behaved,

Re: [Nagios-users] Return code of 127 is out of bounds - only on high cpu load though

2010-12-14 Thread Daniel Wittenberg
I ran a full strace of nagios daemon and children and it looks like it was the enable_environment_macros that was causing: [pid 20478] execve(/bin/sh, [sh, -c, . . . . . ] = -1 E2BIG (Argument list too long) 0.000337 [pid 20478] exit_group(127) = ? I turned them off and that fixes

Re: [Nagios-users] distributed nagios ?

2010-12-14 Thread Frost, Mark {PBC}
-Original Message- From: Andreas Ericsson [mailto:a...@op5.se] Sent: Tuesday, December 14, 2010 4:49 AM To: nagios List; doc...@yahoo.com Subject: Re: [Nagios-users] distributed nagios ? Any pointers to docs on how to set it up?

Re: [Nagios-users] check_logs.pl doesn't return output on RHEL 6

2010-12-14 Thread Bret Goodfellow
Yes, my servers are x64. The perl release on RHEL4 is v5.8.5, and the perl release on RHEL5 is v5.8.8, and the perl release on RHEL6 is v5.10.1. From: Hugo van der Kooij [mailto:hvdko...@vanderkooij.org] Sent: Tuesday, December 14, 2010 8:47 AM To: Nagios Users List Subject: Re: [Nagios-users]