[Nagios-users] Problem connecting to nsclient++

2011-06-29 Thread Pete Dewell
Hi,

I have an odd problem connecting to a Windows host running nsclient++

We have a server, running Nagios 3.2.0 under RedHat Fedora 12, which 
connects over a VPN to another network. We are already monitoring other 
devices over the VPN with no problem (phone systems, wireless access 
points etc).

I have to add a Windows box, running windows 7, to the monitoring. I 
have installed nsclient++ on it using a configuration that works with 
other Windows machines. However, the nagios server is unable to connect 
to the nsclient - the connection times out. Trying to telnet in on the 
nsclient port also gives a timeout.

To make things worse, I can connect (via telnet) from other machines on 
the nagios server network, so I know that nsclient is running correctly.
I am also able to connect from other networks without any issue.

Running nsclient++ -test I can see all other machines connecting in, and 
giving results as expected. However, when I try  connect from the main 
nagios server there is nothing, not even an unauthorized access 
message (yes, I've checked that as well).

The server can ping the windows machine OK, and the windows firewall is 
turned off. The windows machine can also ping back to the nagios server, 
so there is a link there.

Any thoughts/ideas would be appreciated.

Thanks

Pete Dewell

-- 

Pete Dewell | Stuff Done
p...@stuff-done.co.uk


** The information contained in this message, including any
attachment, is confidential and may be privileged or otherwise
protected from disclosure. The information is intended only for the
person or entity to which it is addressed. If you are not the intended
recipient, please contact the sender and delete this message from your
system. Any review, re-transmission, distribution or other use of this
information by persons or entities other than the intended recipient
is prohibited. *

--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] [Nagios-devel] RFC/RFP Nagios command workers

2011-06-29 Thread Andreas Ericsson
On 06/28/2011 05:13 PM, Matthieu Kermagoret wrote:
 Hi list,
 
 First of all, sorry for the delayed response, last month was pretty
 crazy at work :-p
 
 On Mon, May 23, 2011 at 12:38 PM, Andreas Ericssona...@op5.se  wrote:
 On 05/23/2011 11:37 AM, Matthieu Kermagoret wrote:
 Because shipping an official module that does it would mean not only
 supporting the old complexity, but also the new one. Having a single
 default system for running checks would definitely be preferrable to
 supporting multiple ones.

 
 I agree with you when you say that a single system is better than two.
 However I fear that the worker system would need very more code than a
 simpler system (and less code usually means less bugs) and that the
 worker system would destabilize Nagios.

Quite the opposite, really. The amount of backflips we're doing right
now to make sure the core is threadsafe is huge, so it's likely this
patch will even reduce the LoC count in Nagios.

 For years it's been Nagios'
 development team's policy not to include features that could be
 written as modules. I liked it that way.
 

Everything can be written as modules. The worker process thing will have
the nice sideeffect that modules can register sockets that core Nagios
will listen to events from, with a special callback when there's data
available on the socket. This reduces complexity of a lot of modules by
a fair bit. With worker-processes instead of multiple threads it's also
trivial to write modules with regards to thread-safety, and potential
leaks in worker modules (such as embedded perl) can be ignored, since
we can just kill the worker process and spawn a new one once it's done
some arbitrary number of checks. This is how Apache handles leaky
modules and we could do far worse than using the world's most popular
webserver as an example.

There's also another thing. Mozilla Firefox has been accused of feature
stagnation in the core since they let addon writers handle adding new
features, and far from everybody uses modules. Google Chrome has taken
a fair share of users from Firefox lately, partly because it implements
some of the more popular modules directly in-core. Nagios has also been
accused of feature stagnation, even though broker module development
has flourished in recent years (nagios with modules is nothing like the
old nagios without them), so it makes sense to add certain selected
module capabilities to the core.

 1) Remove the multiple fork system to execute a command. The Nagios
 Core process forks directly the process that will exec the command
 (more or less sh's parsing of command line, don't really know if this
 could/should be integreted in the Core).


 This really can't be done without using multiple threads since the
 core can't wait() for input and children while at the same time
 issuing select() calls to multiplex the new output of currently
 running checks.

 
 What about a signal handler on SIGCHLD that would wait() terminated
 process and a select() on pipe FDs connected to child processes, with
 a timeout to kill non-responding checks ?
 

Highly impractical for shortlived children and with so many pipes to
listen to. It would mean we'd be iterating over the entire childstack
several hundred times per second just to read new output. We're forced
to do that, since pipes can't contain an infinite amount of data. The
child's write() call will fail when the pipe is full and the children
won't exit while waiting to write. Doing so many select() calls means
the scheduler will suffer greatly, along with modules that wish to run
code in the main thread every now and then.

With sockets, we can let each worker handle a smaller number of checks
at the time, and since they have no scheduling responsibilities the
master process is free to just await new input.

 2) The root process and the subprocess are connected with a pipe() so
 that the command output can be fetched by reading the pipe. Nagios
 will maintain a list of currently running commands.


 Pipes are limited in that they only guarantee 512 bytes of atomic
 writes and reads. TCP sockets don't have this problem. There's also
 
 It is my understanding of Posix that the core standard defines a
 512-byte minimal limit for atomic I/O operations but I cannot find any
 section enforcing atomicity on I/O operations on TCP sockets, so pipes
 would be better indeed. Were you refering to the XSI Streams or could
 you point me to the appropriate section ?
 

No. TCP sockets don't enforce atomicity beyond the 512 bytes already
specified, but they do enforce ordering, which pipes don't. This is
actually a real problem (although an unusual one) when several processes
tries to write data to Nagios' command pipe and one of them writes
more than the atomic limit on whatever system it's being written on.
The fact that pipes use fixed-size buffers for pipes (requiring a full
kernel recompile to change) and the fact that a program can change the
size of its socket buffers with a simple 

[Nagios-users] Antwort: RE: MRTG graphs problem - measurement units not taken into consideration

2011-06-29 Thread Radu . Popa
Ok,

Here is what I got so far:

1. If I run the command: ./check_snmp_int.pl -H 192.168.2.1 -C public -2 
-n GigabitEthernet0/24 -fY -kqB -g -w 60,60,100,100,10,10 -c 
80,80,200,200,20,20 -d 5 , I get the following output:
GigabitEthernet0/24:UP (16.9Kbps/26.2Kbps/0.0/0.0/0.0/0.0):1 UP: OK | 
'GigabitEthernet0/24_in_bps'=16899;6;8;0;10 
'GigabitEthernet0/24_out_bps'=26237;6;8;0;10

Before getting a correct output I run the command several times. The 
initial output is GigabitEthernet0/24:UP No usable data on file (102 rows) 
:(1 UP): UNKNOWN

2. Defined the following command in nagios:

define command{
command_namecheck_snmp_int
command_line$USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C 
$USER5$ -2 -n $ARG1$ -fY -kqB -g -w $ARG2$ -c $ARG3$ -d $ARG4$
}

Where 

And the corresponding service check:

define service{
use generic-service,pnpgraphed-service  ; 
Inherit values from a template
host_name   test_sw
service_description Port 24 Bandwidth Usage
check_command 
check_snmp_int!GigabitEthernet0/24!60,60,100,100,10,10!80,80,200,200,20,20!10
normal_check_interval   2   ; Check the service every 
2 units under normal conditions
retry_check_interval1   ; Re-check the service 
every 1 unit until its final/hard state is determined
}

In nagios service status I get now: ERROR: Description table : No response 
from remote host 192.168.2.1. 

In resources I have defined $USER5$=Public

Ideas, please?

Thank you!


Von:Radu Popa/Technomatic
An: nagios-users@lists.sourceforge.net
Datum:  06/28/2011 05:14 PM
Betreff:RE: MRTG graphs problem - measurement units not taken into 
consideration


I have tried the SNMP plugin a few minutes ago and here is what I get when 
running it from the command line:

./check_snmp_int.pl -H 192.168.2.1 -C public -n GigabitEthernet0/24 -k -w 
200,400 -c 0,600
GigabitEthernet0/24:UP No usable data on file (1 rows) :(1 UP): UNKNOWN

The interface name I took it from listing all available interfaces on the 
switch.

Any ideas, please?

Thanks!




Von:Radu Popa/Technomatic
An: nagios-users@lists.sourceforge.net
Datum:  06/28/2011 03:31 PM
Betreff:MRTG graphs problem - measurement units not taken into 
consideration


Hello,

I am using Nagios to monitor the bandwidth usage of a couple of 
switches/routers via the check_local_mrtgtraf command. The graphs I handle 
via pnp4nagios. The problem I encounter is the following: if the monitored 
bandwidth is at one point expressed in KB/s and then I bring a lot of 
traffic through the router the, unit is changed into MB/s. The graphed 
values seem not to take into consideration the change of the measurement 
unit meaning that the 50KB/s seems to be at the same height as the 50MB/s 
value in the graph.

Can anyone please help?

Thank you!
--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Antwort: RE: MRTG graphs problem - measurement units not taken into consideration

2011-06-29 Thread Radu . Popa
Was a typo error of the community string, after putting public instead 
of Public everything looks cool.

Still 1 question:

If I try to query interface GigabitEthernet0/2 I get the response:

GigabitEthernet0/21:DOWN, GigabitEthernet0/23:DOWN, GigabitEthernet0/24:UP 
No usable data on file (40 rows) , GigabitEthernet0/2:UP No usable data on 
file (1 rows) , GigabitEthernet0/22:DOWN, GigabitEthernet0/20:UP No usable 
data on file (1 rows) : 3 int NOK : CRITICAL

So ti seems that it takes all interfaces starting with 0/2 string. Any 
way to limit this? Do I have to enter any escape character?

Thank you!



Von:Radu Popa/Technomatic
An: nagios-users@lists.sourceforge.net
Datum:  06/29/2011 01:29 PM
Betreff:Antwort: RE: MRTG graphs problem - measurement units not 
taken into consideration


Ok,

Here is what I got so far:

1. If I run the command: ./check_snmp_int.pl -H 192.168.2.1 -C public -2 
-n GigabitEthernet0/24 -fY -kqB -g -w 60,60,100,100,10,10 -c 
80,80,200,200,20,20 -d 5 , I get the following output:
GigabitEthernet0/24:UP (16.9Kbps/26.2Kbps/0.0/0.0/0.0/0.0):1 UP: OK | 
'GigabitEthernet0/24_in_bps'=16899;6;8;0;10 
'GigabitEthernet0/24_out_bps'=26237;6;8;0;10

Before getting a correct output I run the command several times. The 
initial output is GigabitEthernet0/24:UP No usable data on file (102 rows) 
:(1 UP): UNKNOWN

2. Defined the following command in nagios:

define command{
command_namecheck_snmp_int
command_line$USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C 
$USER5$ -2 -n $ARG1$ -fY -kqB -g -w $ARG2$ -c $ARG3$ -d $ARG4$
}

Where 

And the corresponding service check:

define service{
use generic-service,pnpgraphed-service  ; 
Inherit values from a template
host_name   test_sw
service_description Port 24 Bandwidth Usage
check_command 
check_snmp_int!GigabitEthernet0/24!60,60,100,100,10,10!80,80,200,200,20,20!10
normal_check_interval   2   ; Check the service every 
2 units under normal conditions
retry_check_interval1   ; Re-check the service 
every 1 unit until its final/hard state is determined
}

In nagios service status I get now: ERROR: Description table : No response 
from remote host 192.168.2.1. 

In resources I have defined $USER5$=Public

Ideas, please?

Thank you!



Von:Radu Popa/Technomatic
An: nagios-users@lists.sourceforge.net
Datum:  06/28/2011 05:14 PM
Betreff:RE: MRTG graphs problem - measurement units not taken into 
consideration


I have tried the SNMP plugin a few minutes ago and here is what I get when 
running it from the command line:

./check_snmp_int.pl -H 192.168.2.1 -C public -n GigabitEthernet0/24 -k -w 
200,400 -c 0,600
GigabitEthernet0/24:UP No usable data on file (1 rows) :(1 UP): UNKNOWN

The interface name I took it from listing all available interfaces on the 
switch.

Any ideas, please?

Thanks!




Von:Radu Popa/Technomatic
An: nagios-users@lists.sourceforge.net
Datum:  06/28/2011 03:31 PM
Betreff:MRTG graphs problem - measurement units not taken into 
consideration


Hello,

I am using Nagios to monitor the bandwidth usage of a couple of 
switches/routers via the check_local_mrtgtraf command. The graphs I handle 
via pnp4nagios. The problem I encounter is the following: if the monitored 
bandwidth is at one point expressed in KB/s and then I bring a lot of 
traffic through the router the, unit is changed into MB/s. The graphed 
values seem not to take into consideration the change of the measurement 
unit meaning that the 50KB/s seems to be at the same height as the 50MB/s 
value in the graph.

Can anyone please help?

Thank you!
--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] How to use check_esx3 Blacklists

2011-06-29 Thread Jim Avery
On 29 June 2011 09:52, Mies, Christian christian.m...@it-novum.com wrote:

 Hi List,

 I try to monitor my Datastores by check_esx3 by Datacenter. I want to exclude 
 each local_* Datastore. The following command is not working ..



 ./check_esx3.pl -D 10.1.82.34 -u user -p pw -l vmfs -x local_*



 What am I doing wrong?


At first glance, my guess is your shell is interpreting the * as a
file wildcard.  You probably need to put quotes around it like so:

./check_esx3.pl -D 10.1.82.34 -u user -p pw -l vmfs -x local_*

Alternatively, backslash-escape it like so:

./check_esx3.pl -D 10.1.82.34 -u user -p pw -l vmfs -x local_\*

It's not clear whether check_esx3.pl supports wildcard notation
though.  Without actually looking at the code I would guess not, or if
it does it might use regular expressions in which case the syntax
would be:

./check_esx3.pl -D 10.1.82.34 -u user -p pw -l vmfs -x local_.*

--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Antwort: RE: MRTG graphs problem - measurement units not taken into consideration

2011-06-29 Thread Jim Avery
On 29 June 2011 11:45,  radu.p...@technomatic.de wrote:
 Was a typo error of the community string, after putting public instead of
 Public everything looks cool.

 Still 1 question:

 If I try to query interface GigabitEthernet0/2 I get the response:

 GigabitEthernet0/21:DOWN, GigabitEthernet0/23:DOWN, GigabitEthernet0/24:UP
 No usable data on file (40 rows) , GigabitEthernet0/2:UP No usable data on
 file (1 rows) , GigabitEthernet0/22:DOWN, GigabitEthernet0/20:UP No usable
 data on file (1 rows) : 3 int NOK : CRITICAL

 So ti seems that it takes all interfaces starting with 0/2 string. Any way
 to limit this? Do I have to enter any escape character?

 Thank you!


I think you need the -r option, which means Do not use regexp to
match NAME in description OID.

--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Printer error

2011-06-29 Thread nag ios
Hi i have configured nagios for monitoring Printers but i'm getting the
error as return code of 127 is out of bounds - plugin may be missing

Please help me regarding this issue.
--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Printer error

2011-06-29 Thread Wilfred de Bondt
Hi all,

 

Usually we get this problem after a nagios reload and not a restart. So just
stop and start nagios?

 

Yours faithfully,

 

Wilfred de Bondt 

-- 

Systems Administrator

Stachanov Group:  phone: (+31)205091010

For more information, see website web: 

www.stachanov.com/

www.issuu.com/stachanov http://www.issuu.com/stachanov 

 



From: nag ios [mailto:nagiost...@gmail.com] 
Sent: woensdag 29 juni 2011 13:31
To: Nagios Users List
Subject: [Nagios-users] Printer error

 

Hi i have configured nagios for monitoring Printers but i'm getting the
error as return code of 127 is out of bounds - plugin may be missing

 

Please help me regarding this issue.

--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Antwort: RE: MRTG graphs problem - measurement units not taken into consideration

2011-06-29 Thread Radu . Popa
After adding the -r option it works like a charm!

Now in pnp4nagios I can see both values on the same graph: bytes in and 
bytes out. I see them as two thin lines. Any way to make the lines thicker 
and change their colour?



Von:Radu Popa/Technomatic
An: nagios-users@lists.sourceforge.net
Datum:  06/29/2011 01:45 PM
Betreff:Antwort: RE: MRTG graphs problem - measurement units not 
taken into consideration


Was a typo error of the community string, after putting public instead 
of Public everything looks cool.

Still 1 question:

If I try to query interface GigabitEthernet0/2 I get the response:

GigabitEthernet0/21:DOWN, GigabitEthernet0/23:DOWN, GigabitEthernet0/24:UP 
No usable data on file (40 rows) , GigabitEthernet0/2:UP No usable data on 
file (1 rows) , GigabitEthernet0/22:DOWN, GigabitEthernet0/20:UP No usable 
data on file (1 rows) : 3 int NOK : CRITICAL

So ti seems that it takes all interfaces starting with 0/2 string. Any 
way to limit this? Do I have to enter any escape character?

Thank you!




Von:Radu Popa/Technomatic
An: nagios-users@lists.sourceforge.net
Datum:  06/29/2011 01:29 PM
Betreff:Antwort: RE: MRTG graphs problem - measurement units not 
taken into consideration


Ok,

Here is what I got so far:

1. If I run the command: ./check_snmp_int.pl -H 192.168.2.1 -C public -2 
-n GigabitEthernet0/24 -fY -kqB -g -w 60,60,100,100,10,10 -c 
80,80,200,200,20,20 -d 5 , I get the following output:
GigabitEthernet0/24:UP (16.9Kbps/26.2Kbps/0.0/0.0/0.0/0.0):1 UP: OK | 
'GigabitEthernet0/24_in_bps'=16899;6;8;0;10 
'GigabitEthernet0/24_out_bps'=26237;6;8;0;10

Before getting a correct output I run the command several times. The 
initial output is GigabitEthernet0/24:UP No usable data on file (102 rows) 
:(1 UP): UNKNOWN

2. Defined the following command in nagios:

define command{
command_namecheck_snmp_int
command_line$USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C 
$USER5$ -2 -n $ARG1$ -fY -kqB -g -w $ARG2$ -c $ARG3$ -d $ARG4$
}

Where 

And the corresponding service check:

define service{
use generic-service,pnpgraphed-service  ; 
Inherit values from a template
host_name   test_sw
service_description Port 24 Bandwidth Usage
check_command 
check_snmp_int!GigabitEthernet0/24!60,60,100,100,10,10!80,80,200,200,20,20!10
normal_check_interval   2   ; Check the service every 
2 units under normal conditions
retry_check_interval1   ; Re-check the service 
every 1 unit until its final/hard state is determined
}

In nagios service status I get now: ERROR: Description table : No response 
from remote host 192.168.2.1. 

In resources I have defined $USER5$=Public

Ideas, please?

Thank you!



Von:Radu Popa/Technomatic
An: nagios-users@lists.sourceforge.net
Datum:  06/28/2011 05:14 PM
Betreff:RE: MRTG graphs problem - measurement units not taken into 
consideration


I have tried the SNMP plugin a few minutes ago and here is what I get when 
running it from the command line:

./check_snmp_int.pl -H 192.168.2.1 -C public -n GigabitEthernet0/24 -k -w 
200,400 -c 0,600
GigabitEthernet0/24:UP No usable data on file (1 rows) :(1 UP): UNKNOWN

The interface name I took it from listing all available interfaces on the 
switch.

Any ideas, please?

Thanks!




Von:Radu Popa/Technomatic
An: nagios-users@lists.sourceforge.net
Datum:  06/28/2011 03:31 PM
Betreff:MRTG graphs problem - measurement units not taken into 
consideration


Hello,

I am using Nagios to monitor the bandwidth usage of a couple of 
switches/routers via the check_local_mrtgtraf command. The graphs I handle 
via pnp4nagios. The problem I encounter is the following: if the monitored 
bandwidth is at one point expressed in KB/s and then I bring a lot of 
traffic through the router the, unit is changed into MB/s. The graphed 
values seem not to take into consideration the change of the measurement 
unit meaning that the 50KB/s seems to be at the same height as the 50MB/s 
value in the graph.

Can anyone please help?

Thank you!
--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] A question on status.cgi and resources

2011-06-29 Thread steve f

I am working on a pretty large distributed Nagios core deployment and have a 
question regarding resources.

It appears that almost every morning @ about 7:20 am, my cpu usage goes from 
about 4% to about 25-40% .  It will do this for a few hours and then go back to 
4%.

It appears to me that the issue is with wwwrun running status.cgi.  I was 
wondering what would be going on during the morning at about the same time 
every day  as opposed to 24hrs / day.  I was looking for some kind of 
maintenance it could be doing or something but am not finding anything.

I know there are no consoles open, 99.5 % of the checks the core server is 
processing are remote checks.

I know my environment will require multiple core servers ( 700 remote sites,  
15 clients / site , 20 +/- checks per client ) but have been working all along 
to tune the servers for the best efficiency.

Are there any tuning parameters for status.cgi??

Thanks,

Steve  
  --
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Antwort: RE: MRTG graphs problem - measurement units not taken into consideration

2011-06-29 Thread Joerg Linge
radu.p...@technomatic.de schrieb:
 After adding the -r option it works like a charm!

 Now in pnp4nagios I can see both values on the same graph: bytes in and
 bytes out. I see them as two thin lines. Any way to make the lines thicker
 and change their colour?

it depends on the pnp4nagios template you use to create this kind of graphs.
I assume its templates.dist/check_snmp_int.php

http://docs.pnp4nagios.org/pnp-0.6/tpl

You can change the Template to fit your needs. Every valid rrdtool option is 
allowed.

http://oss.oetiker.ch/rrdtool/doc/rrdgraph_graph.en.html

Joerg

--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Antwort: RE: MRTG graphs problem - measurement units not taken into consideration

2011-06-29 Thread Jim Avery
On 29 June 2011 13:23,  radu.p...@technomatic.de wrote:
 After adding the -r option it works like a charm!

 Now in pnp4nagios I can see both values on the same graph: bytes in and
 bytes out. I see them as two thin lines. Any way to make the lines thicker
 and change their colour?

Well done.

Yes, you can create a custom template.

Take a look under /usr/local/pnp4nagios/share/templates.dist and find
the file named check_snmp_int.php .  Copy it to the directory
/usr/local/pnp4nagios/share/templates and then edit it to get the
wider line.

For example the line which reads:

$def[1] .= LINE1:var1#003300:\in  \  ;

Change it to read:

$def[1] .= LINE3:var1#003300:\in  \  ;



You will find documentation on how the LINE instruction works at:

http://oss.oetiker.ch/rrdtool/doc/rrdgraph_graph.en.html

and some general help on PNP templates at:

http://docs.pnp4nagios.org/pnp-0.6/tpl

--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Printer error

2011-06-29 Thread Deborah Martin
I wouldn't just restart Nagios and hope the problem goes away but try and find 
out why / where the problem lies.

Have you checked by running the plugin itself as the nagios user to ensure it 
works properly from the command line  ?

Does the plugin exist in the location that you've configured Nagios to look for 
it ?

Regards,
Deborah

From: Wilfred de Bondt [mailto:wilf...@stachanov.com]
Sent: 29 June 2011 12:36
To: Nagios Users List
Subject: Re: [Nagios-users] Printer error

Hi all,

Usually we get this problem after a nagios reload and not a restart. So just 
stop and start nagios?

Yours faithfully,


Wilfred de Bondt

--

Systems Administrator

Stachanov Group:  phone: (+31)205091010

For more information, see website web:

www.stachanov.com/http://www.stachanov.com/

www.issuu.com/stachanovhttp://www.issuu.com/stachanov


From: nag ios [mailto:nagiost...@gmail.com]
Sent: woensdag 29 juni 2011 13:31
To: Nagios Users List
Subject: [Nagios-users] Printer error

Hi i have configured nagios for monitoring Printers but i'm getting the error 
as return code of 127 is out of bounds - plugin may be missing

Please help me regarding this issue.


--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Getting re-notified while in a HARD state

2011-06-29 Thread Frank Bulk
I have a few existing and self-developed plugins that output details of the
HARD state:

CRITICAL: critical 1, warning 1
Detail 1
Detail 2

What I'd like to do is to be able to be re-notified if, while in the HARD
state, the number and/or details change.  For example, if the above would go
to:

CRITICAL: critical 2, warning 1
Detail 1
Detail 2
Detail 3

Anyone have an approach that works?  The documentation doesn't indicate it's
possible, but I'm sure others have encountered this before and perhaps
they've worked through a solution.

Kind regards,

Frank


--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Getting re-notified while in a HARD state

2011-06-29 Thread Paul M. Dubuc
Frank Bulk wrote:
 I have a few existing and self-developed plugins that output details of the
 HARD state:

   CRITICAL: critical 1, warning 1
   Detail 1
   Detail 2

 What I'd like to do is to be able to be re-notified if, while in the HARD
 state, the number and/or details change.  For example, if the above would go
 to:

   CRITICAL: critical 2, warning 1
   Detail 1
   Detail 2
   Detail 3

 Anyone have an approach that works?  The documentation doesn't indicate it's
 possible, but I'm sure others have encountered this before and perhaps
 they've worked through a solution.

 Kind regards,

 Frank

I don't think there's a simple way to do this without having your notification 
command store the value of the $SERVICEOUTPUT$ macro for the host + service 
for comparison on the next try. Then you would have to set is_volatile on the 
service and have the notification command suppress the notification if the 
$SERVICEOUTPUT$ doesn't change.

Another thing you can do is tell Nagios to log the hard state status when only 
the $SERVICEOUTPUT$ changes by setting the stalking_options in the service. 
Then, if you have something that watching the log file you can trigger 
notifications with that.  If only this state stalking feature would have an 
option to send notifications in addition to logging you would be set.

Hope this helps,
Paul Dubuc

--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Expanding Custom Variables

2011-06-29 Thread Stringham, Steven
I am trying to monitor multiple volumes on a NetApp system. The format of the 
command requires a hostname:volumename format. I want to reduce my 
commands/service definitions to a minimum.  My initial thought was to have a 
generic service definition, that gets more specific with a sub definition.  
When the command is run, it seems like it is not passing the custom variable, 
but rather leaving a single $ behind where the variable ought to be.

Any ideas on
1) fixing my syntax?
2) Doing this somehow easier?


NAF UNKNOWN - No Snapmirror with source or destination myhostname:$ 
found!

 define service{
 name NA_SnapMirror
 check_command netapp_snapmirror
use  GenericService_Core
 normal_check_interval 1000
 max_check_attempts 300
 register 0
 contact_groups CoreServers
}

define service {
 use NA_SnapMirror
 _navolume myvolumename
 service_description SnapMirror_groups
 hosts myhostname
 }

define command{
command_name  netapp_snapmirror
command_line $USER1$/check_naf.py -H $HOSTADDRESS$ -C $USER8$ 
snapmirror,$HOSTNAME$:$_SERVICEnavolume$,$USER25$
}







For more information about Lewis and Roca LLP, please go to 
www.lewisandroca.comhttp://www.lewisandroca.com/.

Phoenix (602)262-5311   Minden (775)586-9500
Tucson (520)622-2090Albuquerque (505)764-5400
Las Vegas (702)949-8200 Silicon Valley (650)391-1380
Reno (775)823-2900

  This message is intended only for the use of the individual or entity to 
which it is addressed. If the reader of this message is not the intended 
recipient, or the employee or agent responsible for delivering the message to 
the intended recipient, you are hereby notified that any dissemination, 
distribution or copying of this message is strictly prohibited. If you have 
received this communication in error, please notify us immediately by replying 
to the sender of this E-Mail by return E-Mail or by telephone.

  In accordance with Internal Revenue Service Circular 230, we advise you that 
if this email contains any tax advice, such tax advice was not intended or 
written to be used, and it cannot be used, by any taxpayer for the purpose of 
avoiding penalties that may be imposed on the taxpayer.
--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Expanding Custom Variables

2011-06-29 Thread Stringham, Steven
Ok - I solved it. It was an Upper/Lower case thing. my variable IN USE needed 
to be $_SERVICENAVOLUME$ instead of $_SERVICEnavolume$ because it automatically 
converts the variable name to uppercase.



From: Stringham, Steven [mailto:sstri...@lrlaw.com]
Sent: Wednesday, June 29, 2011 12:19 PM
To: 'Nagios Users List'
Subject: [Nagios-users] Expanding Custom Variables


I am trying to monitor multiple volumes on a NetApp system. The format of the 
command requires a hostname:volumename format. I want to reduce my 
commands/service definitions to a minimum.  My initial thought was to have a 
generic service definition, that gets more specific with a sub definition.  
When the command is run, it seems like it is not passing the custom variable, 
but rather leaving a single $ behind where the variable ought to be.

Any ideas on
1) fixing my syntax?
2) Doing this somehow easier?


NAF UNKNOWN - No Snapmirror with source or destination myhostname:$ found!

 define service{
 name NA_SnapMirror
 check_command netapp_snapmirror
use  GenericService_Core
 normal_check_interval 1000
 max_check_attempts 300
 register 0
 contact_groups CoreServers
}

define service {
 use NA_SnapMirror
 _navolume myvolumename
 service_description SnapMirror_groups
 hosts myhostname
 }

define command{
command_name  netapp_snapmirror
command_line $USER1$/check_naf.py -H $HOSTADDRESS$ -C $USER8$ 
snapmirror,$HOSTNAME$:$_SERVICEnavolume$,$USER25$
}







For more information about Lewis and Roca LLP, please go to 
www.lewisandroca.comhttp://www.lewisandroca.com/.

Phoenix (602)262-5311   Minden (775)586-9500
Tucson (520)622-2090Albuquerque (505)764-5400
Las Vegas (702)949-8200 Silicon Valley (650)391-1380
Reno (775)823-2900

  This message is intended only for the use of the individual or entity to 
which it is addressed. If the reader of this message is not the intended 
recipient, or the employee or agent responsible for delivering the message to 
the intended recipient, you are hereby notified that any dissemination, 
distribution or copying of this message is strictly prohibited. If you have 
received this communication in error, please notify us immediately by replying 
to the sender of this E-Mail by return E-Mail or by telephone.

  In accordance with Internal Revenue Service Circular 230, we advise you that 
if this email contains any tax advice, such tax advice was not intended or 
written to be used, and it cannot be used, by any taxpayer for the purpose of 
avoiding penalties that may be imposed on the taxpayer.
--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Expanding Custom Variables

2011-06-29 Thread Mike Lindsey
On 6/29/11 12:18 PM, Stringham, Steven wrote:
  I am trying to monitor multiple volumes on a NetApp system. The 
format of the command requires a
  hostname:volumename format. I want to reduce my commands/service 
definitions to a minimum.  My initial
  thought was to have a generic service definition, that gets more 
specific with a sub definition.  When the
  command is run, it seems like it is not passing the custom variable, 
but rather leaving a single $ behind where
  the variable ought to be.

I'm not sure that custom macros are evaluated at the command level?  
Perhaps set your command_line to pull in the variable from the service:

define service {
 name NA_SnapMirror
 check_command netapp_snapmirror!$_SERVICEnavolume$
 use  GenericService_Core
 normal_check_interval 1000
 max_check_attempts 300
 register 0
 contact_groups CoreServers
}

define service {
 use NA_SnapMirror
 _navolume myvolumename
 service_description SnapMirror_groups
 hosts myhostname
}

define command {
 command_name  netapp_snapmirror
 command_line $USER1$/check_naf.py -H $HOSTADDRESS$ -C $USER8$  
snapmirror,$HOSTNAME$:$ARG1$,$USER25$
}

...
Alternately, if you have enable_environment_macros=1 in nagios.cfg, you 
could instead put $NAGIOS__SERVICEnavolume and pass the reference to 
the script.

One of the two should work for you.  If not, then I'd recommend 
restarting in debug mode, debug_level=18 will get you debug information 
about both the configuration load process, and the service check 
execution, so you should be able to figure out the problem - just fire 
it up in a reduced config set, so you only have this in there and don't 
get spammed by normal operations.

What version, btw?

--
Mike Lindsey

-- 
Mike Lindsey


--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security 
threats, fraudulent activity, and more. Splunk takes this data and makes 
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2d-c2
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null