Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-27 Thread Klaus Umbach
 
  I should write a patch to make it configurable...
 
 That'd be nice =)

Ehrm, I just looked in the code of nrpe 2.12...
This function is already in (-u). I still had 2.5.1 (Debian Etch).

This really hurts... :-)

Cheers
Klaus

-- 
BOFH excuse #75:

There isn't any problem

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Klaus Umbach
Hi,

I have a problem with unstable lines and/or machines under stress.

Sometimes they just don't reply on nrpe, but are still working, so I get a
lot of false alarms. Is it possible, to tell check_nrpe to exit with a
warning instead of critical?

Cheers
Klaus

-- 
BOFH excuse #401:

Sales staff sold a product we don't offer.

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Lee Azzarello
On Thu, Feb 26, 2009 at 5:21 AM, Klaus Umbach
treibh...@sozial-inkompetent.de wrote:
 Hi,

 I have a problem with unstable lines and/or machines under stress.

 Sometimes they just don't reply on nrpe, but are still working, so I get a
 lot of false alarms. Is it possible, to tell check_nrpe to exit with a
 warning instead of critical?

I believe that a state of socket timeout will always be flagged as
critical, though you can increase the max_check_attempts so it won't
throw a notification so soon if you know of false positives.

Of course figuring out the cause of your false positives is probably
the better solution.

-lee

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Klaus Umbach
On 26/02/09 12:06, Lee Azzarello wrote:
 On Thu, Feb 26, 2009 at 5:21 AM, Klaus Umbach
 treibh...@sozial-inkompetent.de wrote:
  Hi,
 
  I have a problem with unstable lines and/or machines under stress.
 
  Sometimes they just don't reply on nrpe, but are still working, so I get a
  lot of false alarms. Is it possible, to tell check_nrpe to exit with a
  warning instead of critical?
 
 I believe that a state of socket timeout will always be flagged as
 critical, though you can increase the max_check_attempts so it won't
 throw a notification so soon if you know of false positives.

Sometimes these periods last for a an hour so, on some machines it's
because of shitty connections to another continent or in countries with
unstable infrastructure.

 Of course figuring out the cause of your false positives is probably
 the better solution.

Most of time I know the cause, but it is not solvable in a cheap way :-)
The relevant services are still working, but if the monitored values pass
the thresholds, I want to know it, not if connection fails.

Cheers
Klaus

-- 
BOFH excuse #159:

Stubborn processes

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Marc Powell

On Feb 26, 2009, at 1:02 PM, Klaus Umbach wrote:

 Most of time I know the cause, but it is not solvable in a cheap  
 way :-)
 The relevant services are still working, but if the monitored values  
 pass
 the thresholds, I want to know it, not if connection fails.

Edit check_nrpe.c; look for

void alarm_handler(int sig){

 printf(CHECK_NRPE: Socket timeout after %d seconds. 
\n,socket_timeout);

 exit(STATE_CRITICAL);
 }


change STATE_CRITICAL to whatever state you think it should be and  
recompile.

--
Marc


--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Klaus Umbach
On 26/02/09 13:18, Marc Powell wrote:
 change STATE_CRITICAL to whatever state you think it should be and  
 recompile.

Yes, that solution is an option, too, but I was hoping to find something
without recompiling :-)

I should write a patch to make it configurable...

Cheers
Klaus

-- 
BOFH excuse #445:

Browser's cookie is corrupted -- someone's been nibbling on it.

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Marc Powell

On Feb 26, 2009, at 2:13 PM, Klaus Umbach wrote:

 On 26/02/09 13:18, Marc Powell wrote:
 change STATE_CRITICAL to whatever state you think it should be and
 recompile.

 Yes, that solution is an option, too, but I was hoping to find  
 something
 without recompiling :-)

The negate plugin is close but it doesn't look at the text output,  
only the exit code, so you wouldn't have the granularity you need. I'm  
not aware of anything else workable.

 I should write a patch to make it configurable...

That'd be nice =)

--
Marc

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null