Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-27 Thread Klaus Umbach
 
> > I should write a patch to make it configurable...
> 
> That'd be nice =)

Ehrm, I just looked at the code of nrpe 2.12...
This feature is already in there (-u). I was still on 2.5.1 (Debian Etch).

This really hurts... :-)

Cheers
Klaus

-- 
BOFH excuse #75:

There isn't any problem

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Marc Powell

On Feb 26, 2009, at 2:13 PM, Klaus Umbach wrote:

> On 26/02/09 13:18, Marc Powell wrote:
>> change STATE_CRITICAL to whatever state you think it should be and
>> recompile.
>
> Yes, that solution is an option, too, but I was hoping to find something
> without recompiling :-)

The negate plugin is close but it doesn't look at the text output,  
only the exit code, so you wouldn't have the granularity you need. I'm  
not aware of anything else workable.
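
To illustrate the gap: a wrapper would have to look at both the exit code and the text output. The following is only a rough sketch of such a wrapper, not an existing plugin. The names remap_state and run_and_remap are made up for this example, and the matched strings "Socket timeout" and "Connection refused" are assumptions about what check_nrpe prints, so verify them against your version.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <stdio.h>
#include <string.h>
#include <sys/wait.h>

/* Standard Nagios plugin exit codes. */
#define STATE_WARNING  1
#define STATE_CRITICAL 2
#define STATE_UNKNOWN  3

/* Downgrade CRITICAL to WARNING only when the output looks like a
 * connection problem. The matched substrings are assumptions. */
int remap_state(int exit_code, const char *output) {
    if (exit_code != STATE_CRITICAL)
        return exit_code;
    if (strstr(output, "Socket timeout") != NULL ||
        strstr(output, "Connection refused") != NULL)
        return STATE_WARNING;
    return exit_code;
}

/* Hypothetical helper: run a plugin command line through the shell,
 * echo its output, and return the (possibly remapped) exit state. */
int run_and_remap(const char *command) {
    char output[4096];
    char discard[256];
    size_t used = 0, n;
    FILE *pipe;
    int status, code;

    assert(command != NULL);
    pipe = popen(command, "r");
    if (pipe == NULL)
        return STATE_UNKNOWN;

    /* Keep the first 4095 bytes of output, then drain the rest. */
    while (used < sizeof(output) - 1 &&
           (n = fread(output + used, 1, sizeof(output) - 1 - used, pipe)) > 0)
        used += n;
    output[used] = '\0';
    while (fread(discard, 1, sizeof(discard), pipe) > 0)
        ;

    status = pclose(pipe);
    code = (status != -1 && WIFEXITED(status)) ? WEXITSTATUS(status)
                                                : STATE_UNKNOWN;
    fputs(output, stdout);
    return remap_state(code, output);
}
```

A main() would pass the full check_nrpe command line to run_and_remap() and return its result as the process exit code. Threshold violations reported by the remote plugin keep their original state; only the connection failures are downgraded.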

> I should write a patch to make it configurable...

That'd be nice =)

--
Marc



Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Klaus Umbach
On 26/02/09 13:18, Marc Powell wrote:
> change STATE_CRITICAL to whatever state you think it should be and  
> recompile.

Yes, that solution is an option, too, but I was hoping to find something
without recompiling :-)

I should write a patch to make it configurable...

Cheers
Klaus

-- 
BOFH excuse #445:

Browser's cookie is corrupted -- someone's been nibbling on it.



Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Marc Powell

On Feb 26, 2009, at 1:02 PM, Klaus Umbach wrote:

> Most of time I know the cause, but it is not solvable in a cheap way :-)
> The relevant services are still working, but if the monitored values pass
> the thresholds, I want to know it, not if connection fails.

Edit check_nrpe.c; look for

void alarm_handler(int sig){
        printf("CHECK_NRPE: Socket timeout after %d seconds.\n",socket_timeout);
        exit(STATE_CRITICAL);
}


change STATE_CRITICAL to whatever state you think it should be and  
recompile.
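
If you would rather not hard-code a different state, the same handler could read the exit state from a variable set at startup. This is only a sketch of the idea, not the actual NRPE source: timeout_state, parse_state and arm_timeout are hypothetical names, and the default socket_timeout value is assumed.

```c
#include <assert.h>
#include <signal.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

/* Standard Nagios plugin exit codes. */
#define STATE_OK       0
#define STATE_WARNING  1
#define STATE_CRITICAL 2
#define STATE_UNKNOWN  3

/* Hypothetical globals; a real plugin would set them while parsing options. */
static int socket_timeout = 10;
static int timeout_state = STATE_CRITICAL;

/* Map a state name from the command line to an exit code, -1 if invalid. */
int parse_state(const char *name) {
    if (strcmp(name, "ok") == 0)       return STATE_OK;
    if (strcmp(name, "warning") == 0)  return STATE_WARNING;
    if (strcmp(name, "critical") == 0) return STATE_CRITICAL;
    if (strcmp(name, "unknown") == 0)  return STATE_UNKNOWN;
    return -1;
}

/* Same shape as the handler quoted above, but the exit state is
 * configurable. Like that handler, it calls printf and exit, which are
 * not async-signal-safe. */
void alarm_handler(int sig) {
    (void)sig;
    printf("CHECK_NRPE: Socket timeout after %d seconds.\n", socket_timeout);
    exit(timeout_state);
}

/* Install the handler and start the timeout clock. */
void arm_timeout(int seconds, int state) {
    assert(seconds > 0);
    socket_timeout = seconds;
    timeout_state = state;
    signal(SIGALRM, alarm_handler);
    alarm((unsigned int)seconds);
}
```

With something like arm_timeout(10, parse_state("warning")) called before the connect, a socket timeout would end the check with WARNING instead of CRITICAL.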

--
Marc




Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Klaus Umbach
On 26/02/09 12:06, Lee Azzarello wrote:
> On Thu, Feb 26, 2009 at 5:21 AM, Klaus Umbach wrote:
> > Hi,
> >
> > I have a problem with unstable lines and/or machines under stress.
> >
> > Sometimes they just don't reply on nrpe, but are still working, so I get a
> > lot of false alarms. Is it possible, to tell check_nrpe to exit with a
> > warning instead of critical?
> 
> I believe that a state of "socket timeout" will always be flagged as
> "critical", though you can increase the max_check_attempts so it won't
> throw a notification so soon if you know of false positives.

Sometimes these periods last for an hour or so; on some machines it's
because of shitty connections to another continent or in countries with
unstable infrastructure.

> Of course figuring out the cause of your false positives is probably
> the better solution.

Most of the time I know the cause, but it is not solvable in a cheap way :-)
The relevant services are still working. If the monitored values pass
the thresholds, I want to know about it, but not when the connection fails.

Cheers
Klaus

-- 
BOFH excuse #159:

Stubborn processes



Re: [Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Lee Azzarello
On Thu, Feb 26, 2009 at 5:21 AM, Klaus Umbach wrote:
> Hi,
>
> I have a problem with unstable lines and/or machines under stress.
>
> Sometimes they just don't reply on nrpe, but are still working, so I get a
> lot of false alarms. Is it possible, to tell check_nrpe to exit with a
> warning instead of critical?

I believe that a state of "socket timeout" will always be flagged as
"critical", though you can increase the max_check_attempts so it won't
throw a notification so soon if you know of false positives.

Of course figuring out the cause of your false positives is probably
the better solution.

-lee



[Nagios-users] getting warnings for connection refused with check_nrpe instead of critical

2009-02-26 Thread Klaus Umbach
Hi,

I have a problem with unstable lines and/or machines under stress.

Sometimes they just don't reply to nrpe, but are still working, so I get a
lot of false alarms. Is it possible to tell check_nrpe to exit with a
warning instead of critical?

Cheers
Klaus

-- 
BOFH excuse #401:

Sales staff sold a product we don't offer.
