Re: [Nagios-users] service checks running too often

2012-12-14 Thread Andreas Ericsson
On 12/14/2012 04:19 PM, Mark Keisler wrote:
> What you propose sounds acceptable.  In the meantime  I need to be careful
> about reloading nagios :).  Once I get it in that state, I have to disable
> use_retained_scheduling_info and then do a full restart.
> 

I've actually checked Nagios 4 now, and it appears we don't do this there.
I didn't test it all that thoroughly (and I probably should), but it's
friday and I'm two beers past my best-before-thinking hour, so I'll just
refrain from trying it further today.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add services
Discover what IT Professionals Know. Rescue delivers
http://p.sf.net/sfu/logmein_12329d2d
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] service checks running too often

2012-12-14 Thread Mark Keisler
What you propose sounds acceptable.  In the meantime  I need to be careful
about reloading nagios :).  Once I get it in that state, I have to disable
use_retained_scheduling_info and then do a full restart.


On Fri, Dec 14, 2012 at 3:41 AM, Andreas Ericsson  wrote:

> On 12/14/2012 05:13 AM, Mark Keisler wrote:
> > I think I found the issue.  If I happen to send a reload (HUP) to nagios
> > while a service check is in progress (fairly easy since my service check
> is
> > rather long lived), the reloaded nagios doesn't seem to know about that
> > service check and so I'll end up with another being scheduled as well as
> > the original on its schedule.  Create a dummy service check that just
> > sleeps for 30 seconds or something and issue a reload while it is running
> > and see if your nagios instance will start another sequence of service
> > checks.
> >
>
> This should be pretty easily fixed by just adding a check reaping event
> before initializing the event queue and skipping all checks that have
> already been scheduled.
>
> I'll have to add a check for it in 4.x. Since we keep workers between
> reloads, the same thing can easily happen there.
>
> That means we'll reschedule all checks like normal when we're starting,
> but if a check result comes in when a new check is already scheduled,
> we'll remove the old event and reschedule a new one according to the
> retry interval. I'd suggest doing something similar in the 3.4.x
> branch, but I'm not sure I can commit to that one without doing a new
> svn clone, and that takes at least a day.
>
> Mark; Would that be acceptable to you?
>
> Oh, and good catch :)
>
> --
> Andreas Ericsson   andreas.erics...@op5.se
> OP5 AB www.op5.se
> Tel: +46 8-230225  Fax: +46 8-230231
>
> Considering the successes of the wars on alcohol, poverty, drugs and
> terror, I think we should give some serious thought to declaring war
> on peace.
>
--
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add services
Discover what IT Professionals Know. Rescue delivers
http://p.sf.net/sfu/logmein_12329d2d___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] service checks running too often

2012-12-14 Thread Andreas Ericsson
On 12/14/2012 05:13 AM, Mark Keisler wrote:
> I think I found the issue.  If I happen to send a reload (HUP) to nagios
> while a service check is in progress (fairly easy since my service check is
> rather long lived), the reloaded nagios doesn't seem to know about that
> service check and so I'll end up with another being scheduled as well as
> the original on its schedule.  Create a dummy service check that just
> sleeps for 30 seconds or something and issue a reload while it is running
> and see if your nagios instance will start another sequence of service
> checks.
> 

This should be pretty easily fixed by just adding a check reaping event
before initializing the event queue and skipping all checks that have
already been scheduled.

I'll have to add a check for it in 4.x. Since we keep workers between
reloads, the same thing can easily happen there.

That means we'll reschedule all checks like normal when we're starting,
but if a check result comes in when a new check is already scheduled,
we'll remove the old event and reschedule a new one according to the
retry interval. I'd suggest doing something similar in the 3.4.x
branch, but I'm not sure I can commit to that one without doing a new
svn clone, and that takes at least a day.

Mark; Would that be acceptable to you?

Oh, and good catch :)

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add services
Discover what IT Professionals Know. Rescue delivers
http://p.sf.net/sfu/logmein_12329d2d
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] service checks running too often

2012-12-13 Thread Mark Keisler
I think I found the issue.  If I happen to send a reload (HUP) to nagios
while a service check is in progress (fairly easy since my service check is
rather long lived), the reloaded nagios doesn't seem to know about that
service check and so I'll end up with another being scheduled as well as
the original on its schedule.  Create a dummy service check that just
sleeps for 30 seconds or something and issue a reload while it is running
and see if your nagios instance will start another sequence of service
checks.


On Thu, Dec 13, 2012 at 2:37 PM, Mike Guthrie  wrote:

>
> On 12/13/2012 12:38 PM, Mark Keisler wrote:
>
> I understand that nagios dynamically adjusts service check times, but the
> puzzling thing is that there is a check that runs every 5 minutes but then
> an extra or two in between.  And yes, the web interface shows the next
> service check as 5 mins out and yet another runs before that time hits.
>
> Is there any chance that there could be a second instance of Nagios
> running?   Look for multiple *parent* processes from the following
>
> #modify the nagios binary path to match your system
>
> ps aux | grep /bin/nagios
>
>  /etc/init.d/nagios stop
>
> killall -9 nagios
>
> /etc/init.d/nagios start
>
>
>
>
>
> On Thu, Dec 13, 2012 at 10:24 AM, Mike Guthrie wrote:
>
>>  Although some of those start times do seem close together, it's
>> important to know that the check_interval in Nagios is not necessarily a
>> hard number. Nagios is continually adjusting and recalculating the check
>> schedule, so if you need a check to run on a hard 5mn schedule, you might
>> be better off using cron, and then pushing the result to Nagios passively.
>>
>> With that said, access the service details for this service. When new
>> results come in does the scheduler set the Next Check 5mn out as expected?
>>
>>
>>
>> On 12/13/2012 9:43 AM, Mark Keisler wrote:
>>
>>  I'm running Nagios 3.4.1 on RHEL6. I have an issue where I have a
>> poller (service check) that is running too often and I am not sure why. I
>> have "service_check_timeout=180" because I had trouble with the poller
>> running long. Relevant settings for the service check:
>>
>> check_period24x7
>> max_check_attempts  1
>> normal_check_interval   5
>> retry_check_interval5
>>
>> I also set up a tracking logger in the poller to record "timestamp PID
>> started by PPID : Poll [Start|End] of poller"
>> 2012-12-12_12:26:38 19448 started by 19442 : Poll Start of poller
>> 2012-12-12_12:27:13 19448 started by 19442 : Poll End of poller
>> 2012-12-12_12:28:14 19931 started by 19930 : Poll Start of poller
>> 2012-12-12_12:30:14 19931 started by 19930 : Poll End of poller
>> 2012-12-12_12:31:37 20467 started by 20460 : Poll Start of poller
>> 2012-12-12_12:33:15 20949 started by 20946 : Poll Start of poller
>> 2012-12-12_12:33:15 20467 started by 20460 : Poll End of poller
>> 2012-12-12_12:33:41 20949 started by 20946 : Poll End of poller
>> 2012-12-12_12:36:38 21483 started by 21478 : Poll Start of poller
>> 2012-12-12_12:38:14 21971 started by 21964 : Poll Start of poller
>> 2012-12-12_12:39:17 21483 started by 21478 : Poll End of poller
>> 2012-12-12_12:39:18 21971 started by 21964 : Poll End of poller
>> 2012-12-12_12:41:38 22500 started by 22492 : Poll Start of poller
>> 2012-12-12_12:42:19 22500 started by 22492 : Poll End of poller
>> 2012-12-12_12:43:14 23003 started by 22999 : Poll Start of poller
>> 2012-12-12_12:45:20 23003 started by 22999 : Poll End of poller
>> 2012-12-12_12:46:37 23540 started by 23535 : Poll Start of poller
>> 2012-12-12_12:48:14 24025 started by 24024 : Poll Start of poller
>> 2012-12-12_12:48:20 23540 started by 23535 : Poll End of poller
>> 2012-12-12_12:48:41 24025 started by 24024 : Poll End of poller
>> 2012-12-12_12:51:38 24558 started by 24554 : Poll Start of poller
>> 2012-12-12_12:53:14 25044 started by 25041 : Poll Start of poller
>> 2012-12-12_12:54:35 25044 started by 25041 : Poll End of poller
>>
>> As you can see, I start to get overlapping pollers. I don't understand
>> why this would happen. Any hints or clues?
>>
>>
>>  
>> --
>> LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
>> Remotely access PCs and mobile devices and provide instant support
>> Improve your efficiency, and focus on delivering more value-add services
>> Discover what IT Professionals Know. Rescue 
>> delivershttp://p.sf.net/sfu/logmein_12329d2d
>>
>>
>>
>> ___
>> Nagios-users mailing 
>> listNagios-users@lists.sourceforge.nethttps://lists.sourceforge.net/lists/listinfo/nagios-users
>> ::: Please include Nagios version, plugin version (-v) and OS when reporting 
>> any issue.
>> ::: Messages without supporting info will risk being sent to /dev/null
>>
>>
>>
>> --
>>
>>
>> Mike Guthrie
>> Technical Team
>> ___
>> Nagios Ent

Re: [Nagios-users] service checks running too often

2012-12-13 Thread Mark Keisler
There isn't a second nagios instance.  While I was watching the pollers
spawn, they all led back to the same master nagios instance.


On Thu, Dec 13, 2012 at 2:37 PM, Mike Guthrie  wrote:

>
> On 12/13/2012 12:38 PM, Mark Keisler wrote:
>
> I understand that nagios dynamically adjusts service check times, but the
> puzzling thing is that there is a check that runs every 5 minutes but then
> an extra or two in between.  And yes, the web interface shows the next
> service check as 5 mins out and yet another runs before that time hits.
>
> Is there any chance that there could be a second instance of Nagios
> running?   Look for multiple *parent* processes from the following
>
> #modify the nagios binary path to match your system
>
> ps aux | grep /bin/nagios
>
>  /etc/init.d/nagios stop
>
> killall -9 nagios
>
> /etc/init.d/nagios start
>
>
>
>
>
> On Thu, Dec 13, 2012 at 10:24 AM, Mike Guthrie wrote:
>
>>  Although some of those start times do seem close together, it's
>> important to know that the check_interval in Nagios is not necessarily a
>> hard number. Nagios is continually adjusting and recalculating the check
>> schedule, so if you need a check to run on a hard 5mn schedule, you might
>> be better off using cron, and then pushing the result to Nagios passively.
>>
>> With that said, access the service details for this service. When new
>> results come in does the scheduler set the Next Check 5mn out as expected?
>>
>>
>>
>> On 12/13/2012 9:43 AM, Mark Keisler wrote:
>>
>>  I'm running Nagios 3.4.1 on RHEL6. I have an issue where I have a
>> poller (service check) that is running too often and I am not sure why. I
>> have "service_check_timeout=180" because I had trouble with the poller
>> running long. Relevant settings for the service check:
>>
>> check_period24x7
>> max_check_attempts  1
>> normal_check_interval   5
>> retry_check_interval5
>>
>> I also set up a tracking logger in the poller to record "timestamp PID
>> started by PPID : Poll [Start|End] of poller"
>> 2012-12-12_12:26:38 19448 started by 19442 : Poll Start of poller
>> 2012-12-12_12:27:13 19448 started by 19442 : Poll End of poller
>> 2012-12-12_12:28:14 19931 started by 19930 : Poll Start of poller
>> 2012-12-12_12:30:14 19931 started by 19930 : Poll End of poller
>> 2012-12-12_12:31:37 20467 started by 20460 : Poll Start of poller
>> 2012-12-12_12:33:15 20949 started by 20946 : Poll Start of poller
>> 2012-12-12_12:33:15 20467 started by 20460 : Poll End of poller
>> 2012-12-12_12:33:41 20949 started by 20946 : Poll End of poller
>> 2012-12-12_12:36:38 21483 started by 21478 : Poll Start of poller
>> 2012-12-12_12:38:14 21971 started by 21964 : Poll Start of poller
>> 2012-12-12_12:39:17 21483 started by 21478 : Poll End of poller
>> 2012-12-12_12:39:18 21971 started by 21964 : Poll End of poller
>> 2012-12-12_12:41:38 22500 started by 22492 : Poll Start of poller
>> 2012-12-12_12:42:19 22500 started by 22492 : Poll End of poller
>> 2012-12-12_12:43:14 23003 started by 22999 : Poll Start of poller
>> 2012-12-12_12:45:20 23003 started by 22999 : Poll End of poller
>> 2012-12-12_12:46:37 23540 started by 23535 : Poll Start of poller
>> 2012-12-12_12:48:14 24025 started by 24024 : Poll Start of poller
>> 2012-12-12_12:48:20 23540 started by 23535 : Poll End of poller
>> 2012-12-12_12:48:41 24025 started by 24024 : Poll End of poller
>> 2012-12-12_12:51:38 24558 started by 24554 : Poll Start of poller
>> 2012-12-12_12:53:14 25044 started by 25041 : Poll Start of poller
>> 2012-12-12_12:54:35 25044 started by 25041 : Poll End of poller
>>
>> As you can see, I start to get overlapping pollers. I don't understand
>> why this would happen. Any hints or clues?
>>
>>
>>  
>> --
>> LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
>> Remotely access PCs and mobile devices and provide instant support
>> Improve your efficiency, and focus on delivering more value-add services
>> Discover what IT Professionals Know. Rescue 
>> delivershttp://p.sf.net/sfu/logmein_12329d2d
>>
>>
>>
>> ___
>> Nagios-users mailing 
>> listNagios-users@lists.sourceforge.nethttps://lists.sourceforge.net/lists/listinfo/nagios-users
>> ::: Please include Nagios version, plugin version (-v) and OS when reporting 
>> any issue.
>> ::: Messages without supporting info will risk being sent to /dev/null
>>
>>
>>
>> --
>>
>>
>> Mike Guthrie
>> Technical Team
>> ___
>> Nagios Enterprises, LLC
>> Email:  mguth...@nagios.com
>> Web:www.nagios.com
>>
>>
>>
>> --
>> LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
>> Remotely access PCs and mobile devices and provide instant support
>> Improve your efficiency, and focus on delivering more value-add services
>> Di

Re: [Nagios-users] service checks running too often

2012-12-13 Thread Mike Guthrie


On 12/13/2012 12:38 PM, Mark Keisler wrote:
I understand that nagios dynamically adjusts service check times, but 
the puzzling thing is that there is a check that runs every 5 minutes 
but then an extra or two in between.  And yes, the web interface shows 
the next service check as 5 mins out and yet another runs before that 
time hits.
Is there any chance that there could be a second instance of Nagios 
running?   Look for multiple *parent* processes from the following


#modify the nagios binary path to match your system

ps aux | grep /bin/nagios

/etc/init.d/nagios stop

killall -9 nagios

/etc/init.d/nagios start






On Thu, Dec 13, 2012 at 10:24 AM, Mike Guthrie > wrote:


Although some of those start times do seem close together, it's
important to know that the check_interval in Nagios is not
necessarily a hard number. Nagios is continually adjusting and
recalculating the check schedule, so if you need a check to run on
a hard 5mn schedule, you might be better off using cron, and then
pushing the result to Nagios passively.

With that said, access the service details for this service. When
new results come in does the scheduler set the Next Check 5mn out
as expected?



On 12/13/2012 9:43 AM, Mark Keisler wrote:

I'm running Nagios 3.4.1 on RHEL6. I have an issue where I have a
poller (service check) that is running too often and I am not
sure why. I have "service_check_timeout=180" because I had
trouble with the poller running long. Relevant settings for the
service check:

check_period24x7
max_check_attempts  1
normal_check_interval   5
retry_check_interval5

I also set up a tracking logger in the poller to record
"timestamp PID started by PPID : Poll [Start|End] of poller"
2012-12-12_12:26:38 19448 started by 19442 : Poll Start of poller
2012-12-12_12:27:13 19448 started by 19442 : Poll End of poller
2012-12-12_12:28:14 19931 started by 19930 : Poll Start of poller
2012-12-12_12:30:14 19931 started by 19930 : Poll End of poller
2012-12-12_12:31:37 20467 started by 20460 : Poll Start of poller
2012-12-12_12:33:15 20949 started by 20946 : Poll Start of poller
2012-12-12_12:33:15 20467 started by 20460 : Poll End of poller
2012-12-12_12:33:41 20949 started by 20946 : Poll End of poller
2012-12-12_12:36:38 21483 started by 21478 : Poll Start of poller
2012-12-12_12:38:14 21971 started by 21964 : Poll Start of poller
2012-12-12_12:39:17 21483 started by 21478 : Poll End of poller
2012-12-12_12:39:18 21971 started by 21964 : Poll End of poller
2012-12-12_12:41:38 22500 started by 22492 : Poll Start of poller
2012-12-12_12:42:19 22500 started by 22492 : Poll End of poller
2012-12-12_12:43:14 23003 started by 22999 : Poll Start of poller
2012-12-12_12:45:20 23003 started by 22999 : Poll End of poller
2012-12-12_12:46:37 23540 started by 23535 : Poll Start of poller
2012-12-12_12:48:14 24025 started by 24024 : Poll Start of poller
2012-12-12_12:48:20 23540 started by 23535 : Poll End of poller
2012-12-12_12:48:41 24025 started by 24024 : Poll End of poller
2012-12-12_12:51:38 24558 started by 24554 : Poll Start of poller
2012-12-12_12:53:14 25044 started by 25041 : Poll Start of poller
2012-12-12_12:54:35 25044 started by 25041 : Poll End of poller

As you can see, I start to get overlapping pollers. I don't
understand why this would happen. Any hints or clues?



--
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add services
Discover what IT Professionals Know. Rescue delivers
http://p.sf.net/sfu/logmein_12329d2d


___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net  

https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when 
reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null



-- 



Mike Guthrie
Technical Team
___
Nagios Enterprises, LLC
Email:mguth...@nagios.com  
Web:www.nagios.com  



--
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add
services
Discover what IT Professionals Know. Rescue delivers
http://

Re: [Nagios-users] service checks running too often

2012-12-13 Thread Mark Keisler
I understand that nagios dynamically adjusts service check times, but the
puzzling thing is that there is a check that runs every 5 minutes but then
an extra or two in between.  And yes, the web interface shows the next
service check as 5 mins out and yet another runs before that time hits.


On Thu, Dec 13, 2012 at 10:24 AM, Mike Guthrie  wrote:

>  Although some of those start times do seem close together, it's
> important to know that the check_interval in Nagios is not necessarily a
> hard number. Nagios is continually adjusting and recalculating the check
> schedule, so if you need a check to run on a hard 5mn schedule, you might
> be better off using cron, and then pushing the result to Nagios passively.
>
> With that said, access the service details for this service. When new
> results come in does the scheduler set the Next Check 5mn out as expected?
>
>
>
> On 12/13/2012 9:43 AM, Mark Keisler wrote:
>
> I'm running Nagios 3.4.1 on RHEL6. I have an issue where I have a poller
> (service check) that is running too often and I am not sure why. I have
> "service_check_timeout=180" because I had trouble with the poller running
> long. Relevant settings for the service check:
>
> check_period24x7
> max_check_attempts  1
> normal_check_interval   5
> retry_check_interval5
>
> I also set up a tracking logger in the poller to record "timestamp PID
> started by PPID : Poll [Start|End] of poller"
> 2012-12-12_12:26:38 19448 started by 19442 : Poll Start of poller
> 2012-12-12_12:27:13 19448 started by 19442 : Poll End of poller
> 2012-12-12_12:28:14 19931 started by 19930 : Poll Start of poller
> 2012-12-12_12:30:14 19931 started by 19930 : Poll End of poller
> 2012-12-12_12:31:37 20467 started by 20460 : Poll Start of poller
> 2012-12-12_12:33:15 20949 started by 20946 : Poll Start of poller
> 2012-12-12_12:33:15 20467 started by 20460 : Poll End of poller
> 2012-12-12_12:33:41 20949 started by 20946 : Poll End of poller
> 2012-12-12_12:36:38 21483 started by 21478 : Poll Start of poller
> 2012-12-12_12:38:14 21971 started by 21964 : Poll Start of poller
> 2012-12-12_12:39:17 21483 started by 21478 : Poll End of poller
> 2012-12-12_12:39:18 21971 started by 21964 : Poll End of poller
> 2012-12-12_12:41:38 22500 started by 22492 : Poll Start of poller
> 2012-12-12_12:42:19 22500 started by 22492 : Poll End of poller
> 2012-12-12_12:43:14 23003 started by 22999 : Poll Start of poller
> 2012-12-12_12:45:20 23003 started by 22999 : Poll End of poller
> 2012-12-12_12:46:37 23540 started by 23535 : Poll Start of poller
> 2012-12-12_12:48:14 24025 started by 24024 : Poll Start of poller
> 2012-12-12_12:48:20 23540 started by 23535 : Poll End of poller
> 2012-12-12_12:48:41 24025 started by 24024 : Poll End of poller
> 2012-12-12_12:51:38 24558 started by 24554 : Poll Start of poller
> 2012-12-12_12:53:14 25044 started by 25041 : Poll Start of poller
> 2012-12-12_12:54:35 25044 started by 25041 : Poll End of poller
>
> As you can see, I start to get overlapping pollers. I don't understand why
> this would happen. Any hints or clues?
>
>
> --
> LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
> Remotely access PCs and mobile devices and provide instant support
> Improve your efficiency, and focus on delivering more value-add services
> Discover what IT Professionals Know. Rescue 
> delivershttp://p.sf.net/sfu/logmein_12329d2d
>
>
>
> ___
> Nagios-users mailing 
> listNagios-users@lists.sourceforge.nethttps://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting 
> any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
>
>
> --
>
>
> Mike Guthrie
> Technical Team
> ___
> Nagios Enterprises, LLC
> Email:  mguth...@nagios.com
> Web:www.nagios.com
>
>
>
> --
> LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
> Remotely access PCs and mobile devices and provide instant support
> Improve your efficiency, and focus on delivering more value-add services
> Discover what IT Professionals Know. Rescue delivers
> http://p.sf.net/sfu/logmein_12329d2d
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
--
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your 

Re: [Nagios-users] service checks running too often

2012-12-13 Thread Mike Guthrie
Although some of those start times do seem close together, it's 
important to know that the check_interval in Nagios is not necessarily a 
hard number. Nagios is continually adjusting and recalculating the check 
schedule, so if you need a check to run on a hard 5mn schedule, you 
might be better off using cron, and then pushing the result to Nagios 
passively.


With that said, access the service details for this service. When new 
results come in does the scheduler set the Next Check 5mn out as expected?



On 12/13/2012 9:43 AM, Mark Keisler wrote:
I'm running Nagios 3.4.1 on RHEL6. I have an issue where I have a 
poller (service check) that is running too often and I am not sure 
why. I have "service_check_timeout=180" because I had trouble with the 
poller running long. Relevant settings for the service check:


check_period24x7
max_check_attempts  1
normal_check_interval   5
retry_check_interval5

I also set up a tracking logger in the poller to record "timestamp PID 
started by PPID : Poll [Start|End] of poller"

2012-12-12_12:26:38 19448 started by 19442 : Poll Start of poller
2012-12-12_12:27:13 19448 started by 19442 : Poll End of poller
2012-12-12_12:28:14 19931 started by 19930 : Poll Start of poller
2012-12-12_12:30:14 19931 started by 19930 : Poll End of poller
2012-12-12_12:31:37 20467 started by 20460 : Poll Start of poller
2012-12-12_12:33:15 20949 started by 20946 : Poll Start of poller
2012-12-12_12:33:15 20467 started by 20460 : Poll End of poller
2012-12-12_12:33:41 20949 started by 20946 : Poll End of poller
2012-12-12_12:36:38 21483 started by 21478 : Poll Start of poller
2012-12-12_12:38:14 21971 started by 21964 : Poll Start of poller
2012-12-12_12:39:17 21483 started by 21478 : Poll End of poller
2012-12-12_12:39:18 21971 started by 21964 : Poll End of poller
2012-12-12_12:41:38 22500 started by 22492 : Poll Start of poller
2012-12-12_12:42:19 22500 started by 22492 : Poll End of poller
2012-12-12_12:43:14 23003 started by 22999 : Poll Start of poller
2012-12-12_12:45:20 23003 started by 22999 : Poll End of poller
2012-12-12_12:46:37 23540 started by 23535 : Poll Start of poller
2012-12-12_12:48:14 24025 started by 24024 : Poll Start of poller
2012-12-12_12:48:20 23540 started by 23535 : Poll End of poller
2012-12-12_12:48:41 24025 started by 24024 : Poll End of poller
2012-12-12_12:51:38 24558 started by 24554 : Poll Start of poller
2012-12-12_12:53:14 25044 started by 25041 : Poll Start of poller
2012-12-12_12:54:35 25044 started by 25041 : Poll End of poller

As you can see, I start to get overlapping pollers. I don't understand 
why this would happen. Any hints or clues?



--
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add services
Discover what IT Professionals Know. Rescue delivers
http://p.sf.net/sfu/logmein_12329d2d


___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue.
::: Messages without supporting info will risk being sent to /dev/null



--


Mike Guthrie
Technical Team
___
Nagios Enterprises, LLC
Email:  mguth...@nagios.com
Web:www.nagios.com

--
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add services
Discover what IT Professionals Know. Rescue delivers
http://p.sf.net/sfu/logmein_12329d2d___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] service checks running too often

2012-12-13 Thread Mark Keisler
I'm running Nagios 3.4.1 on RHEL6. I have an issue where I have a poller
(service check) that is running too often and I am not sure why. I have
"service_check_timeout=180" because I had trouble with the poller running
long. Relevant settings for the service check:

check_period24x7
max_check_attempts  1
normal_check_interval   5
retry_check_interval5

I also set up a tracking logger in the poller to record "timestamp PID
started by PPID : Poll [Start|End] of poller"
2012-12-12_12:26:38 19448 started by 19442 : Poll Start of poller
2012-12-12_12:27:13 19448 started by 19442 : Poll End of poller
2012-12-12_12:28:14 19931 started by 19930 : Poll Start of poller
2012-12-12_12:30:14 19931 started by 19930 : Poll End of poller
2012-12-12_12:31:37 20467 started by 20460 : Poll Start of poller
2012-12-12_12:33:15 20949 started by 20946 : Poll Start of poller
2012-12-12_12:33:15 20467 started by 20460 : Poll End of poller
2012-12-12_12:33:41 20949 started by 20946 : Poll End of poller
2012-12-12_12:36:38 21483 started by 21478 : Poll Start of poller
2012-12-12_12:38:14 21971 started by 21964 : Poll Start of poller
2012-12-12_12:39:17 21483 started by 21478 : Poll End of poller
2012-12-12_12:39:18 21971 started by 21964 : Poll End of poller
2012-12-12_12:41:38 22500 started by 22492 : Poll Start of poller
2012-12-12_12:42:19 22500 started by 22492 : Poll End of poller
2012-12-12_12:43:14 23003 started by 22999 : Poll Start of poller
2012-12-12_12:45:20 23003 started by 22999 : Poll End of poller
2012-12-12_12:46:37 23540 started by 23535 : Poll Start of poller
2012-12-12_12:48:14 24025 started by 24024 : Poll Start of poller
2012-12-12_12:48:20 23540 started by 23535 : Poll End of poller
2012-12-12_12:48:41 24025 started by 24024 : Poll End of poller
2012-12-12_12:51:38 24558 started by 24554 : Poll Start of poller
2012-12-12_12:53:14 25044 started by 25041 : Poll Start of poller
2012-12-12_12:54:35 25044 started by 25041 : Poll End of poller

As you can see, I start to get overlapping pollers. I don't understand why
this would happen. Any hints or clues?
--
LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial
Remotely access PCs and mobile devices and provide instant support
Improve your efficiency, and focus on delivering more value-add services
Discover what IT Professionals Know. Rescue delivers
http://p.sf.net/sfu/logmein_12329d2d___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null