Re: [Pacemaker] meaning of Monitor Interval

Lars Marowsky-Bree Thu, 12 Sep 2013 23:52:44 -0700

On 2013-09-13T12:20:54, Xiaomin Zhang <zhangxiao...@gmail.com> wrote:


> Hi, Gurus:
> Here's a question about service Monitor Interval: considering this value is
> configured as '15' seconds, does this mean corosync/pacemaker will take
> average 15 seconds to schedule failed resource on a ready node?

It'll take about a maximum of 15 seconds to schedule a monitoring
operation that can detect the error.

If the monitor operation returns within <1s with a failure, that'll mean
the recovery will begin real quick.

If the monitor operation *doesn't* return but hit it's "timeout" (and is
aborted by the lrmd), then the recovery will be delayed by that much. So
for an ``interval=15 timeout=30'', it could take up to 45s before
recovery is scheduled.

Note however this on timeouts:
http://advogato.org/person/lmb/diary/108.html Just making them shorter
isn't necessarily always beneficial, either.


Best,
    Lars

-- 
Architect Storage/HA
SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 
21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde


_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org

Re: [Pacemaker] meaning of Monitor Interval

Reply via email to