Re: [Pacemaker] Need to relax corosync due to backup of VM through snapshot

Lars Marowsky-Bree Thu, 21 Nov 2013 00:19:32 -0800

On 2013-11-20T16:58:01, Gianluca Cecchi <gianluca.cec...@gmail.com> wrote:


> Based on docs  I thought that the timeout should be
> 
> token x token_retransmits_before_loss_const

No, the comments in the corosync.conf.example and man corosync.conf
should be pretty clear, I hope. Can you recommend which phrasing we
should improve?

> SO my current test config is:
>   # diff corosync.conf corosync.conf.pre181113
> 24,25c24
> < #token: 5000
> < token: 120000

A 120s node timeout? That is really, really long. Why is the backup tool
interfering with the scheduling of high priority processes so much? That
sounds like the real bug.

> Any comment?
> Any different strategies successfully used in similar environments
> where high latencies get in place at snapshot deletion when
> consolidate phase of disks is executed?

A setup where a VM apparently can freeze for almost 120s is not suitable
for HA.



Regards,
    Lars

-- 
Architect Storage/HA
SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 
21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde


_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org

Re: [Pacemaker] Need to relax corosync due to backup of VM through snapshot

Reply via email to