Re: [Pacemaker] Pacemaker very often STONITHs other node

Michał Margula Mon, 09 Dec 2013 03:06:31 -0800

W dniu 09.12.2013 11:34, Nikita Staroverov pisze:


So, what happens? :)
Rivendell-B tried to stop XEN-acsystemy01, but couldn't do that due to
time out of operation. Failure on stop operation is fatal by default and
leading to stonith.
Rivendell-A caught this and fence rivendell-B.
You also have got some other problems, like clone-LVM not running (but
it is'nt fatal).

I think your servers is overloaded due to one DRBD for all VM's. You
must increase timeout of operations or do something with cluster
configuration.
As for me, i use configuration with one drbd per virtual machine drive,
moderated timeouts and 802.3ad bonding configuration without problems.


Hello,

Thank you for your answer. I have two drbd - /dev/drbd1 and /dev/drbd2.And I use them as PVs for LVM which has one Volume Group hosting all theVMs.


So should I have as many DRBDs as VMs and get rid off LVM at all?

PS. If it is not a secret what are you recommended timeouts?

Thank you!

--
Michał Margula, alche...@uznam.net.pl, http://alchemyx.uznam.net.pl/
"W życiu piękne są tylko chwile" [Ryszard Riedel]

_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org

Re: [Pacemaker] Pacemaker very often STONITHs other node

Reply via email to