I've got a very simple question which I suspect betrays my lack of understanding of something basic. Could someone help me understand?
If I have a two-node Pacemaker cluster - say, a really simple cluster of two nodes, A & B, with a solitary network connection between them - then I have to set no-quorum-policy to 'ignore'. If the network connection is broken then both A & B will attempt to STONITH each other. Is there anything that would stop an endless cycle of each killing the other if the actions of the STONITH agents are set to reboot? I.e.: - A & B race to STONITH each other - A kills B - A assumes resources - B reboots - B can't see A - B kills A - B assumes resources - A reboots - A can't see B - A kills B - A assumes resources ... etc. It's to stop this sort of cycle that I've set my STONITH actions to 'off' rather than 'reboot'. But I was reading the 'Fencing topology' document that Digimer referenced and I was reminded in my perusal that many people/clusters use a 'reboot' action. For a simple quorum-less cluster of two nodes how do those clusters avoid a never-ending cycle of each node killing the other, if neither node can 'see' the other via corosync? It's a very basic question; I think I'm forgetting something obvious. Thanks for any help! _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
