[Linux-HA] Quick 'death match cycle' question.

Alex Sudakar Mon, 02 Sep 2013 20:24:03 -0700

I've got a very simple question which I suspect betrays my lack of
understanding of something basic.  Could someone help me understand?


If I have a two-node Pacemaker cluster - say, a really simple cluster
of two nodes, A & B, with a solitary network connection between them -
then I have to set no-quorum-policy to 'ignore'.  If the network
connection is broken then both A & B will attempt to STONITH each
other.

Is there anything that would stop an endless cycle of each killing the
other if the actions of the STONITH agents are set to reboot?

I.e.:

-  A & B race to STONITH each other
-  A kills B
-  A assumes resources

-  B reboots
-  B can't see A
-  B kills A
-  B assumes resources

-  A reboots
-  A can't see B
-  A kills B
-  A assumes resources

... etc.

It's to stop this sort of cycle that I've set my STONITH actions to
'off' rather than 'reboot'.

But I was reading the 'Fencing topology' document that Digimer
referenced and I was reminded in my perusal that many people/clusters
use a 'reboot' action.

For a simple quorum-less cluster of two nodes how do those clusters
avoid a never-ending cycle of each node killing the other, if neither
node can 'see' the other via corosync?

It's a very basic question; I think I'm forgetting something obvious.
Thanks for any help!
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

[Linux-HA] Quick 'death match cycle' question.

Reply via email to