ZiLioN ZilLioN wrote:


Date: Wed, 30 Jul 2008 08:15:04 +0200
From: [EMAIL PROTECTED]
To: [email protected]
Subject: Re: [Linux-HA] Crazy because of SplitBrain!

When the node A STONITH to node B, the node B is rebooted. When the node B has 
be rebooted, node B do not start the resources again?
Don't start heartbeat at boot time. I don't know if that's the suggested method, but that's the way I do it and that certainly works.

Ok, this method certainly will work

I can to specify with "contraints" that the node with less score is the node to 
kill, the node will die?
When the nodes don't see each other, they each only compute scores for themselves.

To end the topic:

Suppose a scenario (in the same net) where node A has connectivity to Internet 
and node B no. Then only node A can offer the service.

If node A STONITH node B and node B STONITH node A...

If in this moment the nodes lose the communication between them, the node B 
STONIH to node A sucesfully. The node A has died and the service can´t offer to 
Internet.

Disaster!.

Well how likely is that?

This is the problem if you can´t decide what node should die :(
It´s not possible that: the node A has the resource STONITH started because he 
has communication (goal shoot the node B) with Internet and node B has the 
resource STONITH stopped because he hasn´t communication with Internet. When 
both lose the communication between them, only the node A will shoot to the 
node B.

That _is_ possible. Look into resource location constraints and pingd.

Prevent the stonith resource to run on a node with no gateway ping connectivity.

Important question:

Can the fencing method STONITH uses in a scenario distributed geographically? 
if both nodes lose the communication, How they use STONITH?

Assuming the nodes are geographically separated and communicate over the internet (which isn't exactly a great idea). This implies the stonith devices being used over the internet, too. If this communication breaks, nodeA cannot execute STONITH for nodeB and vice versa. Since nodes cannot shoot themselves, the STONITH commands would be queued until they can be executed. So I guess when communication comes back, the first node to shoot the other one wins.

Please correct me if I'm wrong here.

Regards
Dominik
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to