On 2008-08-20T18:33:47, Robert <[EMAIL PROTECTED]> wrote:

> Well, from my perspective - which of course is the perspective of a user 
> and not the developer of heartbeat - it works. By "works" I mean, when I 
> "pull" the cable between the datacenters, the resources are active on one 
> node only (which node that will be cannot be predicted). When I put 
> the cable back, the resource may switch sides, but is active on one node 
> only. So the quorum server makes sure - in this special 2 node setup - that 
> the resource is active on one node only. In my world this means "it works" 
> :)

It doesn't work if combined with fencing. Try it. It will fail. The
nodes will block indefinitely, either because they can't fence the other
side or because they cannot/do not commit suicide. And if there are
stop failures, the cluster will not be able to recover.

>> It will only defer to the quorum server in case majority is lost. If the
>> quorum server then grants quorum, while the other side still has
>> majority, both sides will have quorum.
> Why is that? The other node sees the node failure too and a new voting 
> session is started - isn't it? So if the heartbeat is lost, both nodes 
> start to check the quorum, both fail to get a majority, and ONE is granted 
> quorum via quorumd. Isn't this how it is expected to work? Isn't this what 
> the quorum plugin architecture can be used for?

This "works" only for 2 nodes.
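
To illustrate, here is a minimal sketch in Python. It is a simplified
model, not heartbeat or quorumd code: the function name is made up, and
it assumes that a partition keeps quorum if it holds a strict majority,
that only a partition without majority asks the quorum server, and that
the quorum server grants quorum to at most one asker.

```python
def partitions_with_quorum(partition_sizes):
    """Return the indices of the partitions that end up with quorum.

    Simplified model (an assumption, not heartbeat code):
    - a partition with a strict majority of all nodes has quorum;
    - a partition without majority defers to a hypothetical quorum
      server, which grants quorum to at most one asker (the first).
    """
    total = sum(partition_sizes)
    winners = []
    granted_by_server = False
    for index, size in enumerate(partition_sizes):
        if 2 * size > total:
            # Strict majority: the quorum server is never consulted.
            winners.append(index)
        elif not granted_by_server:
            # Majority lost: ask the quorum server, first asker wins.
            granted_by_server = True
            winners.append(index)
    return winners


# Two nodes split 1/1: neither side has majority, one side is granted.
print(partitions_with_quorum([1, 1]))
# Three nodes split 2/1: one side keeps majority, the other is granted.
print(partitions_with_quorum([2, 1]))
```

Under these assumptions the 1/1 split yields a single partition with
quorum, while the 2/1 split yields two, which is the "both sides will
have quorum" case quoted above.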

>> (The above line "works" for the very special case of two nodes only, but
>> that's luck, not design.)
> Does "luck" mean that this behaviour will change in future heartbeat 
> releases OR that this kind of setup is limited to 2 node setups only?

As I don't see anyone maintaining the quorum infrastructure in
heartbeat, it's unlikely to change. If it works for you, you're lucky.

>> DO NOT USE THE QUORUM SERVER.
> One last question - and sorry for the pointed remark - why is it part of the 
> official heartbeat distribution then (I mean the binary package, but the 
> source) ?

Because Alan thinks (thought?) the quorum server worked, and he was the
maintainer of it. I maintain he's quite wrong on that count. 

In particular, the interaction between quorumd and fencing is broken, and
the fact that it does not work for more than 2 nodes is a severe restriction.

Everyone is of course free to use it, but I won't have anything to do
with it.


Regards,
    Lars

-- 
Teamlead Kernel, SuSE Labs, Research and Development
SUSE LINUX Products GmbH, GF: Markus Rex, HRB 16746 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
