Re: FastLeaderElection

Flavio Junqueira Mon, 13 Apr 2009 15:08:55 -0700

Hi Raghu, Upon multiple consecutive crashes (or perhaps a network partition), it is possible that we keep electing a faulty server if we only use zxid. We avoid such a problem using a logical clock, as servers only consider changing their proposals if they received a notification from the same or a later epoch. With this mechanism, if an elected server crashes before exercising its role as a leader, it won't be considered in later epochs. Without a logical clock, a server lagging behind in the election could re-introduce the faulty server into the election, and it would be elected again if the faulty server is the one with highest zxid.

Note that we are not using "logical clocks" in the sense of Lamport clocks. We are not incrementing upon every event, but instead only counting rounds of leader election.
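
To make the epoch check concrete, here is a minimal sketch in Java. It is not the actual FastLeaderElection code: the class EpochFilterSketch, the Vote type, and the methods beats, update, and updateZxidOnly are hypothetical names made up for illustration, and the handling of a later epoch is simplified. It only shows why a notification from an earlier round is ignored, and what could happen if votes were compared on zxid and server id alone.

```java
public class EpochFilterSketch {
    /** Hypothetical vote: proposed leader id, that leader's last zxid, and the election round (epoch). */
    static final class Vote {
        final long leaderId;
        final long zxid;
        final long epoch;

        Vote(long leaderId, long zxid, long epoch) {
            this.leaderId = leaderId;
            this.zxid = zxid;
            this.epoch = epoch;
        }
    }

    /** Orders votes by zxid, breaking ties by server id, as described in the thread. */
    static boolean beats(Vote a, Vote b) {
        return a.zxid > b.zxid || (a.zxid == b.zxid && a.leaderId > b.leaderId);
    }

    /** Only considers changing the proposal for notifications from the same or a later epoch. */
    static Vote update(Vote current, Vote received) {
        if (received.epoch < current.epoch) {
            return current; // notification from an earlier round: ignore it
        }
        if (received.epoch > current.epoch) {
            // Simplification (assumption): move to the later round, keeping our own candidate for now.
            current = new Vote(current.leaderId, current.zxid, received.epoch);
        }
        return beats(received, current) ? received : current;
    }

    /** The alternative without a logical clock: compare on zxid and server id only. */
    static Vote updateZxidOnly(Vote current, Vote received) {
        return beats(received, current) ? received : current;
    }

    public static void main(String[] args) {
        Vote mine = new Vote(1, 90, 2);    // server 1, already in round 2
        Vote stale = new Vote(3, 100, 1);  // lagging peer still proposing crashed server 3 from round 1

        System.out.println(update(mine, stale).leaderId);         // prints 1
        System.out.println(updateZxidOnly(mine, stale).leaderId); // prints 3
    }
}
```

In this example the lagging peer's round-1 proposal for server 3 is dropped by update, while updateZxidOnly switches to server 3 because it has the highest zxid.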


-Flavio

On Apr 13, 2009, at 8:55 PM, rag...@yahoo.com wrote:

Could someone please throw some light on this? Thanks.

-Raghu



----- Original Message ----
From: "rag...@yahoo.com" <rag...@yahoo.com>
To: zookeeper-u...@hadoop.apache.org
Sent: Friday, 10 April, 2009 8:11:34
Subject: FastLeaderElection


Hi,
Could someone please explain quickly why a logical clock is used in FastLeaderElection? It looks to me like the peers can converge on a leader (with highest zxid, or server id if zxids are the same) even without the logical clock. Maybe I am missing something here; I could not figure out why the logical clock is needed.
Thanks
Raghu
