Andrey created ZOOKEEPER-2400:
---------------------------------

             Summary: ZooKeeper not starting: Follower is ahead of the leader
                 Key: ZOOKEEPER-2400
                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2400
             Project: ZooKeeper
          Issue Type: Bug
          Components: quorum
    Affects Versions: 3.4.6
            Reporter: Andrey


Steps to reproduce:
# Select deprecated algorithm in zoo.cfg: {code}electionAlg=0{code}
# Start zookeeper cluster: A(index 3),B(index 1),C(index 2) nodes
# Stop A node.
# Make some change to zk data. i.e. re-create ephemeral node. Make sure 
currentEpoch increased in B and C nodes.
# currentEpoch/acceptedEpoch in node A less than B/C epoch
# Stop node B. Zookeeper cluster is not available
# Start node A. In A's node logs:
{code}
LEADING [quorum.QuorumPeer] [QuorumPeer[myid=3]/0:0:0:0:0:0:0:0:15523]
LEADING - LEADER ELECTION TOOK - 1458721180995 [quorum.Leader]
Follower sid: 2 : info : 
org.apache.zookeeper.server.quorum.QuorumPeer$QuorumServer@3a888c1
java.io.IOException: Follower is ahead of the leader, leader summary: 10 
(current epoch), 42949672964 (last zxid)
        at 
org.apache.zookeeper.server.quorum.Leader.waitForEpochAck(Leader.java:894)
        at 
org.apache.zookeeper.server.quorum.LearnerHandler.run(LearnerHandler.java:365)

...

Follower sid: 1 : info : 
org.apache.zookeeper.server.quorum.QuorumPeer$QuorumServer@5001b9f5

...

java.lang.InterruptedException: Timeout while waiting for epoch to be acked by 
quorum
        at 
org.apache.zookeeper.server.quorum.Leader.waitForEpochAck(Leader.java:915)
        at org.apache.zookeeper.server.quorum.Leader.lead(Leader.java:394)
        at 
org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:799)
{code}

The logs above will be printed indefinitely and cluster won't recover.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to