[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
    [ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16789626#comment-16789626 ]

Lawrence Craft commented on ZOOKEEPER-2307:
-------------------------------------------

Has there been any movement on this? We are encountering the same bug. It looks like the latest patch no longer applies properly; the changes look simple enough to implement, though.

[~arshad.mohammad] have you submitted a pull request for this? I looked through GitHub and couldn't find one. I can rebase the patch and submit one if it would make things easier.

> ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
> --------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-2307
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2307
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>            Reporter: Mohammad Arshad
>            Assignee: Mohammad Arshad
>            Priority: Major
>             Fix For: 3.6.0, 3.5.5
>
>         Attachments: ZOOKEEPER-2307-01.patch, ZOOKEEPER-2307-02.patch,
> ZOOKEEPER-2307-03.patch, ZOOKEEPER-2307-04.patch
>
>
> This issue occurred in one of our test environments, where the disk was being
> switched to read-only very frequently.
> The scenario is as follows:
> # Configure a three-node ZooKeeper cluster; let's say the nodes are A, B and C.
> # Start A and B. Both A and B start successfully, and the quorum is running.
> # Start C. Because of an IO error, C fails to update the acceptedEpoch file, but C
> still starts successfully and joins the quorum as a follower.
> # Stop C.
> # Start C. The exception below, with the message "The accepted epoch, 0 is less than
> the current epoch, 1", is thrown:
> {code}
> 2015-10-29 16:52:32,942 [myid:3] - ERROR [main:QuorumPeer@784] - Unable to load database on disk
> java.io.IOException: The accepted epoch, 0 is less than the current epoch, 1
>     at org.apache.zookeeper.server.quorum.QuorumPeer.loadDataBase(QuorumPeer.java:781)
>     at org.apache.zookeeper.server.quorum.QuorumPeer.start(QuorumPeer.java:720)
>     at org.apache.zookeeper.server.quorum.QuorumPeerMain.runFromConfig(QuorumPeerMain.java:202)
>     at org.apache.zookeeper.server.quorum.QuorumPeerMain.initializeAndRun(QuorumPeerMain.java:139)
>     at org.apache.zookeeper.server.quorum.QuorumPeerMain.main(QuorumPeerMain.java:88)
> 2015-10-29 16:52:32,946 [myid:3] - ERROR [main:QuorumPeerMain@111] - Unexpected exception, exiting abnormally
> java.lang.RuntimeException: Unable to run quorum server
>     at org.apache.zookeeper.server.quorum.QuorumPeer.loadDataBase(QuorumPeer.java:785)
>     at org.apache.zookeeper.server.quorum.QuorumPeer.start(QuorumPeer.java:720)
>     at org.apache.zookeeper.server.quorum.QuorumPeerMain.runFromConfig(QuorumPeerMain.java:202)
>     at org.apache.zookeeper.server.quorum.QuorumPeerMain.initializeAndRun(QuorumPeerMain.java:139)
>     at org.apache.zookeeper.server.quorum.QuorumPeerMain.main(QuorumPeerMain.java:88)
> Caused by: java.io.IOException: The accepted epoch, 0 is less than the current epoch, 1
>     at org.apache.zookeeper.server.quorum.QuorumPeer.loadDataBase(QuorumPeer.java:781)
> {code}

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
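For readers following the thread, the startup failure in the issue description can be illustrated with a small standalone sketch. This is not the code from QuorumPeer.loadDataBase or from any of the attached patches. The class name EpochCheck, its helper methods, the plain decimal file format, and the use of "acceptedEpoch" and "currentEpoch" as file names in a temporary directory are all assumptions made for illustration. It only shows the kind of consistency check that could produce the reported message when the acceptedEpoch file is left stale while currentEpoch has advanced.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical illustration only; not ZooKeeper's actual implementation.
public class EpochCheck {

    // Reads a single decimal long from the given file (assumed file format).
    public static long readEpoch(Path file) throws IOException {
        String text = new String(Files.readAllBytes(file), StandardCharsets.UTF_8).trim();
        return Long.parseLong(text);
    }

    // Writes the epoch as decimal text (assumed file format).
    public static void writeEpoch(Path file, long epoch) throws IOException {
        Files.write(file, Long.toString(epoch).getBytes(StandardCharsets.UTF_8));
    }

    // Fails if the persisted accepted epoch is behind the persisted current epoch.
    public static void verifyEpochs(Path dataDir) throws IOException {
        long accepted = readEpoch(dataDir.resolve("acceptedEpoch"));
        long current = readEpoch(dataDir.resolve("currentEpoch"));
        if (accepted < current) {
            throw new IOException("The accepted epoch, " + accepted
                    + " is less than the current epoch, " + current);
        }
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("zk-epoch-demo");
        // Simulates step 3: the acceptedEpoch update was lost (still 0),
        // but currentEpoch was later persisted as 1.
        writeEpoch(dir.resolve("acceptedEpoch"), 0);
        writeEpoch(dir.resolve("currentEpoch"), 1);
        try {
            // Simulates step 5: the check at the next start fails.
            verifyEpochs(dir);
        } catch (IOException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

The point of the sketch is that the two files are written independently, so a failed write to one of them can leave a state that the next start rejects.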
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
    [ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16695356#comment-16695356 ]

Michael K. Edwards commented on ZOOKEEPER-2307:
-----------------------------------------------

Should this be a release blocker for 3.5.5?

> ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
> --------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-2307
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2307
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>            Reporter: Mohammad Arshad
>            Assignee: Mohammad Arshad
>            Priority: Major
>             Fix For: 3.6.0, 3.5.5
>
>         Attachments: ZOOKEEPER-2307-01.patch, ZOOKEEPER-2307-02.patch,
> ZOOKEEPER-2307-03.patch, ZOOKEEPER-2307-04.patch
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
    [ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16665325#comment-16665325 ]

Hadoop QA commented on ZOOKEEPER-2307:
--------------------------------------

-1 overall.  Here are the results of testing the latest attachment
  http://issues.apache.org/jira/secure/attachment/12822995/ZOOKEEPER-2307-04.patch
  against trunk revision 1ce2ca8107438d283581d18d064a25bd6b74adf7.

    +1 @author.  The patch does not contain any @author tags.

    +1 tests included.  The patch appears to include 5 new or modified tests.

    -1 patch.  The patch command could not apply the patch.

Console output: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/3707//console

This message is automatically generated.

> ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
> --------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-2307
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2307
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>            Reporter: Mohammad Arshad
>            Assignee: Mohammad Arshad
>            Priority: Major
>             Fix For: 3.6.0, 3.5.5
>
>         Attachments: ZOOKEEPER-2307-01.patch, ZOOKEEPER-2307-02.patch,
> ZOOKEEPER-2307-03.patch, ZOOKEEPER-2307-04.patch
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
    [ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16665018#comment-16665018 ]

Mohammad Arshad commented on ZOOKEEPER-2307:
--------------------------------------------

The work for this issue was completed long ago. The latest patch now needs to be rebased. I will rebase the changes and raise a PR.

> ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
> --------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-2307
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2307
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>            Reporter: Mohammad Arshad
>            Assignee: Mohammad Arshad
>            Priority: Major
>             Fix For: 3.6.0, 3.5.5
>
>         Attachments: ZOOKEEPER-2307-01.patch, ZOOKEEPER-2307-02.patch,
> ZOOKEEPER-2307-03.patch, ZOOKEEPER-2307-04.patch
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
    [ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16664961#comment-16664961 ]

Andor Molnar commented on ZOOKEEPER-2307:
-----------------------------------------

[~arshad.mohammad] This one looks important and easy to fix. Are you still working on this?

Reverting this to Major, so it won't block the stable release.

> ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
> --------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-2307
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2307
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>            Reporter: Mohammad Arshad
>            Assignee: Mohammad Arshad
>            Priority: Critical
>             Fix For: 3.6.0, 3.5.5
>
>         Attachments: ZOOKEEPER-2307-01.patch, ZOOKEEPER-2307-02.patch,
> ZOOKEEPER-2307-03.patch, ZOOKEEPER-2307-04.patch
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
    [ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15832304#comment-15832304 ]

Hadoop QA commented on ZOOKEEPER-2307:
--------------------------------------

-1 overall.  Here are the results of testing the latest attachment
  http://issues.apache.org/jira/secure/attachment/12822995/ZOOKEEPER-2307-04.patch
  against trunk revision 42c75b5f2457f8ea5b4106ce5dc1c34c330361c0.

    +1 @author.  The patch does not contain any @author tags.

    +1 tests included.  The patch appears to include 5 new or modified tests.

    -1 patch.  The patch command could not apply the patch.

Console output: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/3564//console

This message is automatically generated.

> ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
> --------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-2307
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2307
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>            Reporter: Arshad Mohammad
>            Assignee: Arshad Mohammad
>            Priority: Critical
>             Fix For: 3.5.3
>
>         Attachments: ZOOKEEPER-2307-01.patch, ZOOKEEPER-2307-02.patch,
> ZOOKEEPER-2307-03.patch, ZOOKEEPER-2307-04.patch
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
    [ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15415095#comment-15415095 ]

Arshad Mohammad commented on ZOOKEEPER-2307:
--------------------------------------------

Hi [~rgs], I hope you remember this issue.
I have submitted a new patch, ZOOKEEPER-2307-04.patch, which addresses your comments:
* updated logic to verify current epoch in Zab1_0Test
* removed unused method from org.apache.zookeeper.server.quorum.Zab1_0Test.deserializeSnapshot()

> ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
> --------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-2307
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2307
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>            Reporter: Arshad Mohammad
>            Assignee: Arshad Mohammad
>            Priority: Critical
>             Fix For: 3.5.3
>
>         Attachments: ZOOKEEPER-2307-01.patch, ZOOKEEPER-2307-02.patch,
> ZOOKEEPER-2307-03.patch, ZOOKEEPER-2307-04.patch
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
    [ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15415083#comment-15415083 ]

Arshad Mohammad commented on ZOOKEEPER-2307:
--------------------------------------------

bq. -1 core tests. The patch failed core unit tests.
{{org.apache.zookeeper.test.AsyncHammerTest.testHammer}} failed, but the failure is not related to this patch. I verified locally that testHammer passes.

> ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
> --------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-2307
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2307
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>            Reporter: Arshad Mohammad
>            Assignee: Arshad Mohammad
>            Priority: Critical
>             Fix For: 3.5.3
>
>         Attachments: ZOOKEEPER-2307-01.patch, ZOOKEEPER-2307-02.patch,
> ZOOKEEPER-2307-03.patch, ZOOKEEPER-2307-04.patch
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
    [ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15415066#comment-15415066 ]

Hadoop QA commented on ZOOKEEPER-2307:
--------------------------------------

-1 overall.  Here are the results of testing the latest attachment
  http://issues.apache.org/jira/secure/attachment/12822995/ZOOKEEPER-2307-04.patch
  against trunk revision 1755379.

    +1 @author.  The patch does not contain any @author tags.

    +1 tests included.  The patch appears to include 5 new or modified tests.

    +1 javadoc.  The javadoc tool did not generate any warning messages.

    +1 javac.  The applied patch does not increase the total number of javac compiler warnings.

    +1 findbugs.  The patch does not introduce any new Findbugs (version 2.0.3) warnings.

    +1 release audit.  The applied patch does not increase the total number of release audit warnings.

    -1 core tests.  The patch failed core unit tests.

    +1 contrib tests.  The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/3347//testReport/
Findbugs warnings: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/3347//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/3347//console

This message is automatically generated.

> ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
> --------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-2307
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2307
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>            Reporter: Arshad Mohammad
>            Assignee: Arshad Mohammad
>            Priority: Critical
>             Fix For: 3.5.3
>
>         Attachments: ZOOKEEPER-2307-01.patch, ZOOKEEPER-2307-02.patch,
> ZOOKEEPER-2307-03.patch, ZOOKEEPER-2307-04.patch
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
    [ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15343002#comment-15343002 ]

Hadoop QA commented on ZOOKEEPER-2307:
--------------------------------------

-1 overall.  Here are the results of testing the latest attachment
  http://issues.apache.org/jira/secure/attachment/12772143/ZOOKEEPER-2307-03.patch
  against trunk revision 1748630.

    +1 @author.  The patch does not contain any @author tags.

    +1 tests included.  The patch appears to include 8 new or modified tests.

    -1 patch.  The patch command could not apply the patch.

Console output: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/3227//console

This message is automatically generated.

> ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
> --------------------------------------------------------------------------
>
>                 Key: ZOOKEEPER-2307
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2307
>             Project: ZooKeeper
>          Issue Type: Bug
>          Components: server
>            Reporter: Arshad Mohammad
>            Assignee: Arshad Mohammad
>            Priority: Critical
>             Fix For: 3.5.3
>
>         Attachments: ZOOKEEPER-2307-01.patch, ZOOKEEPER-2307-02.patch,
> ZOOKEEPER-2307-03.patch
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
[ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15044101#comment-15044101 ]

Raul Gutierrez Segales commented on ZOOKEEPER-2307:
---------------------------------------------------

Ignore my comment about getQuorumPeer not being used, forgot to look at the new file src/java/test/org/apache/zookeeper/test/EpochWriteFailureTest.java...
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
[ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15044099#comment-15044099 ]

Raul Gutierrez Segales commented on ZOOKEEPER-2307:
---------------------------------------------------

The patch generally lgtm, a few nits and observations:

* do you mind adding javadocs for QuorumPeer#setCurrentEpoch and QuorumPeer#setAcceptedEpoch? In particular, noting why they don't have to be synchronized would be useful (i.e.: they can only be called from the Leader and Learner classes and it all happens from one thread)

* in src/java/test/org/apache/zookeeper/server/quorum/QuorumPeerTestBase.java why is getQuorumPeer added? Can't see where it's used. Also, there are some whitespaces/tabs before the method...

* i think waitUptoAFileWriteTime() is a bit racy and hackish and will break in the future. as much as i dislike sleep() calls in tests, in this case it's probably alright to loop a few times until getAcceptedEpoch() and getCurrentEpoch() converge

Other than that, it looks good. Thanks [~arshad.mohammad]!

cc [~fpj] for further thoughts.
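The polling loop suggested in the last bullet of the review above could look roughly like the sketch below. The helper name EpochWait, its signature, and the use of LongSupplier are assumptions made for illustration; none of this is taken from the attached patch.

```java
import java.util.function.LongSupplier;

/** Hypothetical test helper; the name and signature are illustrative only. */
class EpochWait {
    /**
     * Polls until both suppliers report the same value or the attempts run out.
     * Returns true if the two values converged.
     */
    static boolean waitForConvergence(LongSupplier accepted, LongSupplier current,
                                      int maxAttempts, long sleepMillis)
            throws InterruptedException {
        for (int i = 0; i < maxAttempts; i++) {
            if (accepted.getAsLong() == current.getAsLong()) {
                return true;
            }
            // Bounded sleep between checks instead of one fixed, timing-sensitive wait.
            Thread.sleep(sleepMillis);
        }
        // One last check after the final sleep.
        return accepted.getAsLong() == current.getAsLong();
    }
}
```

A test would pass two lambdas that read the accepted and current epoch from the peer under test. If those getters declare checked exceptions, the lambdas would have to wrap them.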
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
[ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15003791#comment-15003791 ]

Arshad Mohammad commented on ZOOKEEPER-2307:
--------------------------------------------

{quote}
-1 javadoc. The javadoc tool appears to have generated 1 warning messages.
{quote}
This patch has not introduced any new javadoc warning
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
[ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15003689#comment-15003689 ]

Hadoop QA commented on ZOOKEEPER-2307:
--------------------------------------

-1 overall. Here are the results of testing the latest attachment
  http://issues.apache.org/jira/secure/attachment/12772143/ZOOKEEPER-2307-03.patch
  against trunk revision 1713774.

    +1 @author. The patch does not contain any @author tags.

    +1 tests included. The patch appears to include 8 new or modified tests.

    -1 javadoc. The javadoc tool appears to have generated 1 warning messages.

    +1 javac. The applied patch does not increase the total number of javac compiler warnings.

    +1 findbugs. The patch does not introduce any new Findbugs (version 2.0.3) warnings.

    +1 release audit. The applied patch does not increase the total number of release audit warnings.

    +1 core tests. The patch passed core unit tests.

    +1 contrib tests. The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2956//testReport/
Findbugs warnings: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2956//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2956//console

This message is automatically generated.
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
[ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991451#comment-14991451 ]

Hadoop QA commented on ZOOKEEPER-2307:
--------------------------------------

-1 overall. Here are the results of testing the latest attachment
  http://issues.apache.org/jira/secure/attachment/12770769/ZOOKEEPER-2307-02.patch
  against trunk revision 1712218.

    +1 @author. The patch does not contain any @author tags.

    +1 tests included. The patch appears to include 5 new or modified tests.

    +1 javadoc. The javadoc tool did not generate any warning messages.

    +1 javac. The applied patch does not increase the total number of javac compiler warnings.

    +1 findbugs. The patch does not introduce any new Findbugs (version 2.0.3) warnings.

    -1 release audit. The applied patch generated 1 release audit warnings (more than the trunk's current 0 warnings).

    -1 core tests. The patch failed core unit tests.

    +1 contrib tests. The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2943//testReport/
Release audit warnings: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2943//artifact/trunk/patchprocess/patchReleaseAuditProblems.txt
Findbugs warnings: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2943//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2943//console

This message is automatically generated.
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
[ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14984827#comment-14984827 ]

Arshad Mohammad commented on ZOOKEEPER-2307:
--------------------------------------------

Hi [~fpj],
What is your opinion on the proposed fix now? Does anyone else have an opinion on it?
A JUnit test case patch is already attached; I think it will help to quickly understand the scenario.
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
[ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14984821#comment-14984821 ]

Arshad Mohammad commented on ZOOKEEPER-2307:
--------------------------------------------

The issue seems similar but actually it is not. Scenarios are completely different.
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
[ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14981680#comment-14981680 ]

Akihiro Suda commented on ZOOKEEPER-2307:
-----------------------------------------

ZOOKEEPER-2162 seems very similar
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
[ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980947#comment-14980947 ]

Arshad Mohammad commented on ZOOKEEPER-2307:
--------------------------------------------

Thanks [~fpj] for your quick response.

AFAIK, the operation order is the real problem.
Have a look at this code snippet in {{org.apache.zookeeper.server.quorum.Learner.registerWithLeader(int)}}:
{code}
if (newEpoch > self.getAcceptedEpoch()) {
    wrappedEpochBytes.putInt((int)self.getCurrentEpoch());
    self.setAcceptedEpoch(newEpoch);
} else if (newEpoch == self.getAcceptedEpoch()) {
    // since we have already acked an epoch equal to the leaders, we cannot ack
    // again, but we still need to send our lastZxid to the leader so that we can
    // sync with it if it does assume leadership of the epoch.
    // the -1 indicates that this reply should not count as an ack for the new epoch
    wrappedEpochBytes.putInt(-1);
}
{code}
Suppose newEpoch=1 and self.getAcceptedEpoch()=0.
If {{self.setAcceptedEpoch(newEpoch);}} throws an error while writing to the file, the server goes back to leader election. When it reaches {{if (newEpoch > self.getAcceptedEpoch())}} again, self.getAcceptedEpoch() returns 1 this time, so the server takes the else-if branch and proceeds to Leader.ACKEPOCH.
self.getAcceptedEpoch() should not have returned 1, because the initial value was 0 and setAcceptedEpoch() was unsuccessful.
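The sequence described in the comment above can be replayed with a small stand-alone model. This is only a sketch under stated assumptions: StaleEpochDemo, its fields, and the simulated "disk" are hypothetical and are not ZooKeeper classes. Only the shape of the branch and the memory-before-disk ordering come from the thread.

```java
import java.io.IOException;

/** Hypothetical model of the failure; the "disk" is a plain field, not a real file. */
class StaleEpochDemo {
    long acceptedEpoch = 0;      // runtime (in-memory) value
    long diskAcceptedEpoch = 0;  // what a restart would read back
    boolean failWrites = true;   // simulates the read-only disk

    // Ordering reported in the thread: memory is updated first, then the file.
    void setAcceptedEpoch(long e) throws IOException {
        acceptedEpoch = e;
        if (failWrites) {
            throw new IOException("simulated write failure");
        }
        diskAcceptedEpoch = e;
    }

    // Same branch shape as the quoted registerWithLeader snippet; returns the branch taken.
    String register(long newEpoch) throws IOException {
        if (newEpoch > acceptedEpoch) {
            setAcceptedEpoch(newEpoch);
            return "accepted";
        } else if (newEpoch == acceptedEpoch) {
            return "already-acked";
        }
        return "rejected";
    }

    public static void main(String[] args) {
        StaleEpochDemo peer = new StaleEpochDemo();
        try {
            peer.register(1);
        } catch (IOException ex) {
            System.out.println("first attempt failed: " + ex.getMessage());
        }
        try {
            // No write is attempted here, so the file is never corrected.
            System.out.println("second attempt: " + peer.register(1));
        } catch (IOException ex) {
            throw new AssertionError(ex);
        }
        System.out.println("memory=" + peer.acceptedEpoch + " disk=" + peer.diskAcceptedEpoch);
    }
}
```

After the second attempt the in-memory value is 1 while the simulated on-disk value is still 0. That matches the reported restart failure, where the accepted epoch read from disk is lower than the current epoch.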
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
[ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980746#comment-14980746 ]

Flavio Junqueira commented on ZOOKEEPER-2307:
---------------------------------------------

I don't see a problem with inverting the order of the operations, but I'm thinking that these methods that update epochs might need to be synchronized. The problem you're describing seems to be due to concurrent access.
[jira] [Commented] (ZOOKEEPER-2307) ZooKeeper not starting because acceptedEpoch is less than the currentEpoch
[ https://issues.apache.org/jira/browse/ZOOKEEPER-2307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980633#comment-14980633 ]

Arshad Mohammad commented on ZOOKEEPER-2307:
--------------------------------------------

There is a problem in the code. The run-time value of acceptedEpoch is updated before the value is written to the acceptedEpoch file:
{code}
public void setAcceptedEpoch(long e) throws IOException {
    acceptedEpoch = e;                            // run-time value is updated first
    writeLongToFile(ACCEPTED_EPOCH_FILENAME, e);  // the file write may then fail
}
{code}
To fix the issue, we should first write to the file and then update the run-time value. The same change should be made for currentEpoch as well.
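
The reordering proposed in the comment above (write to the file first, then update the run-time value) can be sketched as follows. This is a minimal illustration, not the attached patch: EpochHolder, its dir field, and the simplified writeLongToFile body are assumptions made here, and only the method name setAcceptedEpoch and the constant ACCEPTED_EPOCH_FILENAME come from the quoted snippet.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical stand-in for the epoch handling; not ZooKeeper's actual QuorumPeer.
class EpochHolder {
    static final String ACCEPTED_EPOCH_FILENAME = "acceptedEpoch";

    private final Path dir;      // assumed directory that holds the epoch file
    private long acceptedEpoch;  // run-time value

    EpochHolder(Path dir) {
        this.dir = dir;
    }

    long getAcceptedEpoch() {
        return acceptedEpoch;
    }

    // Persist first. If the write throws, the assignment below never runs,
    // so the run-time value cannot get ahead of what is on disk.
    void setAcceptedEpoch(long e) throws IOException {
        writeLongToFile(ACCEPTED_EPOCH_FILENAME, e);
        acceptedEpoch = e;
    }

    // Simplified helper for this sketch; the real helper may write differently.
    private void writeLongToFile(String name, long value) throws IOException {
        Files.write(dir.resolve(name),
                Long.toString(value).getBytes(StandardCharsets.UTF_8));
    }
}
```

If the directory cannot be written to (for example, a read-only disk), setAcceptedEpoch throws and getAcceptedEpoch still returns the previous value. The same pattern would apply to a currentEpoch setter.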