[
https://issues.apache.org/jira/browse/ZOOKEEPER-1700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13652290#comment-13652290
]
Patrick Hunt commented on ZOOKEEPER-1700:
-----------------------------------------
[~shralex] this is very odd. It looks to me like a jvm issue - I find that hard
to believe but I don't know what else it could be.
It would explain why only I see this. (ubuntu jdk1.6.0_37)
the following code
{noformat}
public synchronized QuorumVerifier getLastSeenQuorumVerifier(){
LOG.info("called getLastSeenQuorumVerifier " + lastSeenQuorumVerifier);
return lastSeenQuorumVerifier;
}
public synchronized void connectNewPeers(){
LOG.info("connectnewpeers called");
QuorumVerifier qv = getQuorumVerifier();
LOG.info("got qv");
QuorumVerifier lsqv2 = getLastSeenQuorumVerifier();
LOG.info("got lsqv2");
{noformat}
results in this
{noformat}
2013-05-08 12:59:00,143 [myid:] - WARN
[QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:QuorumPeer@1235] -
setLastSeenQuorumVerifier called with stale config 0. Current version: 0
2013-05-08 12:59:00,144 [myid:] - INFO
[QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:QuorumPeer@1240] - calling
connectnewpeers
2013-05-08 12:59:00,144 [myid:] - INFO
[QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:QuorumPeer@1200] - connectnewpeers
called
2013-05-08 12:59:00,144 [myid:] - INFO
[QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:QuorumPeer@1202] - got qv
2013-05-08 12:59:04,140 [myid:] - INFO
[QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:Leader@585] - Shutting down
2013-05-08 12:59:04,140 [myid:] - INFO
[QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:Leader@591] - Shutdown called
java.lang.Exception: shutdown Leader! reason: Waiting for a quorum of
followers, only synced with sids: [ [1] ]
{noformat}
No exception is generated (caught throwable in call to connectNewPeer)
I'm about to try with a different jvm version.
> FLETest consistently failing - setLastSeenQuorumVerifier seems to be hanging
> ----------------------------------------------------------------------------
>
> Key: ZOOKEEPER-1700
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1700
> Project: ZooKeeper
> Issue Type: Bug
> Components: quorum
> Affects Versions: 3.5.0
> Reporter: Patrick Hunt
> Assignee: Flavio Junqueira
> Priority: Critical
> Fix For: 3.5.0
>
> Attachments: ZOOKEEPER-1700.patch
>
>
> I'm consistently seeing a failure on my laptop when running the FLETest
> "testJoin" test. What seems to be happening is that the call to
> setLastSeenQuorumVerifier is hanging.
> See the following log from the test, notice 17:35:57 for the period in
> question. Note that I turned on debug logging and added a few log messages
> around the call to setLastSeenQuorumVerifier (you can see the code enter but
> never leave)
> Note: I've applied ZOOKEEPER-1324 to trunk code and then run this test but
> that doesn't seem to help. Also note that this test is passing consistently
> when run against branch-3.4.
> {noformat}
> 2013-05-07 17:35:57,859 [myid:] - INFO
> [QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:Follower@65] - FOLLOWING - LEADER
> ELECTION TOOK - 16
> 2013-05-07 17:35:57,859 [myid:] - INFO
> [QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:Leader@436] - LEADING - LEADER
> ELECTION TOOK - 17
> 2013-05-07 17:35:57,863 [myid:] - INFO
> [QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:FileTxnSnapLog@270] - Snapshotting:
> 0x0 to
> /home/phunt/dev/zookeeper-trunk/build/test/tmp/test3690487600947307322.junit.dir/version-2/snapshot.0
> 2013-05-07 17:35:57,873 [myid:] - INFO
> [LearnerHandler-/127.0.0.1:34262:LearnerHandler@269] - Follower sid: 0 : info
> : 0.0.0.0:11222:11223:participant;0.0.0.0:11221
> 2013-05-07 17:35:57,878 [myid:] - INFO
> [LearnerHandler-/127.0.0.1:34262:LearnerHandler@328] - Synchronizing with
> Follower sid: 0 maxCommittedLog=0x0 minCommittedLog=0x0 peerLastZxid=0x0
> 2013-05-07 17:35:57,878 [myid:] - DEBUG
> [LearnerHandler-/127.0.0.1:34262:LearnerHandler@395] - committedLog is empty
> but leader and follower are in sync, zxid=0x0
> 2013-05-07 17:35:57,878 [myid:] - INFO
> [LearnerHandler-/127.0.0.1:34262:LearnerHandler@404] - Sending DIFF
> 2013-05-07 17:35:57,879 [myid:] - DEBUG
> [LearnerHandler-/127.0.0.1:34262:LearnerHandler@411] - Sending NEWLEADER
> message to 0
> 2013-05-07 17:35:57,880 [myid:] - INFO
> [QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:Learner@331] - Getting a diff from
> the leader 0x0
> 2013-05-07 17:35:57,885 [myid:] - INFO
> [QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:Learner@457] - Learner received
> NEWLEADER message
> 2013-05-07 17:35:57,885 [myid:] - INFO
> [QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:Learner@460] - NEWLEADER calling
> configfromstring
> 2013-05-07 17:35:57,885 [myid:] - INFO
> [QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:Learner@462] - NEWLEADER setting
> quorum verifier
> 2013-05-07 17:35:57,886 [myid:] - WARN
> [QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:QuorumPeer@1218] -
> setLastSeenQuorumVerifier called with stale config 0. Current version: 0
> 2013-05-07 17:36:01,880 [myid:] - INFO
> [QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:Leader@585] - Shutting down
> 2013-05-07 17:36:01,881 [myid:] - INFO
> [QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:Leader@591] - Shutdown called
> java.lang.Exception: shutdown Leader! reason: Waiting for a quorum of
> followers, only synced with sids: [ [1] ]
> at org.apache.zookeeper.server.quorum.Leader.shutdown(Leader.java:591)
> at org.apache.zookeeper.server.quorum.Leader.lead(Leader.java:487)
> at
> org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:949)
> 2013-05-07 17:36:01,881 [myid:] - INFO
> [QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:ZooKeeperServer@398] - shutting down
> 2013-05-07 17:36:01,881 [myid:] - INFO
> [LearnerCnxAcceptor-0.0.0.0/0.0.0.0:11225:Leader$LearnerCnxAcceptor@398] -
> exception while shutting down acceptor: java.net.SocketException: Socket
> closed
> 2013-05-07 17:36:01,882 [myid:] - WARN
> [QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:QuorumPeer@979] - PeerState set to
> LOOKING
> 2013-05-07 17:36:01,882 [myid:] - INFO
> [QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:QuorumPeer@863] - LOOKING
> 2013-05-07 17:36:01,883 [myid:] - DEBUG
> [QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:QuorumPeer@792] - Initializing
> leader election protocol...
> {noformat}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira