Patrick Hunt created ZOOKEEPER-1700:
---------------------------------------
Summary: FLETest consistently failing - setLastSeenQuorumVerifier
seems to be hanging
Key: ZOOKEEPER-1700
URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1700
Project: ZooKeeper
Issue Type: Bug
Components: quorum
Affects Versions: 3.5.0
Reporter: Patrick Hunt
Priority: Critical
Fix For: 3.5.0
I'm consistently seeing a failure on my laptop when running the FLETest
"testJoin" test. What seems to be happening is that the call to
setLastSeenQuorumVerifier is hanging.
See the following log from the test, notice 17:35:57 for the period in
question. Note that I turned on debug logging and added a few log messages
around the call to setLastSeenQuorumVerifier (you can see the code enter but
never leave)
Note: I've applied ZOOKEEPER-1324 to trunk code and then run this test but that
doesn't seem to help. Also note that this test is passing consistently when run
against branch-3.4.
{noformat}
2013-05-07 17:35:57,859 [myid:] - INFO
[QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:Follower@65] - FOLLOWING - LEADER
ELECTION TOOK - 16
2013-05-07 17:35:57,859 [myid:] - INFO
[QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:Leader@436] - LEADING - LEADER
ELECTION TOOK - 17
2013-05-07 17:35:57,863 [myid:] - INFO
[QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:FileTxnSnapLog@270] - Snapshotting:
0x0 to
/home/phunt/dev/zookeeper-trunk/build/test/tmp/test3690487600947307322.junit.dir/version-2/snapshot.0
2013-05-07 17:35:57,873 [myid:] - INFO
[LearnerHandler-/127.0.0.1:34262:LearnerHandler@269] - Follower sid: 0 : info :
0.0.0.0:11222:11223:participant;0.0.0.0:11221
2013-05-07 17:35:57,878 [myid:] - INFO
[LearnerHandler-/127.0.0.1:34262:LearnerHandler@328] - Synchronizing with
Follower sid: 0 maxCommittedLog=0x0 minCommittedLog=0x0 peerLastZxid=0x0
2013-05-07 17:35:57,878 [myid:] - DEBUG
[LearnerHandler-/127.0.0.1:34262:LearnerHandler@395] - committedLog is empty
but leader and follower are in sync, zxid=0x0
2013-05-07 17:35:57,878 [myid:] - INFO
[LearnerHandler-/127.0.0.1:34262:LearnerHandler@404] - Sending DIFF
2013-05-07 17:35:57,879 [myid:] - DEBUG
[LearnerHandler-/127.0.0.1:34262:LearnerHandler@411] - Sending NEWLEADER
message to 0
2013-05-07 17:35:57,880 [myid:] - INFO
[QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:Learner@331] - Getting a diff from
the leader 0x0
2013-05-07 17:35:57,885 [myid:] - INFO
[QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:Learner@457] - Learner received
NEWLEADER message
2013-05-07 17:35:57,885 [myid:] - INFO
[QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:Learner@460] - NEWLEADER calling
configfromstring
2013-05-07 17:35:57,885 [myid:] - INFO
[QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:Learner@462] - NEWLEADER setting
quorum verifier
2013-05-07 17:35:57,886 [myid:] - WARN
[QuorumPeer[myid=0]/0:0:0:0:0:0:0:0:11221:QuorumPeer@1218] -
setLastSeenQuorumVerifier called with stale config 0. Current version: 0
2013-05-07 17:36:01,880 [myid:] - INFO
[QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:Leader@585] - Shutting down
2013-05-07 17:36:01,881 [myid:] - INFO
[QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:Leader@591] - Shutdown called
java.lang.Exception: shutdown Leader! reason: Waiting for a quorum of
followers, only synced with sids: [ [1] ]
at org.apache.zookeeper.server.quorum.Leader.shutdown(Leader.java:591)
at org.apache.zookeeper.server.quorum.Leader.lead(Leader.java:487)
at
org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:949)
2013-05-07 17:36:01,881 [myid:] - INFO
[QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:ZooKeeperServer@398] - shutting down
2013-05-07 17:36:01,881 [myid:] - INFO
[LearnerCnxAcceptor-0.0.0.0/0.0.0.0:11225:Leader$LearnerCnxAcceptor@398] -
exception while shutting down acceptor: java.net.SocketException: Socket closed
2013-05-07 17:36:01,882 [myid:] - WARN
[QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:QuorumPeer@979] - PeerState set to
LOOKING
2013-05-07 17:36:01,882 [myid:] - INFO
[QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:QuorumPeer@863] - LOOKING
2013-05-07 17:36:01,883 [myid:] - DEBUG
[QuorumPeer[myid=1]/0:0:0:0:0:0:0:0:11224:QuorumPeer@792] - Initializing leader
election protocol...
{noformat}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira