[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13002357#comment-13002357
 ] 

Patrick Hunt commented on ZOOKEEPER-1006:
-----------------------------------------

here's an example of recent failure on ubuntu:

{noformat}

2011-03-03 16:25:53,599 - INFO  [main:CnxManagerTest@309] - Round 0, restarting 
peer 1
2011-03-03 16:25:53,600 - INFO  [main:FileSnap@82] - Reading snapshot 
/home/phunt/zookeeper/build/test/tmp/test4354134956302701428.junit.dir/version-2/snapshot.0
2011-03-03 16:25:53,601 - INFO  [Thread-96:QuorumCnxManager$Listener@473] - My 
election bind port: 11255
2011-03-03 16:25:53,602 - INFO  
[QuorumPeer:/0:0:0:0:0:0:0:0:11254:QuorumPeer@621] - LOOKING
2011-03-03 16:25:53,603 - INFO  
[QuorumPeer:/0:0:0:0:0:0:0:0:11254:FastLeaderElection@663] - New election. My 
id =  1, Proposed zxid = 0
2011-03-03 16:25:53,637 - INFO  [WorkerReceiver Thread:FastLeaderElection@496] 
- Notification: 1 (n.leader), 0 (n.zxid), 1 (n.round), LOOKING (n.state), 1 
(n.sid), FOLLOWING (my state)
2011-03-03 16:25:53,637 - INFO  [WorkerReceiver Thread:FastLeaderElection@496] 
- Notification: 0 (n.leader), 4294967296 (n.zxid), 1 (n.round), LOOKING 
(n.state), 0 (n.sid), LOOKING (my state)
2011-03-03 16:25:53,638 - INFO  
[QuorumPeer:/0:0:0:0:0:0:0:0:11254:FastLeaderElection@721] - Updating proposal
2011-03-03 16:25:53,638 - INFO  [WorkerReceiver Thread:FastLeaderElection@496] 
- Notification: 2 (n.leader), 0 (n.zxid), 1 (n.round), FOLLOWING (n.state), 0 
(n.sid), LOOKING (my state)
2011-03-03 16:25:53,638 - INFO  [WorkerReceiver Thread:FastLeaderElection@496] 
- Notification: 1 (n.leader), 0 (n.zxid), 1 (n.round), LOOKING (n.state), 1 
(n.sid), LOOKING (my state)
2011-03-03 16:25:53,654 - INFO  [WorkerSender Thread:QuorumCnxManager@183] - 
Have smaller server identifier, so dropping the connection: (2, 1)
2011-03-03 16:25:53,654 - INFO  [WorkerReceiver Thread:FastLeaderElection@496] 
- Notification: 0 (n.leader), 4294967296 (n.zxid), 1 (n.round), LOOKING 
(n.state), 1 (n.sid), FOLLOWING (my state)
2011-03-03 16:25:53,654 - INFO  [WorkerReceiver Thread:FastLeaderElection@496] 
- Notification: 0 (n.leader), 4294967296 (n.zxid), 1 (n.round), LOOKING 
(n.state), 1 (n.sid), LOOKING (my state)
2011-03-03 16:25:53,655 - INFO  [WorkerReceiver Thread:FastLeaderElection@496] 
- Notification: 2 (n.leader), 0 (n.zxid), 1 (n.round), FOLLOWING (n.state), 0 
(n.sid), LOOKING (my state)
2011-03-03 16:25:53,657 - INFO  [WorkerSender Thread:QuorumCnxManager@183] - 
Have smaller server identifier, so dropping the connection: (2, 1)
2011-03-03 16:25:53,657 - WARN  [Thread-102:QuorumCnxManager$RecvWorker@727] - 
Connection broken for id 1, my id = 2, error = 
java.nio.channels.AsynchronousCloseException
2011-03-03 16:25:53,658 - WARN  [Thread-102:QuorumCnxManager$RecvWorker@730] - 
Interrupting SendWorker
2011-03-03 16:25:53,687 - WARN  [Thread-103:QuorumCnxManager$SendWorker@638] - 
Exception when using channel: for id 2 my id = 1 error = java.io.IOException: 
Broken pipe
2011-03-03 16:25:53,687 - WARN  [Thread-104:QuorumCnxManager$RecvWorker@727] - 
Connection broken for id 2, my id = 1, error = 
java.nio.channels.ClosedChannelException
2011-03-03 16:25:53,687 - WARN  [Thread-104:QuorumCnxManager$RecvWorker@730] - 
Interrupting SendWorker
2011-03-03 16:25:53,688 - ERROR [Thread-101:QuorumCnxManager$SendWorker@611] - 
Failed to send last message. Shutting down thread.
java.nio.channels.ClosedChannelException
        at 
sun.nio.ch.SocketChannelImpl.ensureWriteOpen(SocketChannelImpl.java:126)
        at sun.nio.ch.SocketChannelImpl.write(SocketChannelImpl.java:324)
        at 
org.apache.zookeeper.server.quorum.QuorumCnxManager$SendWorker.send(QuorumCnxManager.java:599)
        at 
org.apache.zookeeper.server.quorum.QuorumCnxManager$SendWorker.run(QuorumCnxManager.java:609)
2011-03-03 16:25:53,688 - WARN  [Thread-101:QuorumCnxManager$SendWorker@642] - 
Send worker leaving thread
2011-03-03 16:25:53,688 - WARN  [Thread-103:QuorumCnxManager$SendWorker@642] - 
Send worker leaving thread
2011-03-03 16:25:53,855 - INFO  
[QuorumPeer:/0:0:0:0:0:0:0:0:11254:FastLeaderElection@697] - Notification time 
out: 400
2011-03-03 16:25:53,855 - INFO  [WorkerReceiver Thread:FastLeaderElection@496] 
- Notification: 0 (n.leader), 4294967296 (n.zxid), 1 (n.round), LOOKING 
(n.state), 1 (n.sid), LOOKING (my state)
2011-03-03 16:25:53,856 - INFO  [WorkerReceiver Thread:FastLeaderElection@496] 
- Notification: 0 (n.leader), 4294967296 (n.zxid), 1 (n.round), LOOKING 
(n.state), 1 (n.sid), FOLLOWING (my state)
2011-03-03 16:25:53,856 - INFO  [WorkerReceiver Thread:FastLeaderElection@496] 
- Notification: 2 (n.leader), 0 (n.zxid), 1 (n.round), FOLLOWING (n.state), 0 
(n.sid), LOOKING (my state)
2011-03-03 16:25:54,256 - INFO  
[QuorumPeer:/0:0:0:0:0:0:0:0:11254:FastLeaderElection@697] - Notification time 
out: 800
2011-03-03 16:25:55,057 - INFO  
[QuorumPeer:/0:0:0:0:0:0:0:0:11254:FastLeaderElection@697] - Notification time 
out: 1600
2011-03-03 16:25:56,657 - INFO  
[QuorumPeer:/0:0:0:0:0:0:0:0:11254:FastLeaderElection@697] - Notification time 
out: 3200
------------- ---------------- ---------------
FAILURE
{noformat}

> QuorumPeer "Address already in use" -- regression in 3.3.3
> ----------------------------------------------------------
>
>                 Key: ZOOKEEPER-1006
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1006
>             Project: ZooKeeper
>          Issue Type: Bug
>    Affects Versions: 3.3.3
>            Reporter: Patrick Hunt
>            Priority: Blocker
>         Attachments: TEST-org.apache.zookeeper.test.CnxManagerTest.txt, 
> ZOOKEEPER-1006.patch, workerthreads_badtest.txt
>
>
> CnxManagerTest.testWorkerThreads 
> See attachment, this is the first time I've seen this test fail, and it's 
> failed 2 out of the last three test runs.
> Notice (attachment) once this happens the port never becomes available.
> {noformat}
> 2011-03-02 15:53:12,425 - INFO  
> [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11245:NIOServerCnxn$Factory@251] - 
> Accepted socket connection from /172.29.6.162:51441
> 2011-03-02 15:53:12,430 - WARN  
> [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11245:NIOServerCnxn@639] - Exception 
> causing close of session 0x0 due to java.io.IOException: ZooKeeperServer not 
> running
> 2011-03-02 15:53:12,430 - INFO  
> [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11245:NIOServerCnxn@1435] - Closed 
> socket connection for client /172.29.6.162:51441 (no session established for 
> client)
> 2011-03-02 15:53:12,430 - WARN  
> [QuorumPeer:/0:0:0:0:0:0:0:0:11241:Follower@82] - Exception when following 
> the leader
> java.io.EOFException
>       at java.io.DataInputStream.readInt(DataInputStream.java:375)
>       at 
> org.apache.jute.BinaryInputArchive.readInt(BinaryInputArchive.java:63)
>       at 
> org.apache.zookeeper.server.quorum.QuorumPacket.deserialize(QuorumPacket.java:84)
>       at 
> org.apache.jute.BinaryInputArchive.readRecord(BinaryInputArchive.java:108)
>       at 
> org.apache.zookeeper.server.quorum.Learner.readPacket(Learner.java:148)
>       at 
> org.apache.zookeeper.server.quorum.Learner.registerWithLeader(Learner.java:267)
>       at 
> org.apache.zookeeper.server.quorum.Follower.followLeader(Follower.java:66)
>       at 
> org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:645)
> 2011-03-02 15:53:12,431 - INFO  
> [QuorumPeer:/0:0:0:0:0:0:0:0:11241:Follower@165] - shutdown called
> java.lang.Exception: shutdown Follower
>       at 
> org.apache.zookeeper.server.quorum.Follower.shutdown(Follower.java:165)
>       at 
> org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:649)
> 2011-03-02 15:53:12,432 - INFO  
> [QuorumPeer:/0:0:0:0:0:0:0:0:11241:QuorumPeer@621] - LOOKING
> 2011-03-02 15:53:12,432 - INFO  
> [QuorumPeer:/0:0:0:0:0:0:0:0:11241:FastLeaderElection@663] - New election. My 
> id =  0, Proposed zxid = 0
> 2011-03-02 15:53:12,433 - INFO  [WorkerReceiver 
> Thread:FastLeaderElection@496] - Notification: 0 (n.leader), 0 (n.zxid), 2 
> (n.round), LOOKING (n.state), 0 (n.sid), LOOKING (my state)
> 2011-03-02 15:53:12,433 - INFO  [WorkerReceiver 
> Thread:FastLeaderElection@496] - Notification: 0 (n.leader), 0 (n.zxid), 2 
> (n.round), LOOKING (n.state), 0 (n.sid), LOOKING (my state)
> 2011-03-02 15:53:12,433 - INFO  [WorkerReceiver 
> Thread:FastLeaderElection@496] - Notification: 0 (n.leader), 0 (n.zxid), 2 
> (n.round), LOOKING (n.state), 0 (n.sid), LOOKING (my state)
> 2011-03-02 15:53:12,633 - INFO  [WorkerReceiver 
> Thread:FastLeaderElection@496] - Notification: 0 (n.leader), 0 (n.zxid), 2 
> (n.round), LOOKING (n.state), 0 (n.sid), LOOKING (my state)
> 2011-03-02 15:53:12,633 - INFO  
> [QuorumPeer:/0:0:0:0:0:0:0:0:11245:QuorumPeer@655] - LEADING
> 2011-03-02 15:53:12,636 - INFO  [QuorumPeer:/0:0:0:0:0:0:0:0:11245:Leader@54] 
> - TCP NoDelay set to: true
> 2011-03-02 15:53:12,638 - INFO  
> [QuorumPeer:/0:0:0:0:0:0:0:0:11245:ZooKeeperServer@151] - Created server with 
> tickTime 1000 minSessionTimeout 2000 maxSessionTimeout 20000 datadir 
> /var/lib/hudson/workspace/CDH3-ZooKeeper-3.3.3_sles/build/test/tmp/test9001250572426375869.junit.dir/version-2
>  snapdir 
> /var/lib/hudson/workspace/CDH3-ZooKeeper-3.3.3_sles/build/test/tmp/test9001250572426375869.junit.dir/version-2
> 2011-03-02 15:53:12,639 - ERROR 
> [QuorumPeer:/0:0:0:0:0:0:0:0:11245:Leader@133] - Couldn't bind to port 11245
> java.net.BindException: Address already in use
>       at java.net.PlainSocketImpl.socketBind(Native Method)
>       at java.net.PlainSocketImpl.bind(PlainSocketImpl.java:365)
>       at java.net.ServerSocket.bind(ServerSocket.java:319)
>       at java.net.ServerSocket.<init>(ServerSocket.java:185)
>       at java.net.ServerSocket.<init>(ServerSocket.java:97)
>       at org.apache.zookeeper.server.quorum.Leader.<init>(Leader.java:131)
>       at 
> org.apache.zookeeper.server.quorum.QuorumPeer.makeLeader(QuorumPeer.java:512)
>       at 
> org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:657)
> {noformat}

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to