Hello everyone, I am using ZooKeeper 3.5.1-alpha and I see a problem with a 2-node setup.

Node 1 ZooKeeper logs:

2020-01-11 11:29:52,141 [myid:2147483653] - INFO [QuorumPeerListener:QuorumCnxManager$Listener@631] - My election bind port: 0.0.0.0/0.0.0.0:61898
2020-01-11 11:29:52,149 [myid:2147483653] - ERROR [WorkerSender[myid=2147483653]:NIOServerCnxnFactory$1@92] - Thread Thread[WorkerSender[myid=2147483653],5,main] died
java.lang.NullPointerException
    at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(Unknown Source)
    at java.util.concurrent.LinkedBlockingQueue.poll(Unknown Source)
    at org.apache.zookeeper.server.quorum.FastLeaderElection$Messenger$WorkerSender.run(FastLeaderElection.java:462)
    at java.lang.Thread.run(Unknown Source)
2020-01-11 11:29:52,161 [myid:2147483653] - INFO [QuorumPeer[myid=2147483653](plain=/0:0:0:0:0:0:0:0:61896)(secure=disabled):QuorumPeer@986] - LOOKING

Node 2 ZooKeeper logs:

2020-01-11 11:29:51,852 [myid:2147483652] - WARN [WorkerSender[myid=2147483652]:QuorumCnxManager@459] - Cannot open channel to 2147483653 at election address /128.0.0.5:61898
java.net.ConnectException: Connection refused
    at java.net.PlainSocketImpl.socketConnect(Native Method)
    at java.net.AbstractPlainSocketImpl.doConnect(Unknown Source)
    at java.net.AbstractPlainSocketImpl.connectToAddress(Unknown Source)
    at java.net.AbstractPlainSocketImpl.connect(Unknown Source)
    at java.net.SocksSocketImpl.connect(Unknown Source)
    at java.net.Socket.connect(Unknown Source)
    at org.apache.zookeeper.server.quorum.QuorumCnxManager.connectOne(QuorumCnxManager.java:444)
    at org.apache.zookeeper.server.quorum.QuorumCnxManager.connectOne(QuorumCnxManager.java:485)
    at org.apache.zookeeper.server.quorum.QuorumCnxManager.toSend(QuorumCnxManager.java:421)
    at org.apache.zookeeper.server.quorum.FastLeaderElection$Messenger$WorkerSender.process(FastLeaderElection.java:486)
    at org.apache.zookeeper.server.quorum.FastLeaderElection$Messenger$WorkerSender.run(FastLeaderElection.java:465)
    at java.lang.Thread.run(Unknown Source)

The ZooKeeper servers on the nodes never recover from this state, and clients are unable to connect to them.

Any hint on what the problem is, based on the stack trace in the Node 1 logs? Is this a ZooKeeper server code issue or a setup issue?

Thanks,
Pramod