[ https://issues.apache.org/jira/browse/ZOOKEEPER-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14243864#comment-14243864 ]
Rakesh R commented on ZOOKEEPER-2091: ------------------------------------- bq.I am confused how it reads the returning result without finishing sending the entire packet? As I remember the situation happened in one of the cluster due to the heavy system resource usage like cpu/memory was high(due to map reduce job execution). It was continuously hitting the below exception and all the operations was failing. {code} org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1127) java.io.IOException: Nothing in the queue, but got 425467 at org.apache.zookeeper.ClientCnxn$SendThread.readResponse(ClientCnxn.java:788) {code} Now with the patch, the cluster is running and no issue reported till now. > Possible logic error in ClientCnxnSocketNIO > ------------------------------------------- > > Key: ZOOKEEPER-2091 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-2091 > Project: ZooKeeper > Issue Type: Bug > Components: java client > Affects Versions: 3.4.6 > Reporter: Cheng > Assignee: Rakesh R > Fix For: 3.5.1 > > Attachments: ZOOKEEPER-2091.patch > > > When SASL authentication is enabled, the ZooKeeper client will finally call > ClientCnxnSocketNIO#sendPacket(Packet p) to send a packet to server: > @Override > void sendPacket(Packet p) throws IOException { > SocketChannel sock = (SocketChannel) sockKey.channel(); > if (sock == null) { > throw new IOException("Socket is null!"); > } > p.createBB(); > ByteBuffer pbb = p.bb; > sock.write(pbb); > } > One problem I can see is that the sock is non-blocking, so when the sock's > output buffer is full(theoretically), only part of the Packet is sent out and > the communication will break. -- This message was sent by Atlassian JIRA (v6.3.4#6332)