Pengwei created KAFKA-5014:
------------------------------

             Summary: SSL Channel not ready but tcp is established and the 
server is hung will not sending metadata
                 Key: KAFKA-5014
                 URL: https://issues.apache.org/jira/browse/KAFKA-5014
             Project: Kafka
          Issue Type: Bug
    Affects Versions: 0.10.2.0, 0.9.0.1
            Reporter: Pengwei
            Priority: Minor


In our test env, QA hang one of the connecting broker of the producer, then the 
producer will be stuck in send method, and throw the exception: fail to update 
metadata after request timeout.

I found the reason as follow:  when the producer chose one of the broker to 
send metadata, it connect to the broker, but the broker is hang, the tcp is 
connected and Network client marks this broker is connected, but the SSL 
channel is not ready yet so the channel is not ready.    

Then the Network client chooses the connected node in the leastLoadedNode every 
time to send the metadata, but the node's channel is not ready yet.  

So the producer stuck in getting metadata and will not try another node to 
request metadata.  The client should not stuck only one node is hung



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to