Eric Payne created HDFS-7224:
--------------------------------

             Summary: Allow reuse of NN connections via webhdfs
                 Key: HDFS-7224
                 URL: https://issues.apache.org/jira/browse/HDFS-7224
             Project: Hadoop HDFS
          Issue Type: Bug
          Components: webhdfs
    Affects Versions: 2.5.0
            Reporter: Eric Payne
            Assignee: Eric Payne


In very large clusters, the webhdfs client could get bind exceptions because it 
runs out of ephemeral
ports. This could happen when using webhdfs to talk to the NN in order to do 
list globbing of a
huge amount of files.

WebHdfsFileSystem#jsonParse gets the input/error stream from the connection,
but never closes the stream. Since it's not closed, the JVM thinks the stream 
may still
be transferring data, so the next time through this code, it has to get a new 
connection
rather than reusing an existing one. 

The lack of connection reuse has poor latency and adds too much overhead to the 
NN.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to