[ 
https://issues.apache.org/jira/browse/HDFS-10781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

James Clampffer updated HDFS-10781:
-----------------------------------
    Parent Issue: HDFS-14032  (was: HDFS-8707)

> libhdfs++: redefine NN timeout to be "time without a response"
> --------------------------------------------------------------
>
>                 Key: HDFS-10781
>                 URL: https://issues.apache.org/jira/browse/HDFS-10781
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: hdfs-client
>            Reporter: Bob Hansen
>            Priority: Major
>
> In the find tool, we submit a zillion requests to the NameNode 
> asynchronously.  As the queue on the NameNode grows, the response time for 
> each individual message will increase.  In the find tool, we were eventually 
> getting timeouts on requests, even though the NN was responding as fast as 
> its little feet could carry it.
> I propose that we redefine timeouts to be on a per-connection basis 
> rather than per-request.  If a client has an outstanding request to the NN 
> but hasn't gotten any response back within n msec, it should declare the 
> connection dead and retry.  As long as the NameNode is being responsive to 
> the best of its ability and providing data, we will not declare the link dead.
> One potential violation of the Principle of Least Astonishment here is that 
> any particular request from a client cannot be depended on to get a positive 
> or negative response within a fixed amount of time, but I think that may be 
> a good trade to make.
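
The per-connection rule proposed above could be sketched roughly as follows. This is only an illustration under stated assumptions, not libhdfs++ code: the class name ConnectionIdleTimer and its methods are hypothetical, and time points are passed in by the caller so the logic can be exercised without real timers.

```cpp
#include <chrono>
#include <cstddef>

// Hypothetical helper (not part of libhdfs++): tracks "time without a
// response" for one NameNode connection instead of one deadline per request.
class ConnectionIdleTimer {
public:
  using Clock = std::chrono::steady_clock;
  using TimePoint = Clock::time_point;
  using Duration = Clock::duration;

  explicit ConnectionIdleTimer(Duration limit) : limit_(limit) {}

  // Call when a request is written to the connection.
  void OnRequestSent(TimePoint now) {
    // The idle window starts only when the connection goes from having
    // nothing outstanding to having something outstanding.
    if (outstanding_ == 0) last_activity_ = now;
    ++outstanding_;
  }

  // Call when any response arrives; every response resets the idle window.
  void OnResponseReceived(TimePoint now) {
    if (outstanding_ > 0) --outstanding_;
    last_activity_ = now;
  }

  // The connection is considered dead only if requests are outstanding and
  // no response at all has arrived for at least `limit_`.
  bool IsDead(TimePoint now) const {
    return outstanding_ > 0 && (now - last_activity_) >= limit_;
  }

  std::size_t outstanding() const { return outstanding_; }

private:
  Duration limit_;
  TimePoint last_activity_{};
  std::size_t outstanding_ = 0;
};
```

With this shape, a request that has been queued on the NameNode for longer than the limit does not trip the timeout as long as other responses keep arriving on the same connection, which is the trade-off the description accepts. What "retry" means once IsDead() returns true (reconnect, resubmit all outstanding requests, and so on) is left open here.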


