[
https://issues.apache.org/jira/browse/HDFS-9574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15088438#comment-15088438
]
Kihwal Lee commented on HDFS-9574:
----------------------------------
The new patch addresses the review comments.
- All relevant DataTransfer methods now call {{checkAccess()}}, and the
registration is checked there.
- The elapsed time is now tracked using {{StopWatch}}.
- {{getReplicaVisibleLength()}} now throws {{RetriableException}}.
- {{DFSInputStream}} retries those nodes that threw {{RetriableException}} on
{{getReplicaVisibleLength()}}, up to a limit. The client read timeout is used
as the retry timeout.
- The test case was expanded to cover the {{getReplicaVisibleLength()}} case.
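
For illustration only, here is a minimal, self-contained sketch of the client-side retry described in the list above. It is not the code from the patch. The class {{VisibleLengthRetrySketch}}, the {{LengthSource}} interface, the nested {{RetriableException}} stand-in, and the {{timeoutMs}} and {{sleepMs}} parameters are all hypothetical names. The sketch also assumes that the "limit" is purely time-based, with the client read timeout passed in as {{timeoutMs}}.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.TimeUnit;

final class VisibleLengthRetrySketch {

    /** Hypothetical stand-in for Hadoop's RetriableException (assumption, not the real class). */
    static class RetriableException extends IOException {
        RetriableException(String msg) { super(msg); }
    }

    /** Hypothetical stand-in for one datanode's getReplicaVisibleLength() call. */
    interface LengthSource {
        long getReplicaVisibleLength() throws IOException;
    }

    /**
     * Asks each node in turn. Nodes that throw RetriableException (for example,
     * a datanode that has not registered yet) are asked again on the next pass;
     * nodes that fail with any other IOException are dropped. Gives up once
     * timeoutMs has elapsed, which is assumed to be the client read timeout.
     */
    static long readVisibleLength(List<LengthSource> nodes, long timeoutMs, long sleepMs)
            throws IOException, InterruptedException {
        long start = System.nanoTime();
        List<LengthSource> pending = new ArrayList<>(nodes);
        while (true) {
            List<LengthSource> retry = new ArrayList<>();
            for (LengthSource node : pending) {
                try {
                    return node.getReplicaVisibleLength();
                } catch (RetriableException e) {
                    retry.add(node);   // worth asking again later
                } catch (IOException e) {
                    // non-retriable failure: do not ask this node again
                }
            }
            if (retry.isEmpty()) {
                throw new IOException("No datanode could report the visible length");
            }
            long elapsedMs = TimeUnit.NANOSECONDS.toMillis(System.nanoTime() - start);
            if (elapsedMs >= timeoutMs) {
                throw new IOException("Timed out after " + elapsedMs + " ms waiting for datanodes");
            }
            // Back off briefly, but never sleep past the deadline.
            Thread.sleep(Math.min(sleepMs, timeoutMs - elapsedMs));
            pending = retry;
        }
    }
}
```

With a source that throws the retriable exception twice and then returns a length, {{readVisibleLength()}} returns that length on the third attempt. A source that keeps throwing it produces an {{IOException}} once the timeout has passed.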
> Reduce client failures during datanode restart
> ----------------------------------------------
>
> Key: HDFS-9574
> URL: https://issues.apache.org/jira/browse/HDFS-9574
> Project: Hadoop HDFS
> Issue Type: Bug
> Reporter: Kihwal Lee
> Assignee: Kihwal Lee
> Attachments: HDFS-9574.patch, HDFS-9574.v2.patch, HDFS-9574.v3.patch
>
>
> Since DataXceiverServer is initialized before BP is fully up, client requests
> will fail until the datanode registers.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)