[
https://issues.apache.org/jira/browse/HDFS-8383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14732320#comment-14732320
]
Li Bo commented on HDFS-8383:
-----------------------------
Thanks [~walter.k.su] for the work! I have just read the code and find some
points to be discussed:
1) When only one streamer fails, do we need to do anything? I think we can
just ignore the failed streamer unless more than 3 streamers are found failed.
The offline decode work will be started by some datanode later.
2) I think it’s not right to set the failed status of streamer in
outputstream due to the asynchronization. I have given some reasons in
HDFS-8704. The outputstream doesn’t need to care about the status of each
streamer if just one or two streamers fail. This will not complicate the logic
of outputstreamer.
3) Not very clear about the error handling. For example, streamer_i fails
to write a packet of block_j, but it succeeds to write block_j+1, could you
give some detailed description about this situation?
> Tolerate multiple failures in DFSStripedOutputStream
> ----------------------------------------------------
>
> Key: HDFS-8383
> URL: https://issues.apache.org/jira/browse/HDFS-8383
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Reporter: Tsz Wo Nicholas Sze
> Assignee: Walter Su
> Attachments: HDFS-8383.00.patch, HDFS-8383.01.patch
>
>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)