[
https://issues.apache.org/jira/browse/HDFS-457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12934785#action_12934785
]
Eli Collins commented on HDFS-457:
----------------------------------
Hey Uma,
Think you're looking at an old patch, all the accesses of removedVols in
checkDirs are first checked against null in trunk.
Thanks,
Eli
> better handling of volume failure in Data Node storage
> ------------------------------------------------------
>
> Key: HDFS-457
> URL: https://issues.apache.org/jira/browse/HDFS-457
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: data-node
> Reporter: Boris Shkolnik
> Assignee: Boris Shkolnik
> Fix For: 0.21.0
>
> Attachments: HDFS-457-1.patch, HDFS-457-2.patch, HDFS-457-2.patch,
> HDFS-457-2.patch, HDFS-457-3.patch, HDFS-457-y20.patch, HDFS-457.patch,
> HDFS-457_20-append.patch, HDFS_457.patch,
> jira.HDFS-457.branch-0.20-internal.patch, TestFsck.zip
>
>
> Current implementation shuts DataNode down completely when one of the
> configured volumes of the storage fails.
> This is rather wasteful behavior because it decreases utilization (good
> storage becomes unavailable) and imposes extra load on the system
> (replication of the blocks from the good volumes). These problems will become
> even more prominent when we move to mixed (heterogeneous) clusters with many
> more volumes per Data Node.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.