[ https://issues.apache.org/jira/browse/HDFS-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12970565#action_12970565 ]
Eli Collins commented on HDFS-1362:
-----------------------------------
Hey Wang,
Is the high-level use case replacing a failed disk with a new one without
restarting the datanode (so you don't need to fail existing operations on that
datanode)? Is this a feature that's needed if you've configured the datanode
to tolerate multiple volume failures (HDFS-457)?
Thanks,
Eli
> Provide volume management functionality for DataNode
> ----------------------------------------------------
>
> Key: HDFS-1362
> URL: https://issues.apache.org/jira/browse/HDFS-1362
> Project: Hadoop HDFS
> Issue Type: New Feature
> Components: data-node
> Reporter: Wang Xu
> Assignee: Wang Xu
> Attachments: HDFS-1362.4_w7001.txt, HDFS-1362.txt,
> Provide_volume_management_for_DN_v1.pdf
>
>
> The current management unit in Hadoop is a node, i.e. if a node fails, it
> is kicked out and all the data on the node is re-replicated.
> As almost all SATA controllers support hotplug, we add a new command line
> interface to the datanode so that it can list, add or remove a volume online,
> which means we can change a disk without decommissioning the node. Moreover,
> if the failed disk is still readable and the node has enough space, it can
> migrate the data on that disk to other disks in the same node.
> A more detailed design document will be attached.
> The original version in our lab is implemented against the 0.20 datanode
> directly. Would it be better to implement it in contrib? Or is there any
> other suggestion?
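To make the proposed behaviour concrete, here is a minimal in-memory sketch of the list/add/remove operations and the "migrate if still readable and there is enough space" rule from the description. It is only an illustration under stated assumptions, not the attached patch: the class name VolumeManagerSketch, its methods, and the byte-count model of a volume are all hypothetical, and no real datanode or block I/O is involved.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical model of online volume management; not the HDFS-1362 patch. */
class VolumeManagerSketch {

    /** One data directory, tracked only by capacity and bytes used. */
    static final class Volume {
        final long capacity;
        long used;

        Volume(long capacity, long used) {
            this.capacity = capacity;
            this.used = used;
        }

        long free() {
            return capacity - used;
        }
    }

    // Insertion-ordered so that list() is stable.
    private final Map<String, Volume> volumes = new LinkedHashMap<>();

    /** List the paths of the volumes currently in service. */
    synchronized List<String> list() {
        return new ArrayList<>(volumes.keySet());
    }

    /** Add a volume while the node keeps running. */
    synchronized void add(String path, long capacity, long used) {
        if (volumes.containsKey(path)) {
            throw new IllegalArgumentException("volume already present: " + path);
        }
        if (used < 0 || used > capacity) {
            throw new IllegalArgumentException("invalid usage for: " + path);
        }
        volumes.put(path, new Volume(capacity, used));
    }

    /** Bytes used on a volume (assumed accessor, for inspection only). */
    synchronized long used(String path) {
        Volume v = volumes.get(path);
        if (v == null) {
            throw new IllegalArgumentException("no such volume: " + path);
        }
        return v.used;
    }

    /**
     * Remove a volume online. If the disk is still readable and the other
     * volumes have enough free space, its data is moved to them first.
     *
     * @return true if the data stayed on this node, false if it must be
     *         recovered from replicas on other nodes
     */
    synchronized boolean remove(String path, boolean readable) {
        Volume victim = volumes.get(path);
        if (victim == null) {
            throw new IllegalArgumentException("no such volume: " + path);
        }
        long freeElsewhere = 0;
        for (Map.Entry<String, Volume> e : volumes.entrySet()) {
            if (!e.getKey().equals(path)) {
                freeElsewhere += e.getValue().free();
            }
        }
        boolean migrate = readable && freeElsewhere >= victim.used;
        if (migrate) {
            long remaining = victim.used;
            for (Map.Entry<String, Volume> e : volumes.entrySet()) {
                if (e.getKey().equals(path)) {
                    continue;
                }
                Volume target = e.getValue();
                long moved = Math.min(remaining, target.free());
                target.used += moved;
                remaining -= moved;
            }
        }
        volumes.remove(path);
        return migrate;
    }
}
```

In this sketch, removing an unreadable disk, or one whose data does not fit on the remaining disks, simply drops the volume and returns false; a real datanode would then rely on the namenode to re-replicate the lost blocks from other nodes.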