[jira] [Commented] (HDFS-1312) Re-balance disks within a Datanode

Arpit Agarwal (JIRA) Thu, 23 Jun 2016 23:26:50 -0700

    [ 
https://issues.apache.org/jira/browse/HDFS-1312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15347829#comment-15347829
 ]


Arpit Agarwal commented on HDFS-1312:
-------------------------------------

The checkstyle failures were 'hides a field' and one long method which was not 
added by this patch.

I've merged the HDFS-1312 feature branch to trunk. Thanks for the code 
contribution [~anu], [~xiaobingo], [~eddyxu] and [~linyiqun]. Thanks to 
everyone else who contributed ideas and feedback on this historical jira. :) 
Users frequently request this feature and it felt good to commit it. 

Anu or I will resolve this Jira shortly and move out the remaining sub-tasks to 
a follow-up Jira.

> Re-balance disks within a Datanode
> ----------------------------------
>
>                 Key: HDFS-1312
>                 URL: https://issues.apache.org/jira/browse/HDFS-1312
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>          Components: datanode
>            Reporter: Travis Crawford
>            Assignee: Anu Engineer
>         Attachments: Architecture_and_test_update.pdf, 
> Architecture_and_testplan.pdf, HDFS-1312.001.patch, HDFS-1312.002.patch, 
> HDFS-1312.003.patch, HDFS-1312.004.patch, HDFS-1312.005.patch, 
> HDFS-1312.006.patch, HDFS-1312.007.patch, disk-balancer-proposal.pdf
>
>
> Filing this issue in response to ``full disk woes`` on hdfs-user.
> Datanodes fill their storage directories unevenly, leading to situations 
> where certain disks are full while others are significantly less used. Users 
> at many different sites have experienced this issue, and HDFS administrators 
> are taking steps like:
> - Manually rebalancing blocks in storage directories
> - Decomissioning nodes & later readding them
> There's a tradeoff between making use of all available spindles, and filling 
> disks at the sameish rate. Possible solutions include:
> - Weighting less-used disks heavier when placing new blocks on the datanode. 
> In write-heavy environments this will still make use of all spindles, 
> equalizing disk use over time.
> - Rebalancing blocks locally. This would help equalize disk use as disks are 
> added/replaced in older cluster nodes.
> Datanodes should actively manage their local disk so operator intervention is 
> not needed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (HDFS-1312) Re-balance disks within a Datanode

Reply via email to