[ https://issues.apache.org/jira/browse/HDFS-6441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14005796#comment-14005796 ]
Benoy Antony commented on HDFS-6441:
------------------------------------
Thanks for the pointer, [~carp84]. After reviewing HDFS-6010's requirements, I
have updated the patch to accept either an exclude list of datanodes or an
include list of datanodes. This required only a minor change to the patch.
I have changed the parameter to "-datanodes" and added two flags, "-exclude"
and "-include", to select how the list is interpreted.
The exclude list can be specified as
{panel}
run-balancer.sh -threshold 10 -exclude -datanodes <hosts-file>
{panel}
With this, the datanodes in the hosts-file will not be used while balancing.
The include list can be specified as
{panel}
run-balancer.sh -threshold 10 -include -datanodes <hosts-file>
{panel}
With this, only the datanodes in the hosts-file will be used for balancing.
When there are many hosts, it is better to keep them in a file than to pass
them on the command line.
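For illustration only (this is not code from the patch, which is Java), the intended semantics of the two flags can be sketched in Python. The function names, the one-hostname-per-line file format, and the example hostnames are all assumptions made for the sketch.

```python
from typing import Iterable, List, Set


def parse_hosts(text: str) -> Set[str]:
    """Parse hosts-file content.

    Assumption: one hostname per line; blank lines are ignored.
    """
    hosts = set()
    for line in text.splitlines():
        entry = line.strip()
        if entry:
            hosts.add(entry)
    return hosts


def select_datanodes(all_nodes: Iterable[str], listed: Set[str], mode: str) -> List[str]:
    """Return the datanodes that would take part in balancing.

    mode == "exclude": every node except those in the list.
    mode == "include": only nodes that appear in the list.
    """
    if mode == "exclude":
        return [node for node in all_nodes if node not in listed]
    if mode == "include":
        return [node for node in all_nodes if node in listed]
    raise ValueError("mode must be 'exclude' or 'include'")


if __name__ == "__main__":
    # Hypothetical cluster and hosts-file content.
    cluster = ["dn1.example.com", "dn2.example.com", "dn3.example.com"]
    listed = parse_hosts("dn2.example.com\n")
    print(select_datanodes(cluster, listed, "exclude"))
    print(select_datanodes(cluster, listed, "include"))
```

A host that is listed in the file but not present in the cluster simply has no effect in this sketch; how the actual patch treats such entries is not stated here.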
> Add ability to Ignore few datanodes while balancing
> ---------------------------------------------------
>
> Key: HDFS-6441
> URL: https://issues.apache.org/jira/browse/HDFS-6441
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: balancer
> Affects Versions: 2.4.0
> Reporter: Benoy Antony
> Assignee: Benoy Antony
> Attachments: HDFS-6441.patch
>
>
> In some use cases, it is desirable to ignore a few data nodes while
> balancing. The administrator should be able to specify a list of data nodes
> in a file similar to the hosts file and the balancer should ignore these data
> nodes while balancing so that no blocks are added/removed on these nodes.
--
This message was sent by Atlassian JIRA
(v6.2#6252)