[ 
https://issues.apache.org/jira/browse/HDFS-4420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13589716#comment-13589716
 ] 

Max Lapan commented on HDFS-4420:
---------------------------------

Single point that disturbs me about this is too slow exclude blocks 
enumeration. In our installation with 500TB /hbase, balancing iteration is 
about 15 minutes, 5 minutes took by block move, and 10 minutes by exclude 
blocks list preparation. It would be gread to reduce this somehow, but I'm in 
lack of ideas how to achieve this.
                
> Provide a way to exclude subtree from balancing process
> -------------------------------------------------------
>
>                 Key: HDFS-4420
>                 URL: https://issues.apache.org/jira/browse/HDFS-4420
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>          Components: balancer
>            Reporter: Max Lapan
>            Priority: Minor
>         Attachments: Balancer-exclude-subtree-0.90.2.patch, 
> Balancer-exclude-trunk.patch, Balancer-exclude-trunk-v2.patch, 
> Balancer-exclude-trunk-v3.patch
>
>
> During balancer operation, it balances all blocks, regardless of their 
> filesystem hierarchy. Sometimes, it would be usefull to exclude some subtree 
> from balancing process.
> For example, regionservers data locality is cruical for HBase performance. 
> Region's data is tied to regionservers, which reside on specific machines in 
> cluster. During operation, regionservers reads and writes region's data, and 
> after some time, all this data are reside on local machine, so, all reads 
> become local, which is great for performance. Balancer breaks this locality 
> during opertation by moving blocks around.
> This patch adds [-exclude <path>] switch, and, if path is provided,
> balancer will not move blocks under this path during operation.
> Attached patch have tested for 0.90.2.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to