[ 
https://issues.apache.org/jira/browse/HDFS-8041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14392663#comment-14392663
 ] 

Kihwal Lee edited comment on HDFS-8041 at 4/2/15 12:47 PM:
-----------------------------------------------------------

bq. -1 findbugs. The patch appears to introduce 1 new Findbugs (version 2.0.3) 
warnings.
I left synchronized for the test method. Thanks, Mr. Findbugs for catching this.


was (Author: kihwal):
bq. -1 findbugs. The patch appears to introduce 1 new Findbugs (version 2.0.3) 
warnings.
I left the synchronized for the test method. Thanks, Mr. Findbugs for catching 
this.

> Consider remaining space during block blockplacement if dfs space is highly 
> utilized
> ------------------------------------------------------------------------------------
>
>                 Key: HDFS-8041
>                 URL: https://issues.apache.org/jira/browse/HDFS-8041
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Kihwal Lee
>            Assignee: Kihwal Lee
>         Attachments: HDFS-8041.v1.patch, HDFS-8041.v2.patch
>
>
> This feature is helpful in avoiding smaller nodes (i.e. heterogeneous 
> environment) getting constantly being full when the overall space utilization 
> is over a certain threshold.  When the utilization is low, balancer can keep 
> up, but once the average per-node byte goes over the capacity of the smaller 
> nodes, they get full so quickly even after perfect balance.
> This jira proposes an improvement that can be optionally enabled in order to 
> slow down the rate of space usage growth of smaller nodes if the overall 
> storage utilization is over a configured threshold.  It will not replace 
> balancer, rather will help balancer keep up. Also, the primary replica 
> placement will not be affected. Only the replicas typically placed in a 
> remote rack will be subject to this check.
> The appropriate threshold is cluster configuration specific. There is no 
> generally good value to set, thus it is disabled by default. We have seen 
> cases where the threshold of 85% - 90% would help. Figuring when 
> {{totalSpaceUsed / numNodes}} becomes close to the capacity of a smaller node 
> is helpful in determining the threshold.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to