[ https://issues.apache.org/jira/browse/HDFS-8041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15564824#comment-15564824 ]
Hadoop QA commented on HDFS-8041:
---------------------------------

| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 0s{color} | {color:blue} Docker mode activated. {color} |
| {color:red}-1{color} | {color:red} patch {color} | {color:red} 0m 4s{color} | {color:red} HDFS-8041 does not apply to trunk. Rebase required? Wrong Branch? See https://wiki.apache.org/hadoop/HowToContribute for help. {color} |
\\
\\
|| Subsystem || Report/Notes ||
| JIRA Issue | HDFS-8041 |
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12724011/HDFS-8041.v4.patch |
| Console output | https://builds.apache.org/job/PreCommit-HDFS-Build/17097/console |
| Powered by | Apache Yetus 0.4.0-SNAPSHOT http://yetus.apache.org |

This message was automatically generated.

> Consider remaining space during block blockplacement if dfs space is highly utilized
> ------------------------------------------------------------------------------------
>
>                 Key: HDFS-8041
>                 URL: https://issues.apache.org/jira/browse/HDFS-8041
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>            Reporter: Kihwal Lee
>            Assignee: Kihwal Lee
>              Labels: BlockPlacementPolicy
>         Attachments: HDFS-8041.v1.patch, HDFS-8041.v2.patch, HDFS-8041.v3.patch, HDFS-8041.v4.patch
>
>
> This feature helps keep smaller nodes (i.e. in a heterogeneous environment)
> from constantly filling up when the overall space utilization is over a
> certain threshold. When the utilization is low, the balancer can keep up, but
> once the average per-node usage in bytes exceeds the capacity of the smaller
> nodes, they fill up quickly even after a perfect balance.
> This JIRA proposes an improvement that can be optionally enabled in order to
> slow down the rate of space usage growth on smaller nodes if the overall
> storage utilization is over a configured threshold. It will not replace the
> balancer; rather, it will help the balancer keep up.
> Also, the primary replica placement will not be affected. Only the replicas
> typically placed in a remote rack will be subject to this check.
> The appropriate threshold is specific to the cluster configuration. There is
> no generally good value to set, so the feature is disabled by default. We
> have seen cases where a threshold of 85% - 90% would help. Figuring out when
> {{totalSpaceUsed / numNodes}} comes close to the capacity of a smaller node
> is helpful in determining the threshold.
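
The {{totalSpaceUsed / numNodes}} rule of thumb from the description can be turned into a rough estimate of where the threshold might sit. The following is only a sketch under stated assumptions: the class {{ThresholdEstimate}}, its method names, and the example capacities (eight 48 TB nodes and two 40 TB nodes) are all hypothetical and are not part of HDFS or of the HDFS-8041 patches. It computes the overall utilization at which the per-node average would equal the capacity of the smallest node.

```java
import java.util.Arrays;

/** Hypothetical helper for illustration; not part of HDFS or the HDFS-8041 patch. */
final class ThresholdEstimate {

    /** Average bytes used per node, i.e. totalSpaceUsed / numNodes. */
    public static double averageUsedPerNode(long totalSpaceUsed, int numNodes) {
        return (double) totalSpaceUsed / numNodes;
    }

    /**
     * Overall utilization (0.0 to 1.0) at which the per-node average
     * equals the capacity of the smallest node. Expects a non-empty array.
     */
    public static double utilizationAtSmallestNodeCapacity(long[] capacities) {
        long smallest = Arrays.stream(capacities).min().getAsLong();
        long total = Arrays.stream(capacities).sum();
        // average == smallest  <=>  totalUsed == smallest * numNodes
        return (double) smallest * capacities.length / total;
    }

    public static void main(String[] args) {
        long tb = 1L << 40;
        // Assumed example cluster: eight 48 TB nodes and two 40 TB nodes.
        long[] capacities = new long[10];
        Arrays.fill(capacities, 0, 8, 48 * tb);
        Arrays.fill(capacities, 8, 10, 40 * tb);

        double u = utilizationAtSmallestNodeCapacity(capacities);
        System.out.printf(
            "Per-node average reaches the smallest node's capacity at %.1f%% utilization%n",
            100 * u);
    }
}
```

For this assumed cluster the estimate comes out at roughly 86%, which happens to fall in the 85% - 90% range mentioned in the description. A real cluster would need its own capacities plugged in, and the result is a starting point for tuning, not a recommended setting.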