Kihwal Lee created HDFS-8041:
--------------------------------

             Summary: Consider remaining space during block placement if 
dfs space is highly utilized
                 Key: HDFS-8041
                 URL: https://issues.apache.org/jira/browse/HDFS-8041
             Project: Hadoop HDFS
          Issue Type: Improvement
            Reporter: Kihwal Lee


This feature helps prevent smaller nodes (i.e. in a heterogeneous 
environment) from constantly becoming full when the overall space utilization 
is over a certain threshold.  When the utilization is low, the balancer can 
keep up, but once the average per-node usage exceeds the capacity of the 
smaller nodes, they fill up quickly even after a perfect balance.

This jira proposes an improvement that can be optionally enabled in order to 
slow down the rate of space usage growth on smaller nodes when the overall 
storage utilization is over a configured threshold.  It will not replace the 
balancer; rather, it will help the balancer keep up. Also, the primary replica 
placement will not be affected. Only the replicas typically placed in a remote 
rack will be subject to this check.
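
The issue does not spell out the mechanism, so the following Python sketch is 
only one possible reading of the check described above, not the proposed 
patch. It assumes a hypothetical list of (name, capacity, used) tuples for the 
remote-rack candidates and a hypothetical function name. Below the threshold 
the target is picked uniformly at random; above it, the pick is weighted by 
remaining space, so nearly full nodes receive fewer new replicas.

```python
import random

def choose_remote_target(nodes, threshold, rng=random):
    """Pick a remote-rack target from hypothetical (name, capacity, used) tuples.

    Assumption for illustration only: above the threshold, candidates are
    weighted by remaining space. The real policy may differ.
    """
    total_capacity = sum(cap for _, cap, _ in nodes)
    total_used = sum(used for _, _, used in nodes)
    utilization = total_used / total_capacity

    if utilization <= threshold:
        # Below the threshold: plain random choice, no space check.
        return rng.choice(nodes)[0]

    # Above the threshold: bias the choice toward nodes with more free space.
    weights = [cap - used for _, cap, used in nodes]
    if sum(weights) <= 0:
        # Every candidate is full; fall back to the plain random choice.
        return rng.choice(nodes)[0]
    return rng.choices(nodes, weights=weights, k=1)[0][0]
```

Weighting rather than excluding small nodes keeps them eligible, which matches 
the stated goal of slowing their growth rather than stopping it.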

The appropriate threshold is specific to the cluster configuration. There is 
no generally good value to set, so the feature is disabled by default. We have 
seen cases where a threshold of 85% - 90% would help. Figuring out when 
{{totalSpaceUsed / numNodes}} comes close to the capacity of a smaller node is 
helpful in determining the threshold.
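
As a rough illustration of that estimate, the Python sketch below computes the 
cluster-wide utilization at which {{totalSpaceUsed / numNodes}} reaches the 
capacity of the smallest node. The function name and the node capacities are 
made up for the example and are not part of the proposal.

```python
def utilization_when_small_nodes_fill(capacities):
    """Return the cluster utilization (0..1) at which the average per-node
    usage equals the capacity of the smallest node."""
    if not capacities:
        raise ValueError("need at least one node capacity")
    total_capacity = sum(capacities)
    num_nodes = len(capacities)
    smallest = min(capacities)
    # totalSpaceUsed / numNodes == smallest  =>  totalSpaceUsed == smallest * numNodes
    return smallest * num_nodes / total_capacity

# Hypothetical cluster: eight 48 TB nodes and two 40 TB nodes.
capacities_tb = [48] * 8 + [40] * 2
threshold = utilization_when_small_nodes_fill(capacities_tb)
print(f"{threshold:.1%}")  # prints 86.2%
```

For this made-up cluster the estimate is about 86%, so a configured threshold 
somewhat below that value would be a reasonable starting point to evaluate.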



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)