[ 
https://issues.apache.org/jira/browse/HDFS-9502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anu Engineer updated HDFS-9502:
-------------------------------
    Attachment: HDFS-9502-HDFS-1312.001.patch

This patch changes VolumeDataDensity to VolumeWeightedVariance using the 
algorithm suggested by [~szetszwo].

[~szetszwo], please note that I have left NodeDataDensity as is, since it is 
only used by the command-line tool to print out which nodes need the user's 
attention.

The real balancing computation uses the VolumeWeightedMean and 
VolumeWeightedVariance.
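
For readers without the patch at hand, here is a minimal sketch of a 
textbook weighted mean and weighted variance. It is not the code in the 
patch. The class name WeightedStats and its method names are hypothetical, 
and it assumes that each volume's value is its used fraction and that its 
weight is its capacity, which this comment does not confirm.

```java
// Hypothetical helper, not part of the HDFS-9502 patch.
final class WeightedStats {
    private WeightedStats() {}

    // Weighted mean: sum(w_i * x_i) / sum(w_i).
    static double weightedMean(double[] values, double[] weights) {
        double totalWeight = checkAndSumWeights(values, weights);
        double weightedSum = 0.0;
        for (int i = 0; i < values.length; i++) {
            weightedSum += weights[i] * values[i];
        }
        return weightedSum / totalWeight;
    }

    // Weighted (population) variance: sum(w_i * (x_i - mean)^2) / sum(w_i).
    static double weightedVariance(double[] values, double[] weights) {
        double totalWeight = checkAndSumWeights(values, weights);
        double mean = weightedMean(values, weights);
        double weightedSquares = 0.0;
        for (int i = 0; i < values.length; i++) {
            double delta = values[i] - mean;
            weightedSquares += weights[i] * delta * delta;
        }
        return weightedSquares / totalWeight;
    }

    // Validates the inputs and returns the sum of the weights.
    private static double checkAndSumWeights(double[] values, double[] weights) {
        if (values.length != weights.length) {
            throw new IllegalArgumentException("values and weights differ in length");
        }
        double totalWeight = 0.0;
        for (double w : weights) {
            if (w < 0.0) {
                throw new IllegalArgumentException("negative weight: " + w);
            }
            totalWeight += w;
        }
        if (totalWeight <= 0.0) {
            throw new IllegalArgumentException("total weight must be positive");
        }
        return totalWeight;
    }
}
```

Under these assumptions, a node whose volumes all have the same used 
fraction has a variance of zero, and the variance grows as data becomes 
unevenly spread, with larger volumes counting for more.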

This patch depends on HDFS-9469, so I am not submitting it for a Jenkins run 
yet. I will do so after HDFS-9469 is committed. I am uploading the patch now 
since it is ready for code review.
 

> DiskBalancer : Replace Node and Data Density with Weighted Mean and Variance
> ----------------------------------------------------------------------------
>
>                 Key: HDFS-9502
>                 URL: https://issues.apache.org/jira/browse/HDFS-9502
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: datanode
>            Reporter: Anu Engineer
>            Assignee: Anu Engineer
>         Attachments: HDFS-9502-HDFS-1312.001.patch
>
>
> We use notions called Data Density, which are similar to weighted mean and 
> variance. Make sure that the computations map directly to these concepts, 
> since they are easier to understand than the density as currently defined 
> in Disk Balancer.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
