[
https://issues.apache.org/jira/browse/HDFS-9502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Anu Engineer updated HDFS-9502:
-------------------------------
Attachment: HDFS-9502-HDFS-1312.001.patch
This patch changes VolumeDataDensity to VolumeWeightedVariance using the
algorithm suggested by [~szetszwo].
[~szetszwo], please note that I have left NodeDataDensity as is, since it is
only used by the command-line tool to print out which nodes need the user's
attention. The real balancing computation uses VolumeWeightedMean and
VolumeWeightedVariance.
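For reviewers who want a concrete picture of the two quantities, here is a minimal sketch of a capacity-weighted mean and variance over a node's volumes. It is not the code in the patch: the class name WeightedStats, the method names, and the choice of capacity as the weight are assumptions made for illustration, and the patch may define the weights differently.

```java
/**
 * Hypothetical helper, not part of the patch: capacity-weighted mean and
 * variance of per-volume used ratios. Weighting by capacity is an assumption.
 */
public final class WeightedStats {
    private WeightedStats() {}

    /** Weighted mean of used ratios, which reduces to sum(used) / sum(capacity). */
    public static double weightedMean(long[] used, long[] capacity) {
        checkArgs(used, capacity);
        double totalUsed = 0;
        double totalCapacity = 0;
        for (int i = 0; i < used.length; i++) {
            totalUsed += used[i];
            totalCapacity += capacity[i];
        }
        if (totalCapacity == 0) {
            throw new IllegalArgumentException("total capacity is zero");
        }
        return totalUsed / totalCapacity;
    }

    /** Weighted variance: sum(capacity_i * (ratio_i - mean)^2) / sum(capacity). */
    public static double weightedVariance(long[] used, long[] capacity) {
        double mean = weightedMean(used, capacity);
        double weightedSquares = 0;
        double totalCapacity = 0;
        for (int i = 0; i < used.length; i++) {
            if (capacity[i] == 0) {
                continue; // a zero-capacity volume carries no weight
            }
            double ratio = (double) used[i] / capacity[i];
            weightedSquares += capacity[i] * (ratio - mean) * (ratio - mean);
            totalCapacity += capacity[i];
        }
        return weightedSquares / totalCapacity;
    }

    private static void checkArgs(long[] used, long[] capacity) {
        if (used.length != capacity.length || used.length == 0) {
            throw new IllegalArgumentException("arrays must be non-empty and of equal length");
        }
    }
}
```

Under these assumptions, two 100-unit volumes holding 20 and 80 units have a weighted mean of 0.5 and a weighted variance of 0.09, while two volumes that are each half full have a variance of 0, meaning there is nothing to balance.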
This patch depends on HDFS-9469, so I am not submitting it for a Jenkins run
yet; I will do that once HDFS-9469 is committed. I am uploading the patch now
since it is ready for code review.
> DiskBalancer : Replace Node and Data Density with Weighted Mean and Variance
> ----------------------------------------------------------------------------
>
> Key: HDFS-9502
> URL: https://issues.apache.org/jira/browse/HDFS-9502
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Components: datanode
> Reporter: Anu Engineer
> Assignee: Anu Engineer
> Attachments: HDFS-9502-HDFS-1312.001.patch
>
>
> We use notions called Data Density, which are similar to weighted
> mean and variance. Make sure that the computations map directly to these
> concepts, since they are easier to understand than density as currently
> defined in Disk Balancer.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)