[
https://issues.apache.org/jira/browse/HDFS-10567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15421642#comment-15421642
]
Anu Engineer commented on HDFS-10567:
-------------------------------------
Can we please re-write
{noformat}
Percentage skew in decimal that we " +
"tolerate before diskbalancer starts working e.g. 10 means " +
"it starts working when disk data density is less or " +
"greater than 10%.
{noformat}
as
{noformat}
Percentage of data skew that is tolerated before disk balancer starts working.
For example, if total data on a 2 disk node is 100 GB then disk balancer
calculates
the expected value on each disk, which is 50 GB. if the tolerance is 10% then
data on a single disk needs to be more than 60 GB ( 50 GB + 10% tolerance
value)
for Disk balancer to balance the disks.
{noformat}
> Improve plan command help message
> ---------------------------------
>
> Key: HDFS-10567
> URL: https://issues.apache.org/jira/browse/HDFS-10567
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Components: datanode
> Reporter: Lei (Eddy) Xu
> Assignee: Xiaobing Zhou
> Attachments: HDFS-10567-HDFS-10576.001.patch,
> HDFS-10567-HDFS-10576.002.patch, HDFS-10567-HDFS-1312.000.patch
>
>
> {code}
> --bandwidth <arg> Maximum disk bandwidth to be consumed by
> diskBalancer. e.g. 10
> --maxerror <arg> Describes how many errors can be
> tolerated while copying between a pair
> of disks.
> --out <arg> File to write output to, if not
> specified defaults will be used.
> --plan <arg> creates a plan for datanode.
> --thresholdPercentage <arg> Percentage skew that wetolerate before
> diskbalancer starts working e.g. 10
> --v Print out the summary of the plan on
> console
> {code}
> We should
> * Put the unit into {{--bandwidth}}, or its help message. Is it an integer or
> float / double number? Not clear in CLI message.
> * Give more details about {{--plan}}. It is not clear what the {{<arg>}} is
> for.
> * {{--thresholdPercentage}}, has typo {{wetolerate}} in the error message.
> Also it needs to indicated that it is the difference between space
> utilization between two disks / volumes. Is it an integer or float / double
> number?
> Thanks.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]