[ 
https://issues.apache.org/jira/browse/HBASE-25625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17294925#comment-17294925
 ] 

Clara Xiong edited comment on HBASE-25625 at 3/4/21, 1:36 AM:
--------------------------------------------------------------

after:  


was (Author: claraxiong):
after:  !Screen Shot 2021-03-03 at 12.17.39 PM.png!

> StochasticBalancer CostFunctions needs a better way to evaluate resource 
> distribution
> -------------------------------------------------------------------------------------
>
>                 Key: HBASE-25625
>                 URL: https://issues.apache.org/jira/browse/HBASE-25625
>             Project: HBase
>          Issue Type: Improvement
>          Components: Balancer, master
>            Reporter: Clara Xiong
>            Assignee: Clara Xiong
>            Priority: Major
>         Attachments: Screen Shot 2021-03-03 at 12.01.12 PM.png, Screen Shot 
> 2021-03-03 at 12.08.58 PM.png, Screen Shot 2021-03-03 at 12.17.39 PM.png
>
>
> Currently CostFunctions including RegionCountSkewCostFunctions, 
> PrimaryRegionCountSkewCostFunctions and all load cost functions calculate how 
> uneven the distribution by getting the sum of deviation per region server. 
> TableSkewCostFunction uses the sum of the max region per server for all 
> tables as the measure of unevenness. 
> This simple implementation works when the cluster is small. But when the 
> cluster get larger with more region servers and regions, it doesn't work well 
> with hot spots or a small number of unbalanced servers.
> The proposal is to use the standard deviation of the count per region server 
> to capture the existence of a small portion of region servers with 
> overwhelming load/allocation.
> Patch is in test and will follow shortly.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to