[ 
https://issues.apache.org/jira/browse/HBASE-25769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17412231#comment-17412231
 ] 

Clara Xiong commented on HBASE-25769:
-------------------------------------

[~Xiaolin Ha] More updates for byTable option. We have seen it enabled on a 
different large cluster ( close to 1000 RS )which worked poorly too. By table 
works fine with many small tables and with at most a large table on a large 
cluster. The fix in https://issues.apache.org/jira/browse/HBASE-25739  make 
tableSkewCostFunction finally work and we switched to rely on it by increasing 
its weight other than  by table option. 

> Update default weight of cost functions
> ---------------------------------------
>
>                 Key: HBASE-25769
>                 URL: https://issues.apache.org/jira/browse/HBASE-25769
>             Project: HBase
>          Issue Type: Sub-task
>          Components: Balancer
>            Reporter: Clara Xiong
>            Priority: Major
>
> In production, we have seen some critical big tables that handle majority of 
> the load. Table Skew is becoming more important. With the update of table 
> skew function, balancer finally works for large table distribution on large 
> cluster. We should increase the weight from 35 to a level comparable to 
> region count skew: 500. We can even push further to replace region count skew 
> by table skew since the latter works in the same way and account for region 
> distribution per node.
> Another weight we found helpful to increase is for store file size cost 
> function. Ideally if normalizer works perfectly, we don't need to worry about 
> it since region count skew would have accounted for it. But we are often in a 
> situation it doesn't. Store file distribution needs to be given more way as 
> accommodation. we tested changing it from 5 to 200 and it works fine.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to