[ 
https://issues.apache.org/jira/browse/HBASE-30134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wellington Chevreuil updated HBASE-30134:
-----------------------------------------
    Summary: Improve CacheAwareLoadBalancer to consider low cache ratio when 
calculating imbalance  (was: Improve CacheAwareLoadBalancer to consider low 
cache ratio and available cache space)

> Improve CacheAwareLoadBalancer to consider low cache ratio when calculating 
> imbalance
> -------------------------------------------------------------------------------------
>
>                 Key: HBASE-30134
>                 URL: https://issues.apache.org/jira/browse/HBASE-30134
>             Project: HBase
>          Issue Type: Sub-task
>            Reporter: Wellington Chevreuil
>            Assignee: Wellington Chevreuil
>            Priority: Major
>
> When enabling time based priority on CFs that are already cached on clusters 
> with datasets larger than the cache capacity, once the now cold data gets 
> evicted, hot data that was previously uncached due to no cache space would 
> remain uncached unless:
> 1) A client read request read it from the file system and caches it;
> 2) An operator manually disables and reenables tables with hot data, so that 
> the prefetch executor can run and cache those blocks.
> Both options are non optimal, leading to temporary performance impacts and/or 
> requiring manual interventions.
> The CacheAwareLoadBalancer, currently, only raises the cost of moving highly 
> cached regions, when calculating assignment plans, and doesn't consider cache 
> ratio at all when calculating potential imbalance, only skewness. So in the 
> scenario where regions are evenly distributed and there's no skewness, 
> CacheAwareLoadBalancer would not trigger any moves even when there would be 
> regions with low cache ratio on many servers and enough cache space to 
> accommodate those regions data in region servers cache.
>  
> This proposal is to include low cached ratio regions in the imbalance 
> calculation, so that CacheAwareLoadBalancer can trigger the computation of 
> new assignment plans. 
> It also needs to recalculate cache ratio on the assignment plans, simulating 
> that low cache ratio regions moved to enough cache free space servers would 
> get fully cached, so that such plan scores higher then the current state.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to