[
https://issues.apache.org/jira/browse/HBASE-24298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Duo Zhang updated HBASE-24298:
------------------------------
Fix Version/s: (was: 1.2.12)
> Reduce cpu load of locating region especially in batch mode.
> ------------------------------------------------------------
>
> Key: HBASE-24298
> URL: https://issues.apache.org/jira/browse/HBASE-24298
> Project: HBase
> Issue Type: Bug
> Affects Versions: 1.2.12
> Reporter: star
> Assignee: star
> Priority: Major
> Attachments: HBASE-24298.patch, locating region.svg
>
>
> Binary search is used to speedup the process of locating region. It is
> already fast enough, while cpu of HBASE client becomes the bottleneck when
> doing TCSB benchmark. We can make the process of locating region faster to
> reduce cpu load in some special cases , which however is our common case in
> production environment. It is the case:
> 1. Predefined splits in uniform distribution.
>
> 2. Load data in batch mode.
> The optimization is very simple, just to contract range of binary search.
> Initially, record all startIndex and endIndex of first or two bytes of keys.
> When a region key comes, find the contracted startIndex and endIndex of the
> key. Then return to normal binary search process with the specified
> startIndex and endIndex.
> Then we can ideally reduce cpu to 1/8 with 1 byte or 1/16 with 2 bytes.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)