[jira] [Updated] (HBASE-24298) Reduce cpu load of locating region especially in batch mode.

2021-12-31 Thread Duo Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-24298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Duo Zhang updated HBASE-24298:
--
Component/s: Performance

> Reduce cpu load of locating region especially in batch mode.
> 
>
> Key: HBASE-24298
> URL: https://issues.apache.org/jira/browse/HBASE-24298
> Project: HBase
>  Issue Type: Bug
>  Components: Performance
>Affects Versions: 1.2.12
>Reporter: star
>Assignee: star
>Priority: Major
> Attachments: HBASE-24298.patch, locating region.svg
>
>
> Binary search is used to speedup the process of locating region. It is 
> already fast enough, while cpu of HBASE client becomes the bottleneck when 
> doing TCSB benchmark. We can make the process of locating region faster to 
> reduce cpu load in some special cases , which however is our common case in 
> production environment.  It is the case: 
>         1. Predefined splits in uniform distribution.
>   
>          2. Load data in batch mode.
> The optimization is very simple, just to contract range of binary search. 
> Initially,  record all startIndex and endIndex of first or two bytes of keys. 
> When a region key comes, find the contracted startIndex and endIndex of the 
> key. Then return to normal binary search process with the specified 
> startIndex and endIndex. 
> Then we can ideally reduce cpu to 1/8 with 1 byte or 1/16 with 2 bytes.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HBASE-24298) Reduce cpu load of locating region especially in batch mode.

2021-12-31 Thread Duo Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-24298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Duo Zhang updated HBASE-24298:
--
Fix Version/s: (was: 1.2.12)

> Reduce cpu load of locating region especially in batch mode.
> 
>
> Key: HBASE-24298
> URL: https://issues.apache.org/jira/browse/HBASE-24298
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 1.2.12
>Reporter: star
>Assignee: star
>Priority: Major
> Attachments: HBASE-24298.patch, locating region.svg
>
>
> Binary search is used to speedup the process of locating region. It is 
> already fast enough, while cpu of HBASE client becomes the bottleneck when 
> doing TCSB benchmark. We can make the process of locating region faster to 
> reduce cpu load in some special cases , which however is our common case in 
> production environment.  It is the case: 
>         1. Predefined splits in uniform distribution.
>   
>          2. Load data in batch mode.
> The optimization is very simple, just to contract range of binary search. 
> Initially,  record all startIndex and endIndex of first or two bytes of keys. 
> When a region key comes, find the contracted startIndex and endIndex of the 
> key. Then return to normal binary search process with the specified 
> startIndex and endIndex. 
> Then we can ideally reduce cpu to 1/8 with 1 byte or 1/16 with 2 bytes.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Updated] (HBASE-24298) Reduce cpu load of locating region especially in batch mode.

2020-07-15 Thread star (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-24298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

star updated HBASE-24298:
-
Attachment: HBASE-24298.patch

> Reduce cpu load of locating region especially in batch mode.
> 
>
> Key: HBASE-24298
> URL: https://issues.apache.org/jira/browse/HBASE-24298
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 1.2.12
>Reporter: star
>Assignee: star
>Priority: Major
> Fix For: 1.2.12
>
> Attachments: HBASE-24298.patch, locating region.svg
>
>
> Binary search is used to speedup the process of locating region. It is 
> already fast enough, while cpu of HBASE client becomes the bottleneck when 
> doing TCSB benchmark. We can make the process of locating region faster to 
> reduce cpu load in some special cases , which however is our common case in 
> production environment.  It is the case: 
>         1. Predefined splits in uniform distribution.
>   
>          2. Load data in batch mode.
> The optimization is very simple, just to contract range of binary search. 
> Initially,  record all startIndex and endIndex of first or two bytes of keys. 
> When a region key comes, find the contracted startIndex and endIndex of the 
> key. Then return to normal binary search process with the specified 
> startIndex and endIndex. 
> Then we can ideally reduce cpu to 1/8 with 1 byte or 1/16 with 2 bytes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HBASE-24298) Reduce cpu load of locating region especially in batch mode.

2020-07-15 Thread star (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-24298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

star updated HBASE-24298:
-
Fix Version/s: 1.2.12
Affects Version/s: 1.2.12
   Status: Patch Available  (was: Open)

> Reduce cpu load of locating region especially in batch mode.
> 
>
> Key: HBASE-24298
> URL: https://issues.apache.org/jira/browse/HBASE-24298
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 1.2.12
>Reporter: star
>Assignee: star
>Priority: Major
> Fix For: 1.2.12
>
> Attachments: HBASE-24298.patch, locating region.svg
>
>
> Binary search is used to speedup the process of locating region. It is 
> already fast enough, while cpu of HBASE client becomes the bottleneck when 
> doing TCSB benchmark. We can make the process of locating region faster to 
> reduce cpu load in some special cases , which however is our common case in 
> production environment.  It is the case: 
>         1. Predefined splits in uniform distribution.
>   
>          2. Load data in batch mode.
> The optimization is very simple, just to contract range of binary search. 
> Initially,  record all startIndex and endIndex of first or two bytes of keys. 
> When a region key comes, find the contracted startIndex and endIndex of the 
> key. Then return to normal binary search process with the specified 
> startIndex and endIndex. 
> Then we can ideally reduce cpu to 1/8 with 1 byte or 1/16 with 2 bytes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HBASE-24298) Reduce cpu load of locating region especially in batch mode.

2020-07-15 Thread star (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-24298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

star updated HBASE-24298:
-
Attachment: (was: HBASE-24298.patch)

> Reduce cpu load of locating region especially in batch mode.
> 
>
> Key: HBASE-24298
> URL: https://issues.apache.org/jira/browse/HBASE-24298
> Project: HBase
>  Issue Type: Bug
>Reporter: star
>Assignee: star
>Priority: Major
> Attachments: locating region.svg
>
>
> Binary search is used to speedup the process of locating region. It is 
> already fast enough, while cpu of HBASE client becomes the bottleneck when 
> doing TCSB benchmark. We can make the process of locating region faster to 
> reduce cpu load in some special cases , which however is our common case in 
> production environment.  It is the case: 
>         1. Predefined splits in uniform distribution.
>   
>          2. Load data in batch mode.
> The optimization is very simple, just to contract range of binary search. 
> Initially,  record all startIndex and endIndex of first or two bytes of keys. 
> When a region key comes, find the contracted startIndex and endIndex of the 
> key. Then return to normal binary search process with the specified 
> startIndex and endIndex. 
> Then we can ideally reduce cpu to 1/8 with 1 byte or 1/16 with 2 bytes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HBASE-24298) Reduce cpu load of locating region especially in batch mode.

2020-07-15 Thread star (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-24298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

star updated HBASE-24298:
-
Attachment: HBASE-24298.patch

> Reduce cpu load of locating region especially in batch mode.
> 
>
> Key: HBASE-24298
> URL: https://issues.apache.org/jira/browse/HBASE-24298
> Project: HBase
>  Issue Type: Bug
>Reporter: star
>Assignee: star
>Priority: Major
> Attachments: HBASE-24298.patch, locating region.svg
>
>
> Binary search is used to speedup the process of locating region. It is 
> already fast enough, while cpu of HBASE client becomes the bottleneck when 
> doing TCSB benchmark. We can make the process of locating region faster to 
> reduce cpu load in some special cases , which however is our common case in 
> production environment.  It is the case: 
>         1. Predefined splits in uniform distribution.
>   
>          2. Load data in batch mode.
> The optimization is very simple, just to contract range of binary search. 
> Initially,  record all startIndex and endIndex of first or two bytes of keys. 
> When a region key comes, find the contracted startIndex and endIndex of the 
> key. Then return to normal binary search process with the specified 
> startIndex and endIndex. 
> Then we can ideally reduce cpu to 1/8 with 1 byte or 1/16 with 2 bytes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HBASE-24298) Reduce cpu load of locating region especially in batch mode.

2020-05-06 Thread star (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-24298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

star updated HBASE-24298:
-
Description: 
Binary search is used to speedup the process of locating region. It is already 
fast enough, while cpu of HBASE client becomes the bottleneck when doing TCSB 
benchmark. We can make the process of locating region faster to reduce cpu load 
in some special cases , which however is our common case in production 
environment.  It is the case: 

        1. Predefined splits in uniform distribution.
  
         2. Load data in batch mode.

The optimization is very simple, just to contract range of binary search. 
Initially,  record all startIndex and endIndex of first or two bytes of keys. 
When a region key comes, find the contracted startIndex and endIndex of the 
key. Then return to normal binary search process with the specified startIndex 
and endIndex. 

Then we can ideally reduce cpu to 1/8 with 1 byte or 1/16 with 2 bytes.

  was:
Binary search is used to speedup the process of locating region. It is already 
fast enough, while cpu of HBASE client becomes the bottleneck when doing TCSB 
benchmark. We can make the process of locating region faster to reduce cpu load 
in some special cases , which however it is our common case in production 
environment.  It is the case: 

        1. Predefined splits in uniform distribution.
  
         2. Load data in batch mode.

The optimization is very simple, just to contract range of binary search. 
Initially,  record all startIndex and endIndex of first or two bytes of keys. 
When a region key comes, find the contracted startIndex and endIndex of the 
key. Then return to normal binary search process with the specified startIndex 
and endIndex. 

When we can ideally reduce cpu to 1/8 with 1 byte or 1/16 with 2 bytes.


> Reduce cpu load of locating region especially in batch mode.
> 
>
> Key: HBASE-24298
> URL: https://issues.apache.org/jira/browse/HBASE-24298
> Project: HBase
>  Issue Type: Bug
>Reporter: star
>Priority: Major
> Attachments: locating region.svg
>
>
> Binary search is used to speedup the process of locating region. It is 
> already fast enough, while cpu of HBASE client becomes the bottleneck when 
> doing TCSB benchmark. We can make the process of locating region faster to 
> reduce cpu load in some special cases , which however is our common case in 
> production environment.  It is the case: 
>         1. Predefined splits in uniform distribution.
>   
>          2. Load data in batch mode.
> The optimization is very simple, just to contract range of binary search. 
> Initially,  record all startIndex and endIndex of first or two bytes of keys. 
> When a region key comes, find the contracted startIndex and endIndex of the 
> key. Then return to normal binary search process with the specified 
> startIndex and endIndex. 
> Then we can ideally reduce cpu to 1/8 with 1 byte or 1/16 with 2 bytes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HBASE-24298) Reduce cpu load of locating region especially in batch mode.

2020-05-01 Thread star (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-24298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

star updated HBASE-24298:
-
Summary: Reduce cpu load of locating region especially in batch mode.  
(was: Reduce cpu cost of locating region especially in batch mode.)

> Reduce cpu load of locating region especially in batch mode.
> 
>
> Key: HBASE-24298
> URL: https://issues.apache.org/jira/browse/HBASE-24298
> Project: HBase
>  Issue Type: Bug
>Reporter: star
>Priority: Major
> Attachments: locating region.svg
>
>
> Binary search is used to speedup the process of locating region. It is 
> already fast enough, while cpu of HBASE client becomes the bottleneck when 
> doing TCSB benchmark. We can make the process of locating region faster to 
> reduce cpu load in some special cases , which however it is our common case 
> in production environment.  It is the case: 
>         1. Predefined splits in uniform distribution.
>   
>          2. Load data in batch mode.
> The optimization is very simple, just to contract range of binary search. 
> Initially,  record all startIndex and endIndex of first or two bytes of keys. 
> When a region key comes, find the contracted startIndex and endIndex of the 
> key. Then return to normal binary search process with the specified 
> startIndex and endIndex. 
> When we can ideally reduce cpu to 1/8 with 1 byte or 1/16 with 2 bytes.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)