[jira] [Comment Edited] (HBASE-24436) The store file open and close thread pool should be shared at the region level

Anoop Sam John (Jira) Tue, 26 May 2020 23:40:24 -0700


    [ 
https://issues.apache.org/jira/browse/HBASE-24436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17117435#comment-17117435
 ]


Anoop Sam John edited comment on HBASE-24436 at 5/27/20, 6:39 AM:
------------------------------------------------------------------

Just moving the discussion from PR comments to here.

If I understand the jira correctly, what you are trying to solve is below case.
One region with say 2 stores. Store1 having much more files than other. Say the 
config for the #threads in open pool is 10. Now it will create 2 pools for each 
store with 5 threads each. The Store2 will get finished soon. But store1 will 
take much longer. So if it was a shared pool of 10 threads the overall time for 
opening both stores would have been lesser. my understanding correct?

Also this make more sense when hbase.hstore.open.and.close.threads.max is also 
configured to be 1+  By default we open stores serially (only 1 thread)..  Then 
also this change will give a larger thread pool for the store file open.



was (Author: anoop.hbase):
Just moving the discussion from PR comments to here.

If I understand the jira correctly, what you are trying to solve is below case.
One region with say 2 stores. Store1 having much more files than other. Say the 
config for the #threads in open pool is 10. Now it will create 2 pools for each 
store with 5 threads each. The Store2 will get finished soon. But store1 will 
take much longer. So if it was a shared pool of 10 threads the overall time for 
opening both stores would have been lesser. my understanding correct?

> The store file open and close thread pool should be shared at the region level
> ------------------------------------------------------------------------------
>
>                 Key: HBASE-24436
>                 URL: https://issues.apache.org/jira/browse/HBASE-24436
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Junhong Xu
>            Assignee: Junhong Xu
>            Priority: Minor
>
> For now, we provide threads per column family evenly in general, but  there 
> are some cases that some column families have much more store files than 
> others( maybe that's the life, right? ). So in that case, some Stores have 
> beed done quickly while others are struggling.We should share the thread pool 
> at the region level in case of data skew.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Comment Edited] (HBASE-24436) The store file open and close thread pool should be shared at the region level

Reply via email to