[
https://issues.apache.org/jira/browse/HBASE-24436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17117435#comment-17117435
]
Anoop Sam John commented on HBASE-24436:
----------------------------------------
Just moving the discussion from PR comments to here.
If I understand the jira correctly, what you are trying to solve is below case.
One region with say 2 stores. Store1 having much more files than other. Say the
config for the #threads in open pool is 10. Now it will create 2 pools for each
store with 5 threads each. The Store2 will get finished soon. But store1 will
take much longer. So if it was a shared pool of 10 threads the overall time for
opening both stores would have been lesser. my understanding correct?
> The store file open and close thread pool should be shared at the region level
> ------------------------------------------------------------------------------
>
> Key: HBASE-24436
> URL: https://issues.apache.org/jira/browse/HBASE-24436
> Project: HBase
> Issue Type: Improvement
> Reporter: Junhong Xu
> Assignee: Junhong Xu
> Priority: Minor
>
> For now, we provide threads per column family evenly in general, but there
> are some cases that some column families have much more store files than
> others( maybe that's the life, right? ). So in that case, some Stores have
> beed done quickly while others are struggling.We should share the thread pool
> at the region level in case of data skew.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)