[ 
https://issues.apache.org/jira/browse/HBASE-9467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13766263#comment-13766263
 ] 

Feng Honghua commented on HBASE-9467:
-------------------------------------

Change and explanation of the patch:

1. Throw RegionOverloadedException immediately rather than wait/retry within 
HRegion when the target region is above the memstore limit, this avoid write 
requests on region above memstore limit occupying/saturating handler threads. 
This change is in HRegion.checkResources method.

2. Reuse the exception handling and retry mechanism of AsyncProcess in client 
to handle RegionOverloadedException thrown from RS. Since 
RegionOverloadedException is not a DoNotRetryIOException, it'll be handled the 
same way as other non-DoNotRetryIOException thrown from RS by AsyncProcess and 
the according request will be retried using incremental backoff. 
   In a more general sense, we can view RegionOverloadedException as another 
kind of retriable exception and reuse all the current handling for it in 
AsyncProcess/client, so no change in client side code. And if we really want to 
use exponential backoff rather than incremental backoff for 
RegionOverloadedException, as Todd suggested, we can change the code in 
AsyncProcess accordingly.

3. We also need to check memstore limit and throw RegionOverloadedException for 
'increment' and 'append' operations, since they also insert kv to memstore and 
increase its size. (checkResources is not called for these two operations in 
HRegion previously, corrected here)

4. In UT TestHFileArchiving, RegionOverloadedException is thrown during 
loadRegion and since the 'put' operations are called directly via HRegion, not 
via client/AsyncProcess, a similiar 'catch-and-wait' handling is added here to 
proceed without failure.

[~nkeywal] / [~stack] / [~tlipcon] : Any feedback for the patch? Thanks in 
advance.
                
> write can be totally blocked temporarily by a write-heavy region
> ----------------------------------------------------------------
>
>                 Key: HBASE-9467
>                 URL: https://issues.apache.org/jira/browse/HBASE-9467
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Feng Honghua
>            Assignee: Feng Honghua
>         Attachments: HBASE-9467-trunk-v0.patch
>
>
> Write to a region can be blocked temporarily if the memstore of that region 
> reaches the threshold(hbase.hregion.memstore.block.multiplier * 
> hbase.hregion.flush.size) until the memstore of that region is flushed.
> For a write-heavy region, if its write requests saturates all the handler 
> threads of that RS when write blocking for that region occurs, requests of 
> other regions/tables to that RS also can't be served due to no available 
> handler threads...until the pending writes of that write-heavy region are 
> served after the flush is done. Hence during this time period, from the RS 
> perspective it can't serve any request from any table/region just due to a 
> single write-heavy region.
> This sounds not very reasonable, right? Maybe write requests from a region 
> can only be served by a sub-set of the handler threads, and then write 
> blocking of any single region can't lead to the scenario mentioned above?
> Comment?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to