[ 
https://issues.apache.org/jira/browse/HBASE-2752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12880422#action_12880422
 ] 

Dave Latham commented on HBASE-2752:
------------------------------------

Thanks for the quick work.  It's really appreciated.  I'll try to get this 
patch tested on a cluster.

Minor nits:
* The log on "Cache flush failed" should use toStringBinary for the region name.
* blockingWaitTime / 100 seems somewhat arbitrary for check interval, but 
probably fine for now.
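
For illustration only, here is a minimal sketch of a bounded wait that uses 
the blockingWaitTime / 100 check interval mentioned above. It is not the code 
from 2752.txt: the class name, the method name, the Outcome enum, and the 
1 ms floor on the interval are all assumptions made for this example, and the 
"too many store files" check is passed in as a plain BooleanSupplier rather 
than read from a real region.

```java
import java.util.concurrent.TimeUnit;
import java.util.function.BooleanSupplier;

/** Hypothetical helper; not part of HBase. */
public class BoundedFlushWait {

    /** Why the wait ended. */
    public enum Outcome { COMPACTED, TIMED_OUT, INTERRUPTED }

    /**
     * Polls tooManyStoreFiles until it returns false or until
     * blockingWaitTimeMs has elapsed, whichever comes first.
     */
    public static Outcome waitForCompaction(BooleanSupplier tooManyStoreFiles,
                                            long blockingWaitTimeMs) {
        // Check interval as discussed in the comment: blockingWaitTime / 100.
        // The 1 ms floor is an assumption, added so a tiny timeout never
        // yields a zero-length sleep.
        long checkIntervalMs = Math.max(1L, blockingWaitTimeMs / 100);
        long deadline = System.nanoTime()
                + TimeUnit.MILLISECONDS.toNanos(blockingWaitTimeMs);

        while (tooManyStoreFiles.getAsBoolean()) {
            long remainingNanos = deadline - System.nanoTime();
            if (remainingNanos <= 0) {
                // Give up waiting so the caller can flush anyway.
                return Outcome.TIMED_OUT;
            }
            long remainingMs =
                    Math.max(1L, TimeUnit.NANOSECONDS.toMillis(remainingNanos));
            try {
                Thread.sleep(Math.min(checkIntervalMs, remainingMs));
            } catch (InterruptedException e) {
                // Preserve the interrupt status for the caller.
                Thread.currentThread().interrupt();
                return Outcome.INTERRUPTED;
            }
        }
        return Outcome.COMPACTED;
    }
}
```

With the 90 second wait from the issue description, this would re-check 
roughly every 900 ms. On TIMED_OUT the caller would go ahead and flush the 
region instead of re-queueing it indefinitely.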


> Don't retry forever when waiting on too many store files
> --------------------------------------------------------
>
>                 Key: HBASE-2752
>                 URL: https://issues.apache.org/jira/browse/HBASE-2752
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Jean-Daniel Cryans
>            Assignee: stack
>            Priority: Critical
>             Fix For: 0.20.5, 0.21.0
>
>         Attachments: 2752.txt
>
>
> HBASE-2087 introduced a way to not block all flushes when one region has too 
> many store files. Unfortunately, that undid the behavior that if we waited 
> for longer than 90 secs we would still flush the region... which means that 
> when a region blocks inserts because its memstore is too big, it's actually 
> holding off writes for a very long time, occupying handlers, etc.
> We need to add more smarts in MemStoreFlusher so that we detect when a region 
> was held up for too long.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
