[jira] [Commented] (HBASE-12457) Regions in transition for a long time when CLOSE interleaves with a slow compaction

Lars Hofhansl (JIRA) Thu, 13 Nov 2014 09:18:11 -0800

    [ 
https://issues.apache.org/jira/browse/HBASE-12457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14210036#comment-14210036
 ]


Lars Hofhansl commented on HBASE-12457:
---------------------------------------

Sorry about the build break on branch-1. I cherry-picked the patch. Usually I 
do a compile and run the relevant tests, but I spaced it this time.

The hang will not happen since we only notify *after* we set 
writestate.compacting (or writestate.flushing) back to false, so there is no 
race. I looked at that part :)
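
For readers following along, the ordering argument above can be sketched roughly as follows. This is a simplified illustration, not the actual HRegion code: the class WriteStateSketch, its nested WriteState, and the method names are hypothetical, and the flags are modeled as plain booleans guarded by the object's monitor. The point is only that the flag is cleared before notifyAll() under the same lock, and that the waiter re-checks the flags in a loop, so a wakeup cannot be missed.

```java
class WriteStateSketch {
    // Hypothetical stand-in for the region's write state; not the real HBase class.
    static final class WriteState {
        boolean compacting = false;
        boolean flushing = false;
    }

    private final WriteState writestate = new WriteState();

    void startCompaction() {
        synchronized (writestate) {
            writestate.compacting = true;
        }
    }

    void finishCompaction() {
        synchronized (writestate) {
            writestate.compacting = false; // clear the flag first ...
            writestate.notifyAll();        // ... and only then wake any waiters
        }
    }

    // What a close would call: block until no compaction or flush is running.
    void waitForFlushesAndCompactions() throws InterruptedException {
        synchronized (writestate) {
            // Re-check in a loop: a waiter proceeds only once both flags are false.
            while (writestate.compacting || writestate.flushing) {
                writestate.wait();
            }
        }
    }

    boolean isBusy() {
        synchronized (writestate) {
            return writestate.compacting || writestate.flushing;
        }
    }

    public static void main(String[] args) throws Exception {
        WriteStateSketch region = new WriteStateSketch();
        region.startCompaction();
        Thread closer = new Thread(() -> {
            try {
                region.waitForFlushesAndCompactions();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        closer.start();
        region.finishCompaction(); // closer returns once the flag is false
        closer.join();
        System.out.println("busy after close wait: " + region.isBusy());
    }
}
```

Because the waiter holds the same monitor while testing the flags, it either sees compacting == false before it ever waits, or it is already waiting when notifyAll() runs; in neither case does it hang.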

In the face of the test failures I am going to roll this back anyway, though.


> Regions in transition for a long time when CLOSE interleaves with a slow 
> compaction
> -----------------------------------------------------------------------------------
>
>                 Key: HBASE-12457
>                 URL: https://issues.apache.org/jira/browse/HBASE-12457
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.98.7
>            Reporter: Lars Hofhansl
>            Assignee: Lars Hofhansl
>             Fix For: 2.0.0, 0.98.8, 0.99.2
>
>         Attachments: 12457-combined-0.98-v2.txt, 12457-combined-0.98.txt, 
> 12457-combined-trunk.txt, 12457-minifix.txt, 12457.interrupt-v2.txt, 
> 12457.interrupt.txt, HBASE-12457.patch, HBASE-12457_addendum.patch, 
> TestRegionReplicas-jstack.txt
>
>
> Under heavy load we have observed regions remaining in transition for 20 
> minutes when the master requests a close while a slow compaction is running.
> The pattern is always something like this:
> # RS starts a compaction
> # HM requests the region to be closed on this RS
> # Compaction is not aborted for another 20 minutes
> # The region is in transition and not usable.
> In every case I tracked down so far the time between the requested CLOSE and 
> abort of the compaction is almost exactly 20 minutes, which is suspicious.
> Of course part of the issue is having compactions that take over 20 minutes, 
> but maybe we can do better here.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
