[ https://issues.apache.org/jira/browse/HBASE-12457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14210160#comment-14210160 ]
Hudson commented on HBASE-12457: -------------------------------- FAILURE: Integrated in HBase-0.98 #675 (See [https://builds.apache.org/job/HBase-0.98/675/]) Revert "HBASE-12457 Regions in transition for a long time when CLOSE interleaves with a slow compaction." (larsh: rev 7f5f1570ce83c62ce9408701677994415b127b36) * hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestCompactionIO.java * hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java * hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/SplitTransaction.java * hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/compactions/DefaultCompactor.java * hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HStore.java * hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/Store.java > Regions in transition for a long time when CLOSE interleaves with a slow > compaction > ----------------------------------------------------------------------------------- > > Key: HBASE-12457 > URL: https://issues.apache.org/jira/browse/HBASE-12457 > Project: HBase > Issue Type: Bug > Affects Versions: 0.98.7 > Reporter: Lars Hofhansl > Assignee: Lars Hofhansl > Fix For: 2.0.0, 0.98.9, 0.99.2 > > Attachments: 12457-combined-0.98-v2.txt, 12457-combined-0.98.txt, > 12457-combined-trunk.txt, 12457-minifix.txt, 12457.interrupt-v2.txt, > 12457.interrupt.txt, HBASE-12457.patch, HBASE-12457_addendum.patch, > TestRegionReplicas-jstack.txt > > > Under heave load we have observed regions remaining in transition for 20 > minutes when the master requests a close while a slow compaction is running. > The pattern is always something like this: > # RS starts a compaction > # HM request the region to be closed on this RS > # Compaction is not aborted for another 20 minutes > # The region is in transition and not usable. > In every case I tracked down so far the time between the requested CLOSE and > abort of the compaction is almost exactly 20 minutes, which is suspicious. > Of course part of the issue is having compactions that take over 20 minutes, > but maybe we can do better here. -- This message was sent by Atlassian JIRA (v6.3.4#6332)