Region close needs to be fast; e.g. if compacting, abandon it
-------------------------------------------------------------

                 Key: HBASE-2228
                 URL: https://issues.apache.org/jira/browse/HBASE-2228
             Project: Hadoop HBase
          Issue Type: Bug
            Reporter: stack


Over the last week or so I've seen slow closes cause regions to be offline for
a good amount of time. Just now, I saw a big compaction kick in because of "too
many store files". This compaction took nearly two minutes on a loaded server,
and during this time flushing was held up. When the order to close came in
(overloaded), we started the close, so incoming writes were rejected, but we
then had to wait for the compaction to finish before the close could go ahead,
even though incoming clients were already being turned away. Eventually the
compaction completed and the held-up flush was allowed to run: 91M in about 5
seconds. Only then was the close allowed to complete and the region deployed
elsewhere.

Another time I saw the flush take a good long time because HDFS was running
slow. There is probably not much we can do about that one, but we should at
least look into the above: interrupt an ongoing compaction and abandon it, or
else keep the region open while the compaction is going on and only start the
close once it is complete (this would require a new CLOSING state that keeps a
progressable up with the master).
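
A rough sketch of the first option (interrupt and abandon), assuming the
compaction can check a "closing" flag between units of work. RegionSketch,
compact() and close() are hypothetical names for illustration only, not the
actual HBase classes or methods, and the sleep stands in for real compaction
I/O:

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.locks.ReentrantLock;

/** Hypothetical sketch of a region whose close abandons a running compaction. */
class RegionSketch {
    // Set by close(); polled by the compaction between units of work.
    private final AtomicBoolean closing = new AtomicBoolean(false);
    // Held for the duration of a compaction so close() can wait for it to exit.
    private final ReentrantLock compactionLock = new ReentrantLock();
    private volatile boolean closed = false;

    /**
     * Runs a compaction of the given number of work units.
     * Returns true if it completed, false if it was abandoned.
     */
    boolean compact(int units, long millisPerUnit) {
        compactionLock.lock();
        try {
            for (int i = 0; i < units; i++) {
                if (closing.get()) {
                    // Abandon: a real implementation would also delete the
                    // partially written output file here.
                    return false;
                }
                try {
                    Thread.sleep(millisPerUnit); // stand-in for compaction I/O
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    return false; // treat an interrupt as an abandon as well
                }
            }
            return true;
        } finally {
            compactionLock.unlock();
        }
    }

    /** Signals any running compaction to stop, waits for it to exit, then closes. */
    void close() {
        closing.set(true);     // the compactor sees this at its next check
        compactionLock.lock(); // waits at most about one unit of work
        try {
            // A real implementation would flush the memstore here.
            closed = true;
        } finally {
            compactionLock.unlock();
        }
    }

    boolean isClosed() {
        return closed;
    }
}
```

With this shape, a close that arrives mid-compaction waits for roughly one unit
of work rather than the whole compaction. The cost is that the abandoned work
has to be redone by whichever server opens the region next.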
