[ 
https://issues.apache.org/jira/browse/HBASE-3332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12970284#action_12970284
 ] 

Jonathan Gray commented on HBASE-3332:
--------------------------------------

I think this is newly broken because we don't do anything with PENDING_CLOSE 
timeouts anymore.  We used to but in some other jira recently I think stack 
took it out and removed the unit test for it (I agreed at the time).  Thinking 
on this case now, it can certainly happen because of the logic in 
ServerShutdownHandler ignoring this region.

Rather than putting that back, I think there's a better way to deal with 
PENDING_CLOSE/CLOSING timeouts which should only happen in something like what 
you saw here.

Let me work on a patch.

> Regions stuck in transition after RS failure
> --------------------------------------------
>
>                 Key: HBASE-3332
>                 URL: https://issues.apache.org/jira/browse/HBASE-3332
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.90.0
>            Reporter: Todd Lipcon
>            Priority: Blocker
>         Attachments: log.txt
>
>
> Testing 0.90rc2 I ran into this issue. The test scenario was to kill -9 the 
> server hosting ROOT and META, and before it had been detected, run "balancer" 
> from the shell. After logs were split and regions were reassigned, I ended up 
> with some regions stuck in transition.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to