[ 
https://issues.apache.org/jira/browse/HBASE-2482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon updated HBASE-2482:
-------------------------------

    Attachment: hbase-2482.txt

Here's a preliminary patch that has gone through cluster testing, but could 
probably still be improved a bit (needs unit tests, and I think a lot of the 
meta/root special casing could be removed now)

Also please note the "I DONT BELIEVE YOU WILL EVER SEE THIS" message. If 
someone can produce this message I will give them a cookie! Otherwise we should 
remove this code.

I've been testing this for the last 9 hours or so with a RS failure once every 
minute and no regions have "fallen out" of the cluster. Before this fix, it 
used to "lose" a region after an hour or two.

> regions in transition do not get reassigned by master when RS crashes
> ---------------------------------------------------------------------
>
>                 Key: HBASE-2482
>                 URL: https://issues.apache.org/jira/browse/HBASE-2482
>             Project: Hadoop HBase
>          Issue Type: Bug
>          Components: master
>    Affects Versions: 0.20.5
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>            Priority: Blocker
>             Fix For: 0.20.5, 0.21.0
>
>         Attachments: hbase-2482.txt
>
>
> Very similar to HBASE-1928, but for the general case (not just ROOT/META):
> If a region is in transition on a RS when the RS crashes, the master does not 
> remove it from regionsInTransition when processing the RS shutdown. This is 
> fairly easy to trigger by bringing up a RS and kill -9ing it just as it 
> starts to get regions assigned. Those regions will get permanently lost since 
> they're stuck in regionsInTransition and thus don't get assigned by the 
> metascanner.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to