[jira] Commented: (HBASE-3142) If a master dies and comes back up before his znode expires, the RS heartbeat can lock up

Jonathan Gray (JIRA) Mon, 08 Nov 2010 15:50:32 -0800

    [ https://issues.apache.org/jira/browse/HBASE-3142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12929827#action_12929827 ]


Jonathan Gray commented on HBASE-3142:
--------------------------------------

I think another master became the active one, so the region servers all 
blocked here indefinitely (the restarted master never finished startup).

> If a master dies and comes back up before his znode expires, the RS heartbeat 
> can lock up
> -----------------------------------------------------------------------------------------
>
>                 Key: HBASE-3142
>                 URL: https://issues.apache.org/jira/browse/HBASE-3142
>             Project: HBase
>          Issue Type: Bug
>          Components: master, regionserver
>    Affects Versions: 0.89.20100924, 0.90.0
>            Reporter: Jonathan Gray
>            Assignee: ryan rawson
>            Priority: Critical
>             Fix For: 0.90.0
>
>
> During a rolling restart, we ran into a case where a master was shut down and 
> then brought back up before its znode expired.
> On the RS side, while the master was down, each RS was getting ConnectionRefused 
> exceptions trying to heartbeat to what it thought was the active master.
> Once the master process came back up, the next heartbeat done by all the RSs 
> just blocked indefinitely.
> This is somewhat related to HBASE-3141.
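
For illustration only, the failure mode described above (a heartbeat that
blocks indefinitely because the master process is back up but never finishes
startup) can be sketched in Java. This is a minimal sketch under stated
assumptions, not HBase code and not the fix applied for this issue:
HeartbeatSketch, Master, and heartbeatWithTimeout are hypothetical names, and
the stuck master is simulated with a latch that is never released. It shows
how bounding the call lets the caller regain control, for example to re-check
which master is currently active.

```java
import java.util.concurrent.*;

public class HeartbeatSketch {
    /** Hypothetical stand-in for the RS-to-master heartbeat RPC. */
    public interface Master {
        void heartbeat() throws Exception;
    }

    /**
     * Runs one heartbeat on a helper thread and waits at most timeoutMs.
     * Returns true if the call completed in time, false if it timed out,
     * failed, or the caller was interrupted.
     */
    public static boolean heartbeatWithTimeout(Master master, long timeoutMs) {
        // Daemon thread so a call that never returns cannot keep the JVM alive.
        ExecutorService pool = Executors.newSingleThreadExecutor(r -> {
            Thread t = new Thread(r, "heartbeat");
            t.setDaemon(true);
            return t;
        });
        Future<?> call = pool.submit(() -> {
            master.heartbeat();
            return null;
        });
        try {
            call.get(timeoutMs, TimeUnit.MILLISECONDS);
            return true;
        } catch (TimeoutException e) {
            // The master never answered: give up instead of blocking forever.
            call.cancel(true);
            return false;
        } catch (ExecutionException e) {
            // The heartbeat itself failed (e.g. connection refused).
            return false;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        } finally {
            pool.shutdownNow();
        }
    }

    public static void main(String[] args) {
        // Simulated master that is up but never finishes startup.
        Master stuck = () -> new CountDownLatch(1).await();
        // Simulated master that answers immediately.
        Master healthy = () -> {};
        System.out.println("stuck master answered:   " + heartbeatWithTimeout(stuck, 200));
        System.out.println("healthy master answered: " + heartbeatWithTimeout(healthy, 200));
    }
}
```

When heartbeatWithTimeout returns false, a caller could look up the active
master again rather than retrying the same address; how HBase itself handles
this is not shown here.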
