[
https://issues.apache.org/jira/browse/HBASE-3142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12929827#action_12929827
]
Jonathan Gray commented on HBASE-3142:
--------------------------------------
I think another master became the active one, so they all blocked here
indefinitely (he never finished startup).
> If a master dies and comes back up before his znode expires, the RS heartbeat
> can lock up
> -----------------------------------------------------------------------------------------
>
> Key: HBASE-3142
> URL: https://issues.apache.org/jira/browse/HBASE-3142
> Project: HBase
> Issue Type: Bug
> Components: master, regionserver
> Affects Versions: 0.89.20100924, 0.90.0
> Reporter: Jonathan Gray
> Assignee: ryan rawson
> Priority: Critical
> Fix For: 0.90.0
>
>
> During a rolling restart, we ran into a case where a master was shutdown and
> then brought back up before the znode expired.
> On the RS side, while the master was down, it was getting ConnectionRefused
> exceptions trying to heartbeat to what it thinks is the active master.
> Once the master process comes back up, the next heartbeat done by all the RSs
> just blocks indefinitely.
> This is somewhat related to HBASE-3141
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.