If a master dies and comes back up before his znode expires, the RS heartbeat
can lock up
-----------------------------------------------------------------------------------------
Key: HBASE-3142
URL: https://issues.apache.org/jira/browse/HBASE-3142
Project: HBase
Issue Type: Bug
Components: master, regionserver
Affects Versions: 0.89.20100924, 0.90.0
Reporter: Jonathan Gray
Priority: Critical
Fix For: 0.90.0
During a rolling restart, we ran into a case where a master was shutdown and
then brought back up before the znode expired.
On the RS side, while the master was down, it was getting ConnectionRefused
exceptions trying to heartbeat to what it thinks is the active master.
Once the master process comes back up, the next heartbeat done by all the RSs
just blocks indefinitely.
This is somewhat related to HBASE-3141
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.