[
https://issues.apache.org/jira/browse/HBASE-3801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13022617#comment-13022617
]
Aaron Guo commented on HBASE-3801:
----------------------------------
The backup didn't register a listener to ZK, until this code:
this.activeMasterManager.blockUntilBecomingActiveMaster();so it can't get a
notify when the primary master crash.
> Backup Master blocked when the HMaster Node Fail.
> -------------------------------------------------
>
> Key: HBASE-3801
> URL: https://issues.apache.org/jira/browse/HBASE-3801
> Project: HBase
> Issue Type: Bug
> Components: master
> Affects Versions: 0.90.2
> Environment: 1 HMaster
> 1 HMaster -backup
> 6 HResignServer
> Reporter: Aaron Guo
>
> When the HMaster crash, the Backup HMaster blocked for waiting the ZK notify.
> The Backup HMaster's thread stack is :
> "master-hp1:60000" prio=10 tid=0x00000000484c6800 nid=0x4b56 waiting on
> condition [0x0000000040209000]
> java.lang.Thread.State: TIMED_WAITING (sleeping)
> at java.lang.Thread.sleep(Native Method)
> at
> org.apache.hadoop.hbase.master.HMaster.stallIfBackupMaster(HMaster.java:251)
> at org.apache.hadoop.hbase.master.HMaster.run(HMaster.java:279)
> Locked ownable synchronizers:
> - None
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira