[
https://issues.apache.org/jira/browse/HBASE-9918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13819473#comment-13819473
]
stack commented on HBASE-9918:
------------------------------
[~jeffreyz] Probably good to have this fix in 0.96 w/ the absolute removal you
suggest happening out in 0.98.
> MasterAddressTracker & ZKNamespaceManager ZK listeners are missed after
> master recovery
> ---------------------------------------------------------------------------------------
>
> Key: HBASE-9918
> URL: https://issues.apache.org/jira/browse/HBASE-9918
> Project: HBase
> Issue Type: Bug
> Reporter: Jeffrey Zhong
> Assignee: Jeffrey Zhong
> Attachments: HBase-9918.patch, hbase-9918.v1.patch
>
>
> TestZooKeeper#testRegionAssignmentAfterMasterRecoveryDueToZKExpiry always
> failed at the following verification for me in my dev env(you have to run the
> single test not the whole TestZooKeeper suite to reproduce)
> {code}
> assertEquals("Number of rows should be equal to number of puts.",
> numberOfPuts, numberOfRows);
> {code}
> We missed two ZK listeners after master recovery MasterAddressTracker &
> ZKNamespaceManager.
> My current patch is to fix the JIRA issue while I'm wondering if we should
> totally remove the master failover implementation when ZK session expired
> because this causes reinitialize HMaster partially which is error prone and
> not a clean state to start from.
>
--
This message was sent by Atlassian JIRA
(v6.1#6144)