[
https://issues.apache.org/jira/browse/HBASE-9451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13760115#comment-13760115
]
Nicolas Liochon commented on HBASE-9451:
----------------------------------------
bq. I am trying to understand why it's hardcoded to 'false' for former case.
It's because if we don't have the status, then we don't know, so we consider
the server is up.
> Meta remains unassigned when the meta server crashes with the
> ClusterStatusListener set
> ---------------------------------------------------------------------------------------
>
> Key: HBASE-9451
> URL: https://issues.apache.org/jira/browse/HBASE-9451
> Project: HBase
> Issue Type: Bug
> Reporter: Devaraj Das
>
> While running tests described in HBASE-9338, ran into this problem. The
> hbase.status.listener.class was set to
> org.apache.hadoop.hbase.client.ClusterStatusListener$MultiCastListener.
> 1. I had the meta server coming down
> 2. The metaSSH got triggered. The call chain:
> 2.1 verifyAndAssignMetaWithRetries
> 2.2 verifyMetaRegionLocation
> 2.3 waitForMetaServerConnection
> 2.4 getMetaServerConnection
> 2.5 getCachedConnection
> 2.6 HConnectionManager.getAdmin(serverName, false)
> 2.7 isDeadServer(serverName) -> This is hardcoded to return 'false' when
> the clusterStatusListener field is null. If clusterStatusListener is not null
> (in my test), then it could return true in certain cases (and in this case,
> indeed it should return true since the server is down). I am trying to
> understand why it's hardcoded to 'false' for former case.
> 3. When isDeadServer returns true, the method
> HConnectionManager.getAdmin(ServerName, boolean) throws
> RegionServerStoppedException.
> 4. Finally, after the retries are over verifyAndAssignMetaWithRetries gives
> up and the master aborts.
> The methods in the above call chain don't handle
> RegionServerStoppedException. Maybe something to look at...
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira