[ 
https://issues.apache.org/jira/browse/HBASE-2971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-2971.
--------------------------

    Resolution: Duplicate

Marking as fixed by HBASE-3062.  The more general issue of connection loss at 
any time is HBASE-3065.

> On cluster startup, master/rs connect to ZK before it's fully ready causing a 
> ConnectionLossException
> -----------------------------------------------------------------------------------------------------
>
>                 Key: HBASE-2971
>                 URL: https://issues.apache.org/jira/browse/HBASE-2971
>             Project: HBase
>          Issue Type: Bug
>          Components: zookeeper
>    Affects Versions: 0.90.0
>            Reporter: Jonathan Gray
>            Assignee: Jonathan Gray
>             Fix For: 0.90.0
>
>
> There is a race condition that has existed but has been glossed over to this 
> point (because of our "loose" zk usage).
> The ZK server process can be in a state where it will accept the socket 
> connection from our client in master or RS but if we do anything against the 
> server, we get a ConnectionLossException.  The ZK client handles this 
> automagically and reconnects properly, as long as we are not aborting when we 
> get this exception.
> So this works on the last 0.89 and even with the master rewrite, but as we 
> move towards strict usage of ZK, we should wait for ZK availability before 
> proceeding with startup.
> I already have a patch in a local branch and it's working.  Will put up a 
> patch soon against new master.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to