[ 
https://issues.apache.org/jira/browse/HBASE-16144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell updated HBASE-16144:
-----------------------------------
    Fix Version/s:     (was: 0.98.21)

I had to revert this from 0.98 because it lead to repeatable failures of 
TestZooKeeper#testLogSplittingAfterMasterRecoveryDueToZKExpiry

Thanks [~ted_yu] for finding the problem

> Replication queue's lock will live forever if RS acquiring the lock has died 
> prematurely
> ----------------------------------------------------------------------------------------
>
>                 Key: HBASE-16144
>                 URL: https://issues.apache.org/jira/browse/HBASE-16144
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 2.0.0, 1.3.0, 1.4.0, 1.2.2, 0.98.20, 1.1.6
>            Reporter: Phil Yang
>            Assignee: Phil Yang
>             Fix For: 2.0.0, 1.3.0, 1.4.0, 1.1.6, 1.2.3
>
>         Attachments: HBASE-16144-0.98.v1.patch, 
> HBASE-16144-branch-1-v1.patch, HBASE-16144-branch-1-v2.patch, 
> HBASE-16144-branch-1.1-v1.patch, HBASE-16144-branch-1.1-v2.patch, 
> HBASE-16144-v1.patch, HBASE-16144-v2.patch, HBASE-16144-v3.patch, 
> HBASE-16144-v4.patch, HBASE-16144-v5.patch, HBASE-16144-v6.patch, 
> HBASE-16144-v6.patch
>
>
> In default, we will use multi operation when we claimQueues from ZK. But if 
> we set hbase.zookeeper.useMulti=false, we will add a lock first, then copy 
> nodes, finally clean old queue and the lock. 
> However, if the RS acquiring the lock crash before claimQueues done, the lock 
> will always be there and other RS can never claim the queue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to