Pierre Yin created ZOOKEEPER-3589:
-------------------------------------

             Summary: 3.4-branch has potential data inconsistency caused by 
ZOOKEEPER-3104
                 Key: ZOOKEEPER-3589
                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-3589
             Project: ZooKeeper
          Issue Type: Bug
          Components: server
    Affects Versions: 3.4.14, 3.4.13
            Reporter: Pierre Yin


ZOOKEEPER-3104 describes one critical data inconsistency risk.

The risk also exists in 3.4 branch.

In our 3.4.13 production cluster, the data inconsistency happens for many times.

After digging some transaction logs and snapshot, we believe that 
ZOOKEEPER-3104 is the main risk to contributes to our data inconsistency.

The risk probability maybe higher than we can consider in real product 
environment.  The serialization of big DataTree may leads to a big risk time 
window in the high write traffic situation. Any failure during the risk time 
window would cause the data inconsistency. 

The data inconsistency is almost unacceptable in zookeeper semantic.

This issue is already fixed in 3.6. But I think it is very necessary to 
backport ZOOKEEPER-3104 to branch-3.4, specially in the situation that the 
migration from 3.4 to 3.5 actually take more effort to evaluate the 
compatibility risk in real product environment.

I will have submit a github pull request to fix it. Can anyone help us to 
review it please ?

Many thanks.

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to