[ https://issues.apache.org/jira/browse/ZOOKEEPER-1879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13897577#comment-13897577 ]
Thawan Kooburat commented on ZOOKEEPER-1879: -------------------------------------------- We can contribute our "consistency checker" as a contrib module. It is essentially a program that read the entire data tree from 2 servers using normal client API and compare it. It has heuristic to ignore in-flight changes so it never report false positive. We use this program to make pair-wise comparison between servers in each production ensemble. > improve the correctness checking of txn log replay > -------------------------------------------------- > > Key: ZOOKEEPER-1879 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1879 > Project: ZooKeeper > Issue Type: Improvement > Components: server > Affects Versions: 3.4.6, 3.5.0 > Reporter: Patrick Hunt > Fix For: 3.4.7, 3.5.0 > > > In ZOOKEEPER-1573 we decided to fix an issue by relaxing some of the > checking. Specifically when the sequence of txns is as follows: > * zxid 1: create /prefix/a > * zxid 2: create /prefix/a/b > * zxid 3: delete /prefix/a/b > * zxid 4: delete /prefix/a > the log may fail to replay. > We addressed this by relaxing a check, which is essentially invalid for this > case, but is important in finding corruptions of the datastore. > We should add this check back with proper validation of correctness. -- This message was sent by Atlassian JIRA (v6.1.5#6160)