Thawan Kooburat created ZOOKEEPER-1624:
------------------------------------------

             Summary: PrepRequestProcessor abort multi-operation incorrectly
                 Key: ZOOKEEPER-1624
                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1624
             Project: ZooKeeper
          Issue Type: Bug
          Components: server
            Reporter: Thawan Kooburat
            Priority: Critical


We found this issue when trying to issue multiple instances of the following 
multi-op concurrently

multi {
1. create sequential node /a- 
2. create node /b
}

The expected result is that only the first multi-op request should success and 
the rest of request should fail because /b is already exist

However, the reported result is that the subsequence multi-op failed because of 
sequential node creation failed which is not possible.

Below is the return code for each sub-op when issuing 3 instances of the above 
multi-op asynchronously

1. ZOK, ZOK
2. ZOK, ZNODEEXISTS,
3. ZNODEEXISTS, ZRUNTIMEINCONSISTENCY,

When I added more debug log. The cause is that PrepRequestProcessor rollback 
outstandingChanges of the second multi-op incorrectly causing sequential node 
name generation to be incorrect. Below is the sequential node name generated by 
PrepRequestProcessor

1. create /a-0001
2. create /a-0003
3. create /a-0001

The bug is getPendingChanges() method. In failed to copied ChangeRecord for the 
parent node (/).  So rollbackPendingChanges() cannot restore the right previous 
change record of the parent node when aborting the second multi-op

The impact of this bug is that sequential node creation on the same parent node 
may fail until the previous one is committed. I am not sure if there is other 
implication or not.  


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to