tillrohrmann commented on issue #8046: [FLINK-12006][coordination] Ensure owned 
znode deleted on ZooKeeperHaServices#deleteOwnedZNode
URL: https://github.com/apache/flink/pull/8046#issuecomment-477210281
 
 
   I think I might have an idea where the problem comes from: The underlying 
problem is that we have an ongoing background operation originating from a 
`NodeCache` which makes sure that all parent nodes are created. I think the 
following can happen: The `NodeCache` inserts the background task into the 
`CuratorFrameworkImpl's` `Executor` but it is not executed. Next the 
`NodeCache` is stopped (as part of stopping the owning 
`LeaderRetrievalService`). Then `ZooKeeperHaServices#closeAndCleanupAllData` is 
called. This call will remove all created `zNodes`. Right after removing all 
zNodes as part of the `#deleteOwnedZNode` call, the background task is started. 
The background task will then recreate the parent nodes which will lead to the 
test failure.
   
   I think Tison's fix won't fully solve the problem, because we actually 
deleted all zNodes at some point in time.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to