tillrohrmann commented on issue #8046: [FLINK-12006][coordination] Ensure owned znode deleted on ZooKeeperHaServices#deleteOwnedZNode URL: https://github.com/apache/flink/pull/8046#issuecomment-477210281 I think I might have an idea where the problem comes from: The underlying problem is that we have an ongoing background operation originating from a `NodeCache` which makes sure that all parent nodes are created. I think the following can happen: The `NodeCache` inserts the background task into the `CuratorFrameworkImpl's` `Executor` but it is not executed. Next the `NodeCache` is stopped (as part of stopping the owning `LeaderRetrievalService`). Then `ZooKeeperHaServices#closeAndCleanupAllData` is called. This call will remove all created `zNodes`. Right after removing all zNodes as part of the `#deleteOwnedZNode` call, the background task is started. The background task will then recreate the parent nodes which will lead to the test failure. I think Tison's fix won't fully solve the problem, because we actually deleted all zNodes at some point in time.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
