Zhen Zhang created HELIX-527:
--------------------------------

             Summary: Mitigate zookeeper watch leak
                 Key: HELIX-527
                 URL: https://issues.apache.org/jira/browse/HELIX-527
             Project: Apache Helix
          Issue Type: Bug
            Reporter: Zhen Zhang


On investigating zookeeper watch leakage problem, it turns out to be a 
zookeeper issue:
https://issues.apache.org/jira/browse/ZOOKEEPER-442
For zookeeper before 3.5.0, we can't remove watches that are no longer of 
interests. The only way to remove a watch is to trigger it; that is, if it is a 
DataWatch, we need to trigger a data change on the watching path, or if it is a 
ChildWatch, we need to trigger a child change on the watching path. 
Unfortunately, if we are watching a path that has been deleted, unless we 
re-create the path, there is no way we can remove the watch.
Here are some of the most common scenarios where we will have dead zookeeper 
watches on zookeeper server side even though we unregister all the listeners on 
the zookeeper client side:
- When we drop a resource group from a cluster, we may have dead watches on 
ideal-state, participant current-state, and external-view
- When we remove an instance from a cluster, we may have dead watches on 
current-state, participant-config, and participant messages
- When we use property store with caches enabled by zookeeper watches, we may 
have dead watches on all removed paths





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to