Aeham Abushwashi created CONNECTORS-1123:
--------------------------------------------

             Summary: ZK node leak
                 Key: CONNECTORS-1123
                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1123
             Project: ManifoldCF
          Issue Type: Bug
          Components: Framework core
    Affects Versions: ManifoldCF 1.8
         Environment: 4-node manifold cluster, 3-node ZK ensemble for 
coordination and configuration management
            Reporter: Aeham Abushwashi


Looking at the stats of the zookeeper cluster, I was struck by the very high 
node count reported by the ZK stat command  which shows there being just over 
3.84 MILLION nodes. The number keeps rising as long as the manifold nodes are 
running. Stopping manifold does NOT reduce the number significantly, nor does 
restarting the ZK ensemble.

The ZK ensemble was initialised around 20 days ago. Manifold has been running 
on and off on this cluster since that time.

The flat nature of the manifold node structure in ZK (at least in the dev_1x 
branch) makes it difficult to identify node names but after tweaking the 
jute.maxbuffer parameter on the client, I was able to get a list of all nodes. 
There's a huge number of nodes with the name pattern 
org.apache.manifoldcf.locks-<Output Connection>:<Hash>. 
I could see using this node name pattern used in 
IncrementalIngester#documentDeleteMultiple and 
IncrementalIngester#documentRemoveMultiple. However, I'm not expecting any 
deletions in the tests I've been running recently - perhaps this is part of the 
duplicate deletion logic which came up in an email thread earlier today? or 
maybe there's another code path I missed entirely and which creates nodes with 
names like the above.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to