Hi, I am wondering if there is a concise list of all of the uses of Zookeeper throughout Storm. I had thought that the only use was for supervisor discovery, node assignments, and the like, but I am running into issues with Zookeeper dying when I launch a new topology that is slow to start up, and generates moderate load during startup.
This is baffling to me, because I had thought that workers did not communicate with Zookeeper at all, and that was all contained to the appropriate supervisor. Behavior is essentially after submission and launching of the job, during the prepare method, all zookeeper connections begin to fail/reconnect, causing thrashing amongst Storm (reassigning what it thought were dead nodes), even among separate topologies. Eventually, it works itself out, and deploys successfully. It does seem to actually kill zookeeper in the process though. Any thoughts? Everything works normally with any other topology that I have tried to deploy to the same system.
