We have recently upgraded to storm 0.9.4 from storm 0.8.3
We have seen a peculiar behavior when we run a security scan in our
environment. [This is a qualys scan]
1. The workers on a particular host is killed by the nimbus or the
supervisor.
a. supervisor kills it because of a worker heartbeat timeout.
b. Nimbus sees the tasks as not alive and issues a rebalance.
2. I have tried playing around the following parameters
a. storm.messaging.netty.server_worker_threads:24 [so the tcp scan are
stiffled on the threads available for processing]
b. supervisor.worker.timeout.secs: 120 [because i saw some worker
heartbeat timeout]
c. nimbus.task.timeout.secs: 30 [nimbus seemed to have some tasks
marked as dead.]
3. This is reproducible almost all the time.
Is there anybody else out there who has faced this issue and if so whats
the workaround ?
Let me know if any other details are required.
Thanks,
Amit.