We have recently upgraded to storm 0.9.4 from storm 0.8.3

We have seen a peculiar behavior when we run a security scan in our
environment. [This is a qualys scan]

1. The workers on a particular host is killed by the nimbus or the
supervisor.
a. supervisor kills it because of a worker heartbeat timeout.
b. Nimbus sees the tasks as not alive and issues a rebalance.

2. I have tried playing around the following parameters
a. storm.messaging.netty.server_worker_threads:24 [so the tcp scan are
stiffled on the threads available for processing]
        b. supervisor.worker.timeout.secs: 120 [because i saw some worker
heartbeat timeout]
        c. nimbus.task.timeout.secs: 30 [nimbus seemed to have some tasks
marked as dead.]

3. This is reproducible almost all the time.


Is there anybody else out there who has faced this issue and if so whats
the workaround ?

Let me know if any other details are required.

Thanks,
Amit.

Reply via email to