Github user revans2 commented on a diff in the pull request: https://github.com/apache/storm/pull/2433#discussion_r172934007 --- Diff: conf/defaults.yaml --- @@ -182,7 +195,7 @@ worker.log.level.reset.poll.secs: 30 # control how many worker receiver threads we need per worker topology.worker.receiver.thread.count: 1 -task.heartbeat.frequency.secs: 3 +task.heartbeat.frequency.secs: 60 --- End diff -- This is a bit scary. Why is the default now 20 times bigger than it was before? If this is just for scale we should drop it back to the default and write up documentation on how to scale a cluster much larger. Even in that case I don't feel comfortable with waiting 10+ mins for a worker to time out in the worst case.
---