[
https://issues.apache.org/jira/browse/STORM-233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13998773#comment-13998773
]
ASF GitHub Bot commented on STORM-233:
--------------------------------------
Github user revans2 commented on the pull request:
https://github.com/apache/incubator-storm/pull/98#issuecomment-43210740
After ```_ (heartbeat-fn)``` the supervisor gives the worker 5 seconds, by
default, to heartbeat in again or it thinks it has died. If ZK is taking a
long time, which it can do under heavy load, the supervisor shoots the worker.
Without the call the supervisor gives the worker 120 seconds by default to
heartbeat in the first time.
> Avoid worker killed on startup because of heavy ZK load
> -------------------------------------------------------
>
> Key: STORM-233
> URL: https://issues.apache.org/jira/browse/STORM-233
> Project: Apache Storm (Incubating)
> Issue Type: Sub-task
> Reporter: Robert Joseph Evans
>
> When under a very heavy load ZK can be very slow. If it takes longer than 3
> seconds for a worker to heatbeat in through ZK the first time, this can cause
> the supervisor to time out the worker process and shoot it before it comes up
> fully.
--
This message was sent by Atlassian JIRA
(v6.2#6252)