[ https://issues.apache.org/jira/browse/STORM-532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14172053#comment-14172053 ]
ASF GitHub Bot commented on STORM-532: -------------------------------------- GitHub user caofangkun opened a pull request: https://github.com/apache/storm/pull/293 STORM-532,Supervisor should restart worker immediately, if the worker pr... https://issues.apache.org/jira/browse/STORM-532 For now if the worker process does not exist any more Supervisor will have to wait a few seconds for worker heartbeart timeout and restart worker . If supervisor knows the worker processid and check if the process exists in the sync-processes thread ,may need less time to restart worker. 1: record worker process id in the worker local heartbeart 2: in supervisor sync-processes ,get process id from worker local heartbeat and check if the process exits 3: if not restart it immediately You can merge this pull request into a Git repository by running: $ git pull https://github.com/caofangkun/incubator-storm master Alternatively you can review and apply these changes as the patch at: https://github.com/apache/storm/pull/293.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #293 ---- commit ab9ae75fe3f3bbf070d05b5b165af04152a5933d Author: caokun <caokun@caokun-virtualbox.(none)> Date: 2014-10-15T01:12:11Z STORM-532,Supervisor should restart worker immediately, if the worker process does not exist any more ---- > Supervisor should restart worker immediately, if the worker process does not > exist any more > -------------------------------------------------------------------------------------------- > > Key: STORM-532 > URL: https://issues.apache.org/jira/browse/STORM-532 > Project: Apache Storm > Issue Type: Improvement > Affects Versions: 0.10.0 > Reporter: caofangkun > Priority: Minor > > For now > if the worker process does not exist any more > Supervisor will have to wait a few seconds for worker heartbeart timeout and > restart worker . > If supervisor knows the worker processid and check if the process exists in > the sync-processes thread ,may need less time to restart worker. > 1: record worker process id in the worker local heartbeart > 2: in supervisor sync-processes ,get process id from worker local heartbeat > and check if the process exits > 3: if not restart it immediately -- This message was sent by Atlassian JIRA (v6.3.4#6332)