[ https://issues.apache.org/jira/browse/FLINK-4449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15436442#comment-15436442 ]
ASF GitHub Bot commented on FLINK-4449: --------------------------------------- GitHub user beyond1920 opened a pull request: https://github.com/apache/flink/pull/2417 [FLINK-4449] [cluster management] heartbeat receiveing endpoint monitor the heartbeat from sender receiving end monitor the sender so that if the heartbeat request is not delivered, then receiving end could mark sending end as dead. You can merge this pull request into a Git repository by running: $ git pull https://github.com/alibaba/flink jira-4449-1 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/flink/pull/2417.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #2417 ---- commit 5daba076242e2ffe010cc8c17969369e0a02e927 Author: beyond1920 <beyond1...@126.com> Date: 2016-08-25T07:48:28Z on the receiving end, monitor the heartbeat from sender in case it cannot receive heartbeat from a long time which leads to mark it as failed ---- > Heartbeat Manager between ResourceManager and TaskExecutor > ---------------------------------------------------------- > > Key: FLINK-4449 > URL: https://issues.apache.org/jira/browse/FLINK-4449 > Project: Flink > Issue Type: Sub-task > Components: Cluster Management > Reporter: zhangjing > Assignee: zhangjing > > HeartbeatManager is responsible for heartbeat between resourceManager to > TaskExecutor > 1. Register taskExecutors > register heartbeat targets. If the heartbeat response for these targets is > not reported in time, mark target failed and notify resourceManager > 2. trigger heartbeat > trigger heartbeat from resourceManager to TaskExecutor periodically > taskExecutor report slot allocation in the heartbeat response > ResourceManager sync self slot allocation with the heartbeat response -- This message was sent by Atlassian JIRA (v6.3.4#6332)