[ https://issues.apache.org/jira/browse/APEXCORE-426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15638187#comment-15638187 ]
ASF GitHub Bot commented on APEXCORE-426: ----------------------------------------- GitHub user sandeshh opened a pull request: https://github.com/apache/apex-core/pull/416 *Review only* APEXCORE-426 Reuse the running container, when the Stram restarts. @tweise & @vrozov please review If you guys are fine with the approach, I will work on unit tests. You can merge this pull request into a Git repository by running: $ git pull https://github.com/sandeshh/apex-core APEXCORE-426 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/apex-core/pull/416.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #416 ---- ---- > Support work preserving AM recovery > ----------------------------------- > > Key: APEXCORE-426 > URL: https://issues.apache.org/jira/browse/APEXCORE-426 > Project: Apache Apex Core > Issue Type: Improvement > Reporter: Thomas Weise > Assignee: Sandesh > Labels: apex-hadoop-version > > On app master failure, the streaming containers should continue running. > As of 2.2, YARN will automatically terminate all containers and the > replacement app master will relaunch them. Once we move to a newer minimum > Hadoop version, we should leverage work preserving restart. > The mechanism in Apex containers to locate the new master process are already > in place. > -- This message was sent by Atlassian JIRA (v6.3.4#6332)