[ 
https://issues.apache.org/jira/browse/FLINK-2079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Metzger resolved FLINK-2079.
-----------------------------------
       Resolution: Fixed
    Fix Version/s: 0.9

Resolved in http://git-wip-us.apache.org/repos/asf/flink/commit/11b021b0

> Add watcher to YARN TM containers to detect stopped actor system
> ----------------------------------------------------------------
>
>                 Key: FLINK-2079
>                 URL: https://issues.apache.org/jira/browse/FLINK-2079
>             Project: Flink
>          Issue Type: Improvement
>          Components: TaskManager, YARN Client
>    Affects Versions: 0.9
>            Reporter: Robert Metzger
>            Assignee: Robert Metzger
>             Fix For: 0.9
>
>
> I experienced an OutOfMemoryError (caused by the usercode) while running 
> Flink on YARN.
> It seems that the TaskManager is correctly detecting the fatal error, however 
> the JVM is not shutting down, so YARN won't bring up new containers.
> Therefore, I want to start a thread on the YarnTaskManagerRunner which 
> periodically (every 30 seconds) checks whether the actor system is still 
> running. If not, its doing a System.exit(1).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to