[
https://issues.apache.org/jira/browse/FLINK-5107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15683092#comment-15683092
]
ASF GitHub Bot commented on FLINK-5107:
---------------------------------------
GitHub user StefanRRichter opened a pull request:
https://github.com/apache/flink/pull/2837
[FLINK-5107] Introduced limit for prior execution attempt history
This PR addresses the problem of JobManager going out of memory for a large
history of prior execution attempts by pruning the history in FIFO fashion,
only keeping a limited history size.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/StefanRRichter/flink limit-prior-executions
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/flink/pull/2837.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #2837
----
commit f9053060fe4d396e9a8917ce8d422b0ebbcc044f
Author: Stefan Richter <[email protected]>
Date: 2016-11-18T18:07:56Z
[FLINK-5107] Introduced limit for prior execution attempt history
----
> Job Manager goes out of memory from long history of prior execution attempts
> ----------------------------------------------------------------------------
>
> Key: FLINK-5107
> URL: https://issues.apache.org/jira/browse/FLINK-5107
> Project: Flink
> Issue Type: Bug
> Components: JobManager
> Reporter: Stefan Richter
> Assignee: Stefan Richter
>
> We have observed that the job manager can run out of memory during long
> running jobs with many vertexes. Analysis of the heap dump shows, that the
> ever-growing history of prior execution attempts is the culprit for this
> problem.
> We should limit this history to a number of n most recent attempts.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)