Shammon created FLINK-25586:
-------------------------------

             Summary: ExecutionGraphInfoStore in session cluster should split failed and successful jobs
                 Key: FLINK-25586
                 URL: https://issues.apache.org/jira/browse/FLINK-25586
             Project: Flink
          Issue Type: Sub-task
          Components: Runtime / Coordination
    Affects Versions: 1.14.2, 1.13.5, 1.12.7
            Reporter: Shammon

In a Flink session cluster, jobs are stored in `FileExecutionGraphInfoStore`. When the number of jobs in it reaches `jobstore.cache-size`, or the time a job has been stored reaches `jobstore.expiration-time`, the affected jobs are removed. We can't hold too many jobs for performance reasons, but we should hold failed jobs for a longer time so that the cause of failure can be traced. So it would be better to split failed and successful jobs in `FileExecutionGraphInfoStore` and support an independent max-capacity for each.

--
This message was sent by Atlassian Jira
(v8.20.1#820001)
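
To illustrate the proposal, here is a minimal sketch of a store that keeps failed and successful jobs in separate bounded maps, each with its own capacity. It is not Flink code: the class `SplitJobStore`, its methods, and the `maxSucceeded` / `maxFailed` parameters are hypothetical names chosen for this example, and job information is reduced to a plain string. Eviction here is simply oldest-first by insertion order, which may differ from how the real store expires entries.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical sketch only; not Flink's FileExecutionGraphInfoStore. */
class SplitJobStore {

    /** Insertion-ordered map that drops its oldest entry once it grows past maxSize. */
    private static final class BoundedMap extends LinkedHashMap<String, String> {
        private final int maxSize;

        BoundedMap(int maxSize) {
            this.maxSize = maxSize;
        }

        @Override
        protected boolean removeEldestEntry(Map.Entry<String, String> eldest) {
            // Called by LinkedHashMap after each insertion.
            return size() > maxSize;
        }
    }

    private final BoundedMap succeeded;
    private final BoundedMap failed;

    /** maxSucceeded and maxFailed are assumed, independent capacities. */
    SplitJobStore(int maxSucceeded, int maxFailed) {
        this.succeeded = new BoundedMap(maxSucceeded);
        this.failed = new BoundedMap(maxFailed);
    }

    /** Stores the job in the failed or the successful partition. */
    void put(String jobId, String info, boolean jobFailed) {
        (jobFailed ? failed : succeeded).put(jobId, info);
    }

    /** Looks the job up in both partitions; returns null if it was evicted or never stored. */
    String get(String jobId) {
        String info = failed.get(jobId);
        return info != null ? info : succeeded.get(jobId);
    }

    /** Total number of jobs currently held across both partitions. */
    int size() {
        return succeeded.size() + failed.size();
    }
}
```

With, say, `new SplitJobStore(2, 10)`, a burst of successful jobs can only push out other successful jobs; a failed job stays available until the failed partition itself fills up, which is the behaviour the issue asks for.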