[
https://issues.apache.org/jira/browse/FLINK-20833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17258774#comment-17258774
]
Till Rohrmann commented on FLINK-20833:
---------------------------------------
Thanks for creating this ticket [~ZhenqiuHuang]. I like the idea in general.
Before starting this effort, I think we need a bit more concrete proposal how
to exactly do it and where to place it. I would suggest to not add it directly
to the {{ExecutionGraph}} since this structure is already too overloaded with
responsibilities. A starting pointer could be the {{ExecutionFailureHandler}}
which is responsible for handling execution failures.
> Expose pluggable interface for exception analysis and metrics reporting in
> Execution Graph
> -------------------------------------------------------------------------------------------
>
> Key: FLINK-20833
> URL: https://issues.apache.org/jira/browse/FLINK-20833
> Project: Flink
> Issue Type: Improvement
> Components: Runtime / Coordination
> Affects Versions: 1.12.0
> Reporter: Zhenqiu Huang
> Priority: Minor
>
> For platform users of Apache flink, people usually want to classify the
> failure reason( for example user code, networking, dependencies and etc) for
> Flink jobs and emit metrics for those analyzed results. So that platform can
> provide an accurate value for system reliability by distinguishing the
> failure due to user logic from the system issues.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)