[
https://issues.apache.org/jira/browse/FLINK-17672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Stephan Ewen updated FLINK-17672:
---------------------------------
Summary: OperatorCoordinators receive failure notifications on task
failure instead of on task restarts (was: OperatorCoordinators receive
failure notifications on task failurenstead of restarts)
> OperatorCoordinators receive failure notifications on task failure instead
> of on task restarts
> -----------------------------------------------------------------------------------------------
>
> Key: FLINK-17672
> URL: https://issues.apache.org/jira/browse/FLINK-17672
> Project: Flink
> Issue Type: Sub-task
> Components: Runtime / Coordination
> Reporter: Stephan Ewen
> Assignee: Stephan Ewen
> Priority: Critical
> Fix For: 1.11.0
>
>
> Currently, the OperatorCoordinators receive failure notifications on task
> restart. That follows the same approach as the InputSplit assigners from the
> legacy sources (after which the integration of the Coordinators with the
> Scheduler was modeled).
> However, propagating the failure notifications during the actual failure is
> more intuitive, and also improve situations where tasks fail but don't get
> restarted for a while (this can happen for batch tasks when a TM dies and no
> spare resources are available). In those cases, the coordinator can react
> much earlier to the failure.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)