[jira] [Updated] (FLINK-17672) OperatorCoordinators receive failure notifications on task failure instead of on task restarts

Stephan Ewen (Jira) Sat, 16 May 2020 09:36:25 -0700


     [ 
https://issues.apache.org/jira/browse/FLINK-17672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Stephan Ewen updated FLINK-17672:
---------------------------------
    Summary:  OperatorCoordinators receive failure notifications on task 
failure instead of on task restarts  (was:  OperatorCoordinators receive 
failure notifications on task failurenstead of restarts)

>  OperatorCoordinators receive failure notifications on task failure instead 
> of on task restarts
> -----------------------------------------------------------------------------------------------
>
>                 Key: FLINK-17672
>                 URL: https://issues.apache.org/jira/browse/FLINK-17672
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Runtime / Coordination
>            Reporter: Stephan Ewen
>            Assignee: Stephan Ewen
>            Priority: Critical
>             Fix For: 1.11.0
>
>
>  Currently, the OperatorCoordinators receive failure notifications on task 
> restart. That follows the same approach as the InputSplit assigners from the 
> legacy sources (after which the integration of the Coordinators with the 
> Scheduler was modeled).
> However, propagating the failure notifications during the actual failure is 
> more intuitive, and also improve situations where tasks fail but don't get 
> restarted for a while (this can happen for batch tasks when a TM dies and no 
> spare resources are available). In those cases, the coordinator can react 
> much earlier to the failure.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Updated] (FLINK-17672) OperatorCoordinators receive failure notifications on task failure instead of on task restarts

Reply via email to