[jira] [Commented] (FLINK-13169) IT test for fine-grained recovery (task executor failures)

Till Rohrmann (JIRA) Mon, 15 Jul 2019 19:13:22 -0700


    [ 
https://issues.apache.org/jira/browse/FLINK-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16885778#comment-16885778
 ]


Till Rohrmann commented on FLINK-13169:
---------------------------------------

Made this a critical issue for 1.9.0. I would like to get this included to 
ensure that fine grained recovery works in 1.9.0. cc [~tzulitai] [~ykt836].

> IT test for fine-grained recovery (task executor failures)
> ----------------------------------------------------------
>
>                 Key: FLINK-13169
>                 URL: https://issues.apache.org/jira/browse/FLINK-13169
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Runtime / Coordination
>            Reporter: Andrey Zagrebin
>            Assignee: Andrey Zagrebin
>            Priority: Critical
>              Labels: pull-request-available
>             Fix For: 1.9.0
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> The BatchFineGrainedRecoveryITCase can be extended with an additional test 
> failure strategy which abruptly shuts down the task executor. This leads to 
> the loss of all previously completed and the in-progress mapper result 
> partitions. The fail-over strategy should restart the current in-progress 
> mapper which will get the PartitionNotFoundException because the previous 
> result becomes unavailable and the previous mapper has to be restarted as 
> well. The same should happen subsequently with all previous mappers. When the 
> source is recomputed, all mappers has to be restarted again to recalculate 
> their lost results.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

[jira] [Commented] (FLINK-13169) IT test for fine-grained recovery (task executor failures)

Reply via email to