[ 
https://issues.apache.org/jira/browse/FLINK-18238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17135695#comment-17135695
 ] 

Zhijiang commented on FLINK-18238:
----------------------------------

bq. RPC abort notifications were introduced in 1.11 but we've encountered this 
issue in 1.10, right? 

The motivation for introducing abort notification is for saving the efforts of 
clearing the invalid checkpoint, and it did in 1.11. After this improvement we 
prevented the barrier broadcasting to downstream side, so it encountered the 
deadlock issue now. This issue is not in 1.10.

> RemoteChannelThroughputBenchmark deadlocks
> ------------------------------------------
>
>                 Key: FLINK-18238
>                 URL: https://issues.apache.org/jira/browse/FLINK-18238
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Checkpointing
>    Affects Versions: 1.11.0
>            Reporter: Piotr Nowojski
>            Assignee: Yingjie Cao
>            Priority: Blocker
>             Fix For: 1.11.0
>
>         Attachments: consoleText_remote_benchmark_deadlock.txt
>
>
> In the last couple of days 
> {{RemoteChannelThroughputBenchmark.remoteRebalance}} deadlocked for the 
> second time:
> http://codespeed.dak8s.net:8080/job/flink-master-benchmarks/6019/



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to