[ 
https://issues.apache.org/jira/browse/FLINK-21821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Piotr Nowojski reopened FLINK-21821:
------------------------------------

I would keep it open, as we can still clean up the closing code and fix a race 
condition in disabling interrupts while closing resources. Currently this is 
guarded by a volatile variable, but there might be a race condition when:
1. interrupter thread first checked the {{shouldInterruptOnCancel}} flag
2. {{shouldInterruptOnCancel}} flag switched to false as Task/StreamTask 
entered cleaning up phase
3. interrupter issued an interrupt while Task/StreamTask are closing/releasing 
resources, potentially causing a memory leak

> Clean up Task/StreamTask cancellation and remove unnecessary concurrency 
> -------------------------------------------------------------------------
>
>                 Key: FLINK-21821
>                 URL: https://issues.apache.org/jira/browse/FLINK-21821
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Network, Runtime / Task
>    Affects Versions: 1.13.0
>            Reporter: Kezhu Wang
>            Priority: Major
>
> I think {{AbstractInvokable.cancel}} should only do flag things but not 
> closing staff. Otherwise task canceler and task thread will contend on these 
> closing tasks.
> It is same for {{BatchTask.cancel}} which might be confirmed as one potential 
> cause for FLINK-21728.
> cc  [~dwysakowicz] [~roman_khachatryan] [~pnowojski] [~AHeise] 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to