[
https://issues.apache.org/jira/browse/FLINK-13371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chesnay Schepler closed FLINK-13371.
------------------------------------
Resolution: Fixed
master: 7b95f32d01730bcc75ded42e41d3668a1802a69b
1.9: b5ab84c6238ff1f69f2151ed580410ae4c63acd7
> Release partitions in JM if producer restarts
> ---------------------------------------------
>
> Key: FLINK-13371
> URL: https://issues.apache.org/jira/browse/FLINK-13371
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Coordination, Runtime / Network
> Affects Versions: 1.9.0
> Reporter: Andrey Zagrebin
> Assignee: Chesnay Schepler
> Priority: Blocker
> Labels: pull-request-available
> Fix For: 1.9.0
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> As discussed in FLINK-13245, the producer may not detect any consumption
> attempt at all if the consumer fails before the connection is established.
> This means we cannot rely solely on the shuffle service to release
> partitions on consumption when a consumer fails. When the producer restarts,
> it leaks the partitions from its previous attempt. Previously, we had an
> explicit release call for this case in Execution.cancel/suspend. In short,
> when a producer restarts, the JM has to explicitly release all partitions
> produced by the previous task execution attempt, including
> `released on consumption` partitions. For this change, we might need to track
> all partitions in PartitionTrackerImpl.
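
A minimal sketch of the idea in the description, assuming a simplified
JM-side tracker keyed by producer attempt. The class SketchPartitionTracker
and its methods are hypothetical and are not the PartitionTrackerImpl API;
plain strings stand in for execution attempt ids and result partition ids,
and the `released` list stands in for the real release call to the shuffle
service.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical stand-in for a JM-side partition tracker (not Flink's API). */
class SketchPartitionTracker {

    /** Partitions per producer attempt, tracked regardless of release mode. */
    private final Map<String, List<String>> partitionsByAttempt = new HashMap<>();

    /** Stand-in for the shuffle service: records what was released. */
    private final List<String> released = new ArrayList<>();

    /** Track every partition, including "released on consumption" ones. */
    void startTracking(String producerAttemptId, String partitionId) {
        partitionsByAttempt
                .computeIfAbsent(producerAttemptId, k -> new ArrayList<>())
                .add(partitionId);
    }

    /** On producer restart: release everything the previous attempt produced. */
    List<String> releaseAllFor(String previousAttemptId) {
        List<String> toRelease = partitionsByAttempt.remove(previousAttemptId);
        if (toRelease == null) {
            return Collections.emptyList();
        }
        released.addAll(toRelease);
        return toRelease;
    }

    boolean isTracking(String attemptId) {
        return partitionsByAttempt.containsKey(attemptId);
    }

    List<String> releasedPartitions() {
        return Collections.unmodifiableList(released);
    }
}
```

In this sketch the release does not depend on the producer having seen a
consumer connect, which is the gap the description points out.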