[ https://issues.apache.org/jira/browse/FLINK-5836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15875234#comment-15875234 ]
Biao Liu edited comment on FLINK-5836 at 2/21/17 2:26 AM: ---------------------------------------------------------- [~wenlong.lwl] has already been working on it, reassign to him. was (Author: sleepy): [~wenlong.lwl] was already working on this, reassign to him. > Race condition between slot offering and task deployment > -------------------------------------------------------- > > Key: FLINK-5836 > URL: https://issues.apache.org/jira/browse/FLINK-5836 > Project: Flink > Issue Type: Bug > Components: Distributed Coordination > Affects Versions: 1.3.0 > Reporter: Till Rohrmann > Assignee: Wenlong Lyu > Labels: flip-6 > > The Flip-6 code has a race condition when offering slots to a {{JobManager}} > which directly deploys tasks to the offered slots. In such a situation it is > possible that the deploy call overtakes the acknowledge message for the slot > offering. As a result, the slots are not marked yet as active and the > deployment will fail. > I propose to fix this problem by first activating all offered slots before > sending the slot offer message to the {{JobManager}}. Consequently, we'll > deactivate and free slots which haven't been accepted by the {{JobManager}} > once we've received the offering acknowledge message. -- This message was sent by Atlassian JIRA (v6.3.15#6346)