[jira] [Commented] (FLINK-7231) SlotSharingGroups are not always released in time for new restarts

ASF GitHub Bot (JIRA) Thu, 20 Jul 2017 06:36:26 -0700

    [ 
https://issues.apache.org/jira/browse/FLINK-7231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16094693#comment-16094693
 ]


ASF GitHub Bot commented on FLINK-7231:
---------------------------------------

Github user StephanEwen commented on the issue:

    https://github.com/apache/flink/pull/4370
  
    Thanks for the review!
    Merging this...
    
    The commit by Niko was probably because github was slightly out of sync 
with the apache git repo and thought that commit was part of the diff...


> SlotSharingGroups are not always released in time for new restarts
> ------------------------------------------------------------------
>
>                 Key: FLINK-7231
>                 URL: https://issues.apache.org/jira/browse/FLINK-7231
>             Project: Flink
>          Issue Type: Bug
>          Components: Distributed Coordination
>    Affects Versions: 1.3.1
>            Reporter: Stephan Ewen
>            Assignee: Stephan Ewen
>             Fix For: 1.4.0, 1.3.2
>
>
> In the case where there are not enough resources to schedule the streaming 
> program, a race condition can lead to a sequence of the following errors:
> {code}
> java.lang.IllegalStateException: SlotSharingGroup cannot clear task 
> assignment, group still has allocated resources.
> {code}
> This eventually recovers, but may involve many fast restart attempts before 
> doing so.
> The root cause is that slots are not cleared before the next restart attempt.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

[jira] [Commented] (FLINK-7231) SlotSharingGroups are not always released in time for new restarts

Reply via email to