[ 
https://issues.apache.org/jira/browse/FLINK-6964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16061257#comment-16061257
 ] 

Cliff Resnick commented on FLINK-6964:
--------------------------------------

I ran with your newer precondition. It succeeded once, but the next two runs 
failed, hanging at org.apache.flink.runtime.state.SharedStateRegistry - 
Attempt to register for key WindowOperator...

I tried with just a single slot, but that also hung. The log above represents 
the hang condition.

All of the above is logged here:
https://gist.github.com/cresny/0e109f843730b64d3a330f8fb06bb8a6

The good news is there was an exception around the state registry.

> Fix recovery for incremental checkpoints in StandaloneCompletedCheckpointStore
> ------------------------------------------------------------------------------
>
>                 Key: FLINK-6964
>                 URL: https://issues.apache.org/jira/browse/FLINK-6964
>             Project: Flink
>          Issue Type: Bug
>          Components: State Backends, Checkpointing
>            Reporter: Stefan Richter
>            Assignee: Stefan Richter
>
> {{StandaloneCompletedCheckpointStore}} does not register shared states on 
> resume. However, for externalized checkpoints, it registers the checkpoint 
> from which it resumed. This checkpoint gets added to the completed checkpoint 
> store as part of resume.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
