[
https://issues.apache.org/jira/browse/FLINK-6964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16061257#comment-16061257
]
Cliff Resnick commented on FLINK-6964:
--------------------------------------
I ran with your newer precondition. It actually succeeded once, but the next
two runs failed, hanging at org.apache.flink.runtime.state.SharedStateRegistry -
Attempt to register for key WindowOperator...
I tried with just a single slot, but that also hung. The log above represents
the hang condition.
All of the above is logged here:
https://gist.github.com/cresny/0e109f843730b64d3a330f8fb06bb8a6
The good news is that there was an exception around the state registry.
> Fix recovery for incremental checkpoints in StandaloneCompletedCheckpointStore
> ------------------------------------------------------------------------------
>
> Key: FLINK-6964
> URL: https://issues.apache.org/jira/browse/FLINK-6964
> Project: Flink
> Issue Type: Bug
> Components: State Backends, Checkpointing
> Reporter: Stefan Richter
> Assignee: Stefan Richter
>
> {{StandaloneCompletedCheckpointStore}} does not register shared states on
> resume. However, for externalized checkpoints, it registers the checkpoint
> from which it resumed. This checkpoint gets added to the completed checkpoint
> store as part of resume.