[
https://issues.apache.org/jira/browse/FLINK-19359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17200177#comment-17200177
]
Arpith Prakash commented on FLINK-19359:
----------------------------------------
Yes I agree checkpoints may fail or in-progress which results in not getting
{{_metadata}} to be created . But if we have plenty of them missing it, then we
might loss all those saved information.
> Restore from Checkpoint fails if checkpoint folders is corrupt/partial
> ----------------------------------------------------------------------
>
> Key: FLINK-19359
> URL: https://issues.apache.org/jira/browse/FLINK-19359
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Checkpointing
> Affects Versions: 1.8.0
> Reporter: Arpith Prakash
> Priority: Major
> Attachments: Checkpoints.png
>
>
> I'm using Flink 1.8.0 version and have enabled externalized checkpoint to
> hdfs location, we have seen few scenarios where checkpoint folders will have
> checkpoint files but only missing "*_metadata*" file. If we attempt to
> restore application from this path, application fails with exception "Could
> not find *_metadata* file. There is similar discussion in Flink user mailing
> list with subject "Zookeeper connection loss causing checkpoint corruption"
> around it. I've attached sample snapshot on how folder structure looks as
> well.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)