Hi, Chen-che, I think it may be similar to FLINK-12381 <https://issues.apache.org/jira/browse/FLINK-12381>. You could adopt the suggestion like setting the job-id as you could see the comment below the ticket. I think you could also share your environment in this ticket to let us know more information.
On Mon, Apr 25, 2022 at 9:58 AM Chen-Che Huang <acmic...@gmail.com> wrote: > Hi all, > > We recently encountered a random issue. When our Flink application is > doing checkpoint creation, it occasionally fails because it thinks the > medatafile of the checkpoint already exists. However, the medata file does > not exist actually. We use Flink version 1.14.4 and the checkpoints are > stored at google cloud storage. Does anyone encounter the same problem? Any > comment is appreciated. > > Caused by: org.apache.hadoop.fs.FileAlreadyExistsException: > 'gs://bucket/flink-app/00000000000000000000000000000000/chk-5812/_metadata' > already exists > > Best wishes, > > Chen-Che Huang > >