Hi Swapnil,

I am not familiar with the StreamingFileSink; however, this sounds like a checkpointing issue to me. FileSink should keep its sink state, and remove from that state only the files that it *really successfully* sinks (you may want to add a validation step here against S3 to check file integrity). What is left in the state is then the failed files, partial files, etc.
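To make the idea concrete, here is a rough sketch of the bookkeeping I have in mind. It is not how StreamingFileSink is actually implemented; `PendingFileTracker` and `exists_in_store` are hypothetical names, and the store is simulated with an in-memory set. Against real S3, the existence check could be something like a boto3 `head_object` call, but that part is an assumption and is not shown here.

```python
from typing import Callable, Set


class PendingFileTracker:
    """Hypothetical sink-state bookkeeping (not Flink's real internals)."""

    def __init__(self, exists_in_store: Callable[[str], bool]) -> None:
        # exists_in_store is an assumed callback that verifies a file
        # is really present (and intact) in the target store.
        self._exists_in_store = exists_in_store
        self._pending: Set[str] = set()

    def add(self, path: str) -> None:
        # Record a file the sink has started writing.
        self._pending.add(path)

    def snapshot(self) -> Set[str]:
        # What would be saved in a checkpoint: files not yet confirmed.
        return set(self._pending)

    def confirm_committed(self) -> Set[str]:
        # Drop from the state only the files that verifiably landed.
        confirmed = {p for p in self._pending if self._exists_in_store(p)}
        self._pending -= confirmed
        return confirmed


# Simulated store: only one of the two part files actually arrived.
store = {"part-0-0"}
tracker = PendingFileTracker(lambda path: path in store)
tracker.add("part-0-0")
tracker.add("part-0-1")
confirmed = tracker.confirm_committed()
remaining = tracker.snapshot()
```

After `confirm_committed()`, only the verified file leaves the state; the unverified one stays in the snapshot, so a restore would know it still has to be dealt with.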
---
Oytun Tez

*M O T A W O R D*
The World's Fastest Human Translation Platform.
oy...@motaword.com
www.motaword.com

On Fri, Aug 16, 2019 at 6:02 PM Swapnil Kumar <swku...@zendesk.com> wrote:

> Hello, We are using Flink to process input events and aggregate and write
> o/p of our streaming job to S3 using StreamingFileSink but whenever we try
> to restore the job from a savepoint, the restoration fails with missing
> part files error. As per my understanding, s3 deletes those
> part(intermittent) files and can no longer be found on s3. Is there a
> workaround for this, so that we can use s3 as a sink?
>
> --
> Thanks,
> Swapnil Kumar