Re: File System State Backend

2017-09-08 Thread rnosworthy
Can I use disk on the job manager for this, or do I need a dedicated disk storage VM? How do I specify not only the directory but also the IP address of the checkpoint data directory? Are there any docs on configuring a state backend without using HDFS or S3? Thanks for your help. Ryan -- Sent from: http

Re: File System State Backend

2017-09-08 Thread Stephan Ewen
Hi! Checkpoints in Flink need to go to a file system that is accessible across machines. Otherwise there could be no recovery of the data of a failed machine. The cleanup is also triggered by a different node than the node that checkpoints, hence you see no cleanup in your setup. Best, Stephan O
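
To illustrate the point about a file system that is accessible across machines, here is a minimal sketch using only the Java standard library. It assumes the checkpoint location is given as a URI string and shows which part of that string carries the host. The `hdfs://10.0.0.5:8020/...` address and the class and method names are hypothetical, chosen only for illustration; `file:///var/data/flink` is the directory mentioned in this thread. A `file://` URI has no host component, so every machine resolves it against its own local disk unless the path is a shared mount.

```java
import java.net.URI;

class CheckpointUriSketch {

    /**
     * Returns true when the location names a host, i.e. it points at a
     * file system service rather than at a path on the local machine.
     */
    static boolean namesRemoteHost(String location) {
        URI uri = URI.create(location);
        return uri.getHost() != null && !"file".equalsIgnoreCase(uri.getScheme());
    }

    public static void main(String[] args) {
        // Directory from this thread: local to whichever machine opens it.
        String local = "file:///var/data/flink";
        // Hypothetical shared location: host and port are part of the URI.
        String shared = "hdfs://10.0.0.5:8020/flink/checkpoints";

        for (String location : new String[] {local, shared}) {
            URI uri = URI.create(location);
            System.out.println(location
                    + " -> scheme=" + uri.getScheme()
                    + ", host=" + uri.getHost()
                    + ", path=" + uri.getPath()
                    + ", namesRemoteHost=" + namesRemoteHost(location));
        }
    }
}
```

In a setup with separate job manager and task manager machines, a string of the second kind (or a `file://` path that is mounted identically on all machines) is what would be handed to the state backend.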

Re: File System State Backend

2017-09-08 Thread rnosworthy
Thanks for the response. That's correct, they do not get purged/deleted while the job is running. I have 3 concurrent jobs running and there are 3 directories in the data directory. /var/data/flink/2375c69006bfeca9644171f31b444dff /var/data/flink/c3264bb6d5e068d6440bbb21069b7d28 /var/data/flink/f8

Re: File System State Backend

2017-09-08 Thread Stefan Richter
Hi, I just tried out checkpointing with FsStateBackend in 1.3.2 and everything works as expected for me. Can you give a bit more detail on what you mean by „checkpoint data is not cleaning“? For example, is it not cleaned up while the job is running and accumulating „chk-[ID]“ directories or is somet
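
One way to answer the question about accumulating „chk-[ID]“ directories is to list them for a single job directory. The following is a rough sketch using only the Java standard library; it assumes the layout mentioned in this thread (one directory per job under the checkpoint directory, with `chk-*` subdirectories inside it). The class and method names are hypothetical.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

class CheckpointDirCount {

    /** Returns the names of the chk-* subdirectories directly under jobDir, sorted lexicographically. */
    static List<String> listCheckpointDirs(Path jobDir) throws IOException {
        List<String> names = new ArrayList<>();
        // The glob only matches entries whose file name starts with "chk-".
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(jobDir, "chk-*")) {
            for (Path entry : stream) {
                if (Files.isDirectory(entry)) {
                    names.add(entry.getFileName().toString());
                }
            }
        }
        Collections.sort(names);
        return names;
    }

    public static void main(String[] args) throws IOException {
        // Pass a job directory as the first argument; the default is one of the paths from this thread.
        Path jobDir = Paths.get(args.length > 0
                ? args[0]
                : "/var/data/flink/2375c69006bfeca9644171f31b444dff");
        if (!Files.isDirectory(jobDir)) {
            System.out.println("Not a directory: " + jobDir);
            return;
        }
        List<String> dirs = listCheckpointDirs(jobDir);
        System.out.println(dirs.size() + " checkpoint directories under " + jobDir + ": " + dirs);
    }
}
```

Running this a few times while the job is active would show whether the number of `chk-*` directories keeps growing on that machine.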

File System State Backend

2017-09-07 Thread rnosworthy
Flink 1.3.2. I have one VM for the job manager and another for the task manager. I have a custom windowing trigger shown below. My checkpoint data is not clearing. I have tried to inject a fileStateThresholdSize when instantiating the FsStateBackend object, but that didn't work. I have tried explicit