Hi Stephan,

Thank you for the brief explanation.

Yes I have already enabled Object Reuse mode because of which I see
significant improvement.

I am currently running on r3.4xlarge having 122GB memory, as you suggested I
had increased the checkpoint interval to 10minutes and minimum pause between
checkpoints was 5 minutes, here the complete processing was done in 8
minutes :) (before even a single checkpoint was triggered)

That's why I decreased the checkpoint interval to 3 minutes, but observed
that pipeline stops for a long amount of time for checkpoint, here the Kafka
source was taking the maximum time to acknowledge and complete the
checkpoints (4minutes timeout) , it failed for 3 consecutive time.

Can't we make Kafka do asynchronous checkpoints ? because I see consistent
failure of checkpoints for Kafka. I have not observed window checkpoints
getting failed as they are done asynchronously.

 



--
View this message in context: 
http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Re-Checkpointing-with-RocksDB-as-statebackend-tp11752p11879.html
Sent from the Apache Flink User Mailing List archive. mailing list archive at 
Nabble.com.

Reply via email to