danny0405 commented on PR #10312:
URL: https://github.com/apache/hudi/pull/10312#issuecomment-1853186628

   > @danny0405 @cuibo01 Read through the JIRA ticket. While I understand how 
the state of the TM and JM can cause the potential data loss, I am still not 
very sure how the TM and JM reaches that state.
   > 
   > Can you please describe the Flink job that i can use to try and replicate 
this?
   > 
   > Thank you!
   
   The reporter is saying when a checkpoint finishes before the JM handles the 
success event, if a task fails, it would clean the buffer and when committing, 
some valid files may got deleted, that cause a data loss.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to