danny0405 commented on issue #5020:
URL: https://github.com/apache/hudi/issues/5020#issuecomment-1121853297

   > @danny0405: going back to your original example in the description. If a 
file slice was written in C1 and never updated in any of the future commits, 
then at C10 or C11, even if the cleaner detects that all data files pertaining 
to C1 to C4 need to be deleted, the latest file slice in C1 will never be 
touched. The cleaner always ensures that the latest file slice of any file 
group is never cleaned up. So, can you help me understand why we might see 
FileNotFoundIssue?
   
   My description was not accurate. Because of snapshot isolation, a reader 
can start reading the C1 file slice `s1` while it is still the latest slice in 
its file group at C9. If a subsequent C10 then modifies that file group and 
the cleaner starts working, `s1` is no longer the latest slice and can be 
cleaned while the reader is still using it.
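
   The sequence above can be sketched as a toy model. This is not Hudi code: 
the dictionary layout, `latest_slice`, `clean`, the `keep_latest` parameter, 
and the slice name `s2` are all hypothetical, and the keep-one-version policy 
is only an assumption chosen to make the race visible.

```python
# Toy model (assumption, not Hudi's API): one file group whose file slices
# are keyed by the commit time that produced them.
file_group = {1: "s1"}   # commit C1 wrote slice s1
storage = {"s1"}         # files physically present on storage


def latest_slice(group, as_of):
    """Return the newest slice whose commit is <= as_of (a snapshot read)."""
    commit = max(c for c in group if c <= as_of)
    return group[commit]


def clean(group, files, keep_latest=1):
    """Delete all but the newest `keep_latest` slices of the file group."""
    for commit in sorted(group)[:-keep_latest]:
        files.discard(group.pop(commit))


# The reader plans its query at C9: s1 is the latest slice in the file group.
planned = latest_slice(file_group, as_of=9)

# C10 updates the file group and produces a new slice (hypothetical name s2).
file_group[10] = "s2"
storage.add("s2")

# The cleaner runs: s1 is no longer the latest slice, so it is eligible.
clean(file_group, storage)

# The reader now tries to open the file it planned to read at C9.
reader_fails = planned not in storage
```

   In this sketch the cleaner never removes the latest slice, yet the reader 
still fails, because "latest" was evaluated at C9 for the reader and after 
C10 for the cleaner.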


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
