Zouxxyy opened a new pull request, #11600:
URL: https://github.com/apache/hudi/pull/11600

   ### Change Logs
   
   Suppose two jobs run concurrently, an insert (write) job and a clean job. The following interleaving loses data:
   
   1. Insert job: starts writing to partition p1, but has not yet started writing files.
   2. Clean job: lists partitions, gets p1, finds that there are no files in p1, and adds p1 to the list of partitions to be deleted.
   3. Insert job: starts writing files to partition p1 and commits.
   4. Clean job: deletes partition p1, so the committed data is lost.
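
The interleaving above can be reproduced with a toy in-memory model. This is only an illustrative sketch, not Hudi code: the `table` dict, `plan_clean`, and `execute_clean` are hypothetical names, and the model assumes the clean job decides what to delete in one phase and performs the deletion in a later one.

```python
# Toy in-memory model of the interleaving; NOT Hudi code.
# `table` maps a partition name to the set of file names it holds
# (a hypothetical stand-in for the table's storage layout).


def plan_clean(table):
    """Clean job, planning phase: pick partitions that currently hold no files."""
    return [p for p, files in table.items() if not files]


def execute_clean(table, to_delete):
    """Clean job, execution phase: delete every partition chosen at planning time."""
    for p in to_delete:
        table.pop(p, None)


table = {}

# 1. Insert job: partition p1 is visible, but no files have been written yet.
table["p1"] = set()

# 2. Clean job: lists partitions, sees that p1 is empty, schedules it for deletion.
to_delete = plan_clean(table)

# 3. Insert job: writes a file to p1 and commits.
table["p1"].add("file-1.parquet")

# 4. Clean job: acts on its stale plan and deletes p1, including the new file.
execute_clean(table, to_delete)

print(to_delete)       # ['p1']
print("p1" in table)   # False: the committed file is gone
```

The point of the sketch is that the decision in step 2 is based on a snapshot that is already stale by the time step 4 runs.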
   
   ### Impact
   
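   Fixes the data loss described above, where a concurrent clean job deletes a partition that an insert job has just written to and committed.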
   
   ### Risk level (write none, low, medium or high below)
   
   medium
   
   ### Documentation Update
   
   none
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
