beyond1920 opened a new pull request, #9811:
URL: https://github.com/apache/hudi/pull/9811

   ### Change Logs
   
   After Insert overwrite into a non-empty partition/table with empty datasets, 
the subsequent query results are incorrect
   For example, current dataset of  partition dt=2021-07-17 is 
   
   ||id||name||dt||
   |1|'a'|'2021-07-17'|
    
   Using the following SQL to overwrite partition dt=2021-07-17
   
   {code:java}
   insert overwrite table targetT partition(dt='2021-07-17') select id, name, 
price from sourceT where  dt=2021-07-17 and product='test'
   {code}
   If the input dataset is empty. We expect after insert overwrite, the 
partition of dt=2021-07-17 is empty.
   However, the query result is still the old data.
   
   This pr aims to fix this bug.
   
   ### Impact
   
   NA
   
   ### Risk level (write none, low medium or high below)
   
   NA
   
   ### Documentation Update
   
   NA
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to