beyond1920 opened a new pull request, #9811:
URL: https://github.com/apache/hudi/pull/9811
### Change Logs
After Insert overwrite into a non-empty partition/table with empty datasets,
the subsequent query results are incorrect
For example, current dataset of partition dt=2021-07-17 is
||id||name||dt||
|1|'a'|'2021-07-17'|
Using the following SQL to overwrite partition dt=2021-07-17
{code:java}
insert overwrite table targetT partition(dt='2021-07-17') select id, name,
price from sourceT where dt=2021-07-17 and product='test'
{code}
If the input dataset is empty. We expect after insert overwrite, the
partition of dt=2021-07-17 is empty.
However, the query result is still the old data.
This pr aims to fix this bug.
### Impact
NA
### Risk level (write none, low medium or high below)
NA
### Documentation Update
NA
### Contributor's checklist
- [ ] Read through [contributor's
guide](https://hudi.apache.org/contribute/how-to-contribute)
- [ ] Change Logs and Impact were stated clearly
- [ ] Adequate tests were added if applicable
- [ ] CI passed
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]