JsZero opened a new pull request, #6090:
URL: https://github.com/apache/hive/pull/6090

   …nal Iceberg table may unexpectedly delete other partitions
   
   ### What changes were proposed in this pull request?
   This PR addresses 
[HIVE-29218](https://issues.apache.org/jira/browse/HIVE-29218), which reports 
an issue when using LOAD OVERWRITE on multi-level partitioned Iceberg tables 
with appendFile as the underlying implementation.
   
   The problem was that the overwrite logic could mistakenly delete metadata of 
partitions that should not be affected. For example, given a table partitioned 
by pcol1 and pcol2, with existing data in partition pcol1=x/pcol2=y, running 
LOAD OVERWRITE into another partition such as pcol1=x/pcol2=z would incorrectly 
remove the metadata for pcol1=x/pcol2=y just because it partially overlaps on 
pcol1=x.
   
   The PR fix the overwrite handling logic in Iceberg LOAD implementation to 
ensure that only the target partition is overwritten.
   
   ### Why are the changes needed?
   The current implementation may inadvertently delete metadata that should be 
preserved.
   
   ### Does this PR introduce _any_ user-facing change?
   No. This change only fixes incorrect behavior where unrelated partitions' 
metadata might be unintentionally deleted during LOAD OVERWRITE. The intended 
user-facing behavior remains unchanged.
   
   ### How was this patch tested?
   ut


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to