Fokko commented on issue #2159:
URL: 
https://github.com/apache/iceberg-python/issues/2159#issuecomment-3031132431

   Hey @koenvo thanks for raising this discussion. Nothing is set in stone, so 
there are always possibilities to optimize, and I agree, we started with rough 
building blocks.
   
   The nice thing of the current approach is, when there is nothing to delete, 
it will only create an append operation. Also creating two snapshots 
(`DELETE`+`APPEND`) instead of just a `OVERWRITE`, makes it more transparent to 
other clients to what happend to the table. Although this should not be at 
every expense. Currently, we actually produce:
   
   - `DELETE`, `APPEND` for the delete operation
   - `APPEND` for the insert operation
   
   Also, potentially generating quite a bit of data and metadata, so there is 
definitely room for improvement.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to