Fokko commented on issue #2159: URL: https://github.com/apache/iceberg-python/issues/2159#issuecomment-3031132431
Hey @koenvo, thanks for raising this discussion. Nothing is set in stone, so there are always possibilities to optimize, and I agree, we started with rough building blocks.

The nice thing about the current approach is that when there is nothing to delete, it only creates an append operation. Also, creating two snapshots (`DELETE`+`APPEND`) instead of just an `OVERWRITE` makes it more transparent to other clients what happened to the table. This should not come at any cost, though.

Currently, we actually produce:

- `DELETE`, `APPEND` for the delete operation
- `APPEND` for the insert operation

This also potentially generates quite a bit of data and metadata, so there is definitely room for improvement.

-- 
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.
