laskoviymishka opened a new issue, #999: URL: https://github.com/apache/iceberg-go/issues/999
Parent: #589 Depends on #998. Once new rows get `_row_id`s on first write, the next piece is preserving lineage through rewrites: - **Copy-on-write:** when a row is rewritten into a new file, its `_row_id` is preserved and `_last_updated_sequence_number` is set to the new snapshot's sequence number. - **Merge-on-read:** position-delete or DV records the row, but if the same row reappears in an equality-delete-induced rewrite, lineage must thread through. Tests: scan after a CoW overwrite shows preserved `_row_id`s with bumped `_last_updated_sequence_number`; scan after a MoR delete + reinsert shows new `_row_id` for reinserted rows (because they are logically new). Spec: [Iceberg row lineage](https://iceberg.apache.org/spec/#row-lineage). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
