laskoviymishka opened a new issue, #999:
URL: https://github.com/apache/iceberg-go/issues/999

   Parent: #589
   
   Depends on #998. Once new rows get `_row_id`s on first write, the next piece 
is preserving lineage through rewrites:
   
   - **Copy-on-write:** when a row is rewritten into a new file, its `_row_id` 
is preserved and `_last_updated_sequence_number` is set to the new snapshot's 
sequence number.
   - **Merge-on-read:** position-delete or DV records the row, but if the same 
row reappears in an equality-delete-induced rewrite, lineage must thread 
through.
   
   Tests: scan after a CoW overwrite shows preserved `_row_id`s with bumped 
`_last_updated_sequence_number`; scan after a MoR delete + reinsert shows new 
`_row_id` for reinserted rows (because they are logically new).
   
   Spec: [Iceberg row lineage](https://iceberg.apache.org/spec/#row-lineage).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to