jackye1995 opened a new pull request #2354:
URL: https://github.com/apache/iceberg/pull/2354
This is the continuation for #2010 for adding a concept that describes how a
row in a table should be uniquely identified. I have the implementation ready
up to the Spark SQL extension to update the row identifier, and will separate
them into multiple PRs for review. This PR should have the same amount of
content as what openInx had in the old PR.
This PR adds row identifier to the Table and TableMetadata API, and writes
the metadata information as something like:
```json
...
"default-row-id-version": 1,
"row-ids": [
{
"row-id-version": 3,
"fields": [
{
"source-id": 1
},
{
"source-id": 3
}
]
}
],
...
```
I will add reasons behind the namings inline.
@openinx @rdblue @aokolnychyi
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]