[GitHub] [iceberg] jackye1995 opened a new pull request #2354: Core: add row identifier to format v2

GitBox Sat, 20 Mar 2021 19:45:21 -0700


jackye1995 opened a new pull request #2354:
URL: https://github.com/apache/iceberg/pull/2354



   This is a continuation of #2010, which adds a concept describing how a 
row in a table should be uniquely identified. I have the implementation ready 
up to the Spark SQL extension for updating the row identifier, and will split 
it into multiple PRs for review. This PR should cover roughly the same scope 
as what openinx had in the old PR.
   
   This PR adds the row identifier to the Table and TableMetadata APIs, and 
writes the metadata in a form like:
   
   ```json
     ...
     "default-row-id-version": 1,
     "row-ids": [
       {
         "row-id-version": 3,
         "fields": [
           {
             "source-id": 1
           },
           {
             "source-id": 3
           }
         ]
       }
     ],
     ...
   ```
   
   I will explain the reasoning behind the names in inline comments.
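
For reviewers who want to see how a reader might consume these fields, here is a minimal Python sketch. It assumes that `default-row-id-version` selects the entry in `row-ids` whose `row-id-version` matches, by analogy with how `default-spec-id` selects an entry in `partition-specs`; the PR text does not confirm this. The helper name `default_row_id_source_ids` and the sample fragment (with versions chosen so the default resolves) are hypothetical and not part of this PR.

```python
import json

# Hypothetical metadata fragment modeled on the JSON above. The version
# numbers are illustrative and chosen so the default version is present.
METADATA = """
{
  "default-row-id-version": 1,
  "row-ids": [
    {"row-id-version": 0, "fields": []},
    {"row-id-version": 1, "fields": [{"source-id": 1}, {"source-id": 3}]}
  ]
}
"""


def default_row_id_source_ids(metadata):
    """Return the source field IDs of the default row identifier.

    Assumption: the default is the entry in "row-ids" whose
    "row-id-version" equals "default-row-id-version".
    """
    default_version = metadata["default-row-id-version"]
    for row_id in metadata.get("row-ids", []):
        if row_id["row-id-version"] == default_version:
            return [field["source-id"] for field in row_id["fields"]]
    raise ValueError(
        "no row identifier with version %d" % default_version
    )


if __name__ == "__main__":
    print(default_row_id_source_ids(json.loads(METADATA)))
```

Raising on an unresolved default, rather than returning an empty list, keeps "no identifier fields" distinct from "metadata is inconsistent" in this sketch.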
   
   @openinx @rdblue @aokolnychyi 




