mehtaashish23 commented on issue #1949: URL: https://github.com/apache/iceberg/issues/1949#issuecomment-748189545
@rdblue started an email thread for this, will summarize the details here as it concludes, but here is the detail. For us, our main use case is primary key based dataSets (like MySQL bin log export) where the DELETE and MERGE always update records based on the primary key. Considering that I know the primary key and this fixed use case of primary key- based updates, I can easily construct back CDC from the appended/deleted data from the table, by taking full outer join on primary key between appended data and deleted data, and expose what all rows were updated/inserted/deleted, along with the previous value in case of updates/deletes. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
