ashvina opened a new pull request, #596: URL: https://github.com/apache/incubator-xtable/pull/596
This change adds the missing `tightBounds` property to XTable's representation of Delta file stats. This property is present only when a Delta table has the deletion vectors property set. Because XTable did not know about the property, parsing the commit log threw an exception and failed; this change fixes that.

This change also fixes double counting of a data file. When adding deletion vector information to an existing data file, Delta Lake writes two entries to the commit log: one that removes the old entry (which has empty or outdated deletion vector information) and one that adds the entry with the updated deletion vector information. Both entries refer to the same data file path, which was already added by an earlier commit. XTable does not currently convert deletion vectors, but it can avoid counting the data file twice by ignoring the entries created when deletion vector information is updated, and this change does so.

New tests have been added to validate the changes.

Fixes #595
Depends on #588
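
The duplicate-counting case described above can be illustrated with a small, self-contained sketch. It is not the code in this pull request: `FileAction`, `NetChanges`, and `netChanges` are hypothetical names, and the rule used here (treat a remove and an add for the same path within one commit as a metadata-only update) is an assumed simplification of how such entries might be recognized.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Illustrative sketch only: none of these types are XTable or Delta Lake classes.
class DeletionVectorDedupSketch {

    /** Hypothetical, simplified stand-in for an add/remove file entry in a Delta commit. */
    static final class FileAction {
        final boolean isAdd; // true for an "add" entry, false for a "remove" entry
        final String path;   // data file path the entry refers to

        FileAction(boolean isAdd, String path) {
            this.isAdd = isAdd;
            this.path = path;
        }
    }

    /** Hypothetical holder for the net file changes of one commit. */
    static final class NetChanges {
        final List<String> added;
        final List<String> removed;

        NetChanges(List<String> added, List<String> removed) {
            this.added = added;
            this.removed = removed;
        }
    }

    /**
     * Assumption: a path that is both removed and added in the same commit is a
     * metadata-only update (such as new deletion vector information), so it is
     * reported as neither a new file nor a removed file.
     */
    static NetChanges netChanges(List<FileAction> commitActions) {
        Set<String> addedPaths = new LinkedHashSet<>();
        Set<String> removedPaths = new LinkedHashSet<>();
        for (FileAction action : commitActions) {
            if (action.isAdd) {
                addedPaths.add(action.path);
            } else {
                removedPaths.add(action.path);
            }
        }
        // Paths present in both sets are the remove/add pairs to ignore.
        Set<String> updatedInPlace = new HashSet<>(addedPaths);
        updatedInPlace.retainAll(removedPaths);
        addedPaths.removeAll(updatedInPlace);
        removedPaths.removeAll(updatedInPlace);
        return new NetChanges(new ArrayList<>(addedPaths), new ArrayList<>(removedPaths));
    }

    public static void main(String[] args) {
        List<FileAction> commit = new ArrayList<>();
        commit.add(new FileAction(false, "part-0001.parquet")); // old entry removed
        commit.add(new FileAction(true, "part-0001.parquet"));  // same file re-added with updated metadata
        commit.add(new FileAction(true, "part-0002.parquet"));  // genuinely new file
        NetChanges changes = netChanges(commit);
        // prints: added=[part-0002.parquet] removed=[]
        System.out.println("added=" + changes.added + " removed=" + changes.removed);
    }
}
```

A real implementation would likely need a stricter check than path equality, for example confirming that the re-added entry actually carries deletion vector information, since a remove and an add for the same path could in principle appear together for other reasons.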
