pvary opened a new pull request #2046: URL: https://github.com/apache/iceberg/pull/2046
When used from Hive queries it would be useful to serialize the tables at the time of the query compilation for multiple reasons: - If we use the same snapshot during the query execution we could have consistent results - If we have do not have to access the catalog during the query execution then we can save HMS calls The Serialization is implemented for BaseTables in #1920. This PR aims to do the same for the Metadata tables too. Things which might worth to check: - Moved the SerializationUtil class to the core package - currently needed only for the tests, but I thought this would be ok - Every `writeReplace()` method is exactly the same for the specific types for metadata tables. Might worth to consider moving `ops`, `table`, `name` to the `BaseMetadataTable`. Did not do this because the change I did is not significant compared to the other quasi duplicated code, and there might be other reasons I am not aware of which would prevent this refactor. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
