singhpk234 opened a new pull request, #14462: URL: https://github.com/apache/iceberg/pull/14462
### About the change

Presently, when expressions and transforms are serialized, they are first converted to an unbound expression / transform that references columns by name. When we deserialize and want to bind them back, we look the columns up by name.

This proposal introduces a notion of `IDReference`, which, in addition to the name, includes the field ID of the column, and this information is serialized in the representation.

This is especially helpful in cases such as Row Access Policy, where the catalog returns an expression that needs to be enforced by the engine. For that, it is important for the catalog to give back a bound expression to protect against column rename / drop (for example, when reading an old snapshot).

Read Restrictions spec here:
- https://github.com/apache/iceberg/pull/13879

This has been discussed a couple of times in the Catalog community syncs:
- https://docs.google.com/document/d/1iPGVCIcr-M0XtAiudOguWAvmqIdVgpYN5vz5ohO8PKw/edit?tab=t.0#heading=h.cr6o1g2rn5hc

Nevertheless, we have been iterating on this PR in my fork, as well as in the community sync, to get some initial feedback:
- https://github.com/singhpk234/iceberg/pull/270

Opening this PR since the feedback looks positive and the change is ready for wider discussion.

Note: This includes a spec change too.
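To illustrate why binding by field ID is more robust than binding by name, here is a minimal, self-contained sketch. It does not use the Iceberg API: `Field`, `Reference`, and `bind` are hypothetical names invented for this example, and falling back to the name when no field ID is present is an assumption, not something the PR states. It relies only on the fact that Iceberg field IDs stay stable when a column is renamed.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Field:
    """Hypothetical stand-in for a schema column: a stable ID plus a current name."""
    field_id: int
    name: str


@dataclass(frozen=True)
class Reference:
    """Hypothetical serialized reference; field_id=None models a name-only reference."""
    name: str
    field_id: Optional[int] = None


def bind(ref: Reference, schema: List[Field]) -> Optional[Field]:
    """Resolve a reference by field ID when one is present, otherwise by name."""
    if ref.field_id is not None:
        for field in schema:
            if field.field_id == ref.field_id:
                return field
        return None
    for field in schema:
        if field.name == ref.name:
            return field
    return None


# Schema at the time the policy expression was written.
old_schema = [Field(1, "id"), Field(2, "region")]
# The same table after column 2 was renamed; the field ID is unchanged.
new_schema = [Field(1, "id"), Field(2, "geo")]

by_name = Reference("region")
by_id = Reference("region", field_id=2)

name_result = bind(by_name, new_schema)  # name lookup no longer finds the column
id_result = bind(by_id, new_schema)      # ID lookup still resolves to column 2
```

In this sketch the name-only reference fails to bind after the rename, while the ID-carrying reference still resolves to the renamed column, which is the behavior a catalog-supplied policy expression needs.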
