rdblue commented on PR #14502: URL: https://github.com/apache/iceberg/pull/14502#issuecomment-3762244527
I don't know that I think this is a good idea. I think that the primary problem is that the snapshot summary may persist partition information that could be sensitive. To me, the right solution is to stop embedding partition information in the snapshot summary and instead capture that data (if it is needed) using the metrics reporting framework and REST endpoint. That solution to getting partition metrics keeps partition info out of the snapshot summaries and tracks it through a separate path where it can be transient or protected differently. If the primary reason for introducing this is to stop leaking partition summary information in snapshots, then I'd recommend solving that problem more directly with something like a catalog override that suppresses them. Or just drop them at the catalog level when processing `AddSnapshot` changes. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
