RussellSpitzer commented on issue #13855: URL: https://github.com/apache/iceberg/issues/13855#issuecomment-3245779907
My summary thoughts, Currently metrics are not good for this because we have almost no way to determine the difference between, didn't store metrics for a column and column wasn't written. In general we assume that if we don't see metrics, that column exists. Linking Schema has a similar issue, If I don't have metrics for an optional column it could be missing or It could have values, so I can't make the call. It would let me avoid columns that don't exist entirely in the leaked schema so there is a benefit there. I just worry that the more common case is schemas with optional columns (possibly many many optional columns) where we aren't storing metrics. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org