kamcheungting-db opened a new pull request, #16583: URL: https://github.com/apache/iceberg/pull/16583
InternalData.ReadBuilder had no way to pass a filter expression to the underlying format reader. For Parquet, this meant row-group skipping (already implemented via ParquetMetricsRowGroupFilter) was never reachable from internal metadata reads such as partition statistics scans. This PR adds InternalData.read(format, file, filterHint) as the primary entry point for filtered reads. The hint is a best-effort I/O optimization — Parquet uses it for row-group skipping; Avro ignores it. Callers are responsible for correctness via a residual filter. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
