aokolnychyi opened a new pull request #131: Fix collection of bounds for small decimals in ParquetMetrics URL: https://github.com/apache/incubator-iceberg/pull/131 This PR resolves #125. `ParquetMetrics` uses `ParquetConversions$fromParquetPrimitive`, which assumes that decimals are always represented as binary in Parquet. The last statement is not true according to[ the Parquet spec](https://github.com/apache/parquet-format/blob/master/LogicalTypes.md). As a consequence, Iceberg might collect invalid lower/upper bounds that can lead to skipping wrong files. See the issue description for an example.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
