pitrou opened a new issue, #14870: URL: https://github.com/apache/arrow/issues/14870
### Describe the enhancement requested

The `Statistics` structure in Parquet files provides two ways of specifying lower and upper bounds for a data page:

* `min` and `max` are legacy fields for compatibility with older writers, with ill-defined comparison semantics in most cases except for signed integers
* `min_value` and `max_value` are "new" fields (introduced in 2017! - see https://github.com/apache/parquet-format/commit/041708da1af52e7cb9288c331b542aa25b68a2b6 and https://github.com/apache/parquet-format/commit/bef5438990116725af041cdd8ced2bca0ed2608a) with well-defined comparison semantics depending on the logical type

Currently Parquet C++ supports only the legacy fields `min` and `max`. We should add support for reading and writing the newer ones, with the appropriate semantics on the write path.

### Component(s)

Parquet
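
To illustrate the read-side behaviour requested above, here is a minimal Python sketch. It is not Parquet C++ code: the `Statistics` dataclass, `resolve_bounds`, and `signed_key` are hypothetical names introduced only for this example. It assumes a reader should prefer `min_value`/`max_value` when both are present and should trust the legacy `min`/`max` only for signed integer columns. The second half shows why the legacy fields are ambiguous for byte strings: signed and unsigned byte-wise comparison can disagree on which value is the minimum.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Statistics:
    """Hypothetical stand-in for the Thrift `Statistics` struct (bounds only)."""
    min: Optional[bytes] = None        # legacy lower bound
    max: Optional[bytes] = None        # legacy upper bound
    min_value: Optional[bytes] = None  # newer lower bound
    max_value: Optional[bytes] = None  # newer upper bound


def resolve_bounds(stats: Statistics,
                   is_signed_integer: bool) -> Optional[Tuple[bytes, bytes]]:
    """Pick the bounds a reader may rely on, or None if none are trustworthy."""
    # Prefer the newer fields: their ordering is defined by the logical type.
    if stats.min_value is not None and stats.max_value is not None:
        return stats.min_value, stats.max_value
    # Assumption for this sketch: legacy fields are only used for signed integers.
    if is_signed_integer and stats.min is not None and stats.max is not None:
        return stats.min, stats.max
    return None


def signed_key(raw: bytes) -> List[int]:
    """Sort key that compares bytes as signed 8-bit integers."""
    return [b - 256 if b >= 128 else b for b in raw]


# Two UTF-8 strings whose order depends on the comparison used.
values = ["z".encode("utf-8"), "é".encode("utf-8")]
unsigned_min = min(values)                # Python compares bytes as unsigned
signed_min = min(values, key=signed_key)  # signed comparison picks the other one
```

With these inputs, `unsigned_min` is the encoding of `"z"` while `signed_min` is the encoding of `"é"`, so a lower bound written under one convention cannot safely be used for filtering by a reader that assumes the other.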
