pitrou opened a new issue, #14870:
URL: https://github.com/apache/arrow/issues/14870

   ### Describe the enhancement requested
   
   The `Statistics` structure in Parquet files provides two ways of specifying 
lower and upper bounds for a data page:
   * `min` and `max` are legacy fields for compatibility with older writers, 
with ill-defined comparison semantics in most cases except for signed integers
   * `min_value` and `max_value` are "new" fields (introduced in 2017! - see 
https://github.com/apache/parquet-format/commit/041708da1af52e7cb9288c331b542aa25b68a2b6
 and 
https://github.com/apache/parquet-format/commit/bef5438990116725af041cdd8ced2bca0ed2608a)
 with well-defined comparison semantics depending on the logical type
   
   Currently Parquet C++ supports only the legacy fields `min` and `max`. We 
should add support for reading and writing the newer ones, with the appropriate 
semantics on the write path.
   
   
   ### Component(s)
   
   Parquet


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to