raulcd opened a new issue, #47449:
URL: https://github.com/apache/arrow/issues/47449

   ### Describe the enhancement requested
   
   Currently we drop all statistics if `SortOrder` is `UNKNOWN`. This seems too 
broad and there are some statistics, like `null_count` that could be maintained.
   
   
https://github.com/apache/arrow/blob/6f6138b7eedece0841b04f4e235e3bedf5a3ee29/cpp/src/parquet/metadata.cc#L330-L335
   
   Clearing `min/max` but allowing to keep `null_count` when `SortOrder` is 
`UNKNOWN` would allow users to use them.
   
   We could probably also keep `num_values` and `distinct_count`.
   
   ### Component(s)
   
   C++, Parquet


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to