crepererum opened a new issue #264:
URL: https://github.com/apache/arrow-rs/issues/264


   **Is your feature request related to a problem or challenge? Please describe 
what you are trying to do.**
   After #256 we completely ignore NaNs in parquet statistics. However, there 
are good reasons to fully order floats and include NaN somewhere:
   
   - Databases usually need it, see 
https://github.com/cockroachdb/cockroach/issues/18860 and if DataFusion wants 
to follow the Postgres dialect we will have the same issue there
   - Other parquet users are interested as well: 
[PARQUET-1222](https://issues.apache.org/jira/browse/PARQUET-1222)
   - Just ignoring NaN is kinda having an implicit NULL for float while we 
already have a NULL-mask
   
   **Describe the solution you'd like**
   Put NaNs at the end of the float scale, so the order is:
   
   1. -inf
   2. "ordinary" numbers
   3. +inf
   4. NaN
   
   **Describe alternatives you've considered**
   Keeping the status quo (aka current master).
   
   **Additional context**
   .
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to