asfimport commented on issue #406:
URL: https://github.com/apache/parquet-format/issues/406#issuecomment-2184154149

   [Jan 
Finis](https://issues.apache.org/jira/browse/PARQUET-2249?#comment-17691184) / 
@jfinis:
   I would be willing to propose a fixing commit for this, but I'm not part of 
ASF and the whole process, yet, so I don't know exactly how to get that going. 
I could start a PR on the parquet-format github repo. Is that the right point 
to suggest changes to the spec/parquet.thrift?
   
   Sidepoint: Note that NaN being larger than all other values is also:
   - The semantic that SQL has for NaN
   - What parquet-mr seems to be doing right now. At least, I have found 
parquet files that have NaN written as max_value in row group statistics.
     
     However, treating NaN as something extra by maintaining NaN counts will 
allow incorporating any NaN semantics the query engine wishes to use.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to