asfimport commented on issue #406: URL: https://github.com/apache/parquet-format/issues/406#issuecomment-2184154149
[Jan Finis](https://issues.apache.org/jira/browse/PARQUET-2249?#comment-17691184) / @jfinis: I would be willing to propose a fixing commit for this, but I'm not part of ASF and the whole process, yet, so I don't know exactly how to get that going. I could start a PR on the parquet-format github repo. Is that the right point to suggest changes to the spec/parquet.thrift? Sidepoint: Note that NaN being larger than all other values is also: - The semantic that SQL has for NaN - What parquet-mr seems to be doing right now. At least, I have found parquet files that have NaN written as max_value in row group statistics. However, treating NaN as something extra by maintaining NaN counts will allow incorporating any NaN semantics the query engine wishes to use. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
