alamb commented on issue #264: URL: https://github.com/apache/arrow-rs/issues/264#issuecomment-833679622
I think keeping NaN as the min/max value in the stats is unlikely to be useful for query processing, much like storing NULL as a min/max value is not very helpful. I would personally suggest keeping NaNs out of the statistics and adding additional metadata (like "contains NaNs" or whatever) if that is important information to retain. If we really want to add NaN to the stats, I think it would help to articulate an actual use case where having a NaN there would be useful.
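To make the suggestion concrete, here is a rough sketch of what "keep NaNs out of min/max, track them separately" could look like. The `FloatStats` struct, its `contains_nan` field, and the `float_stats` function are all hypothetical names made up for illustration; this is not the existing arrow-rs or parquet statistics API, just one possible shape under those assumptions.

```rust
/// Hypothetical statistics record (an assumption for illustration,
/// not a type from arrow-rs or the parquet crate).
#[derive(Debug, PartialEq)]
struct FloatStats {
    /// Smallest non-NaN value, or None if there were no non-NaN values.
    min: Option<f64>,
    /// Largest non-NaN value, or None if there were no non-NaN values.
    max: Option<f64>,
    /// Separate metadata flag: true if at least one NaN was seen.
    contains_nan: bool,
}

/// Computes min/max over the non-NaN values only and records
/// the presence of NaNs in a separate flag.
fn float_stats(values: &[f64]) -> FloatStats {
    let mut stats = FloatStats {
        min: None,
        max: None,
        contains_nan: false,
    };
    for &v in values {
        if v.is_nan() {
            // NaN never becomes a bound; it is only noted in the flag.
            stats.contains_nan = true;
            continue;
        }
        stats.min = Some(stats.min.map_or(v, |m| m.min(v)));
        stats.max = Some(stats.max.map_or(v, |m| m.max(v)));
    }
    stats
}
```

With something like this, `float_stats(&[1.5, f64::NAN, -2.0, 7.25])` would still give usable bounds for pruning a predicate such as `x > 10`, while a reader that cares about NaNs can check the flag instead of the bounds.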
