westonpace opened a new pull request #10729: URL: https://github.com/apache/arrow/pull/10729
This fixes two issues. * The null_count must be obtained from the indices array and not the values array * The min/max should be based on referenced values and not all values in the values array Note: This further adds a dependency from parquet onto arrow::compute (I use it both to compute the unique indices and to take the referenced values). This dependency already existed (column_writer.cc relies on arrow::compute::Cast) so I'm pretty sure this isn't a problem. Related: ARROW-8891 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
