Weston Pace created PARQUET-2068:
------------------------------------
Summary: [C++] [Parquet] Use arrow compute to determine min/max of
dictionaries (possibly other arrays?)
Key: PARQUET-2068
URL: https://issues.apache.org/jira/browse/PARQUET-2068
Project: Parquet
Issue Type: Improvement
Components: parquet-cpp
Reporter: Weston Pace
parquet::Comparator is currently used to calculate the min & max values of an
array. This should be benchmarked against arrow::compute's MinMax kernel (once
it supports all necessary data types). The latter should be more aggressive
with SIMD resulting in better performance.
Even if there is no performance difference the MinMax kernel should be used
when computing dictionary statistics as the current implementation requires
making a copy of the dictionary values array (see ARROW-12513)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)