Tim Armstrong created PARQUET-840:
-------------------------------------
Summary: Min-max for int96 is computed incorrectly in parquet-mr
Key: PARQUET-840
URL: https://issues.apache.org/jira/browse/PARQUET-840
Project: Parquet
Issue Type: Bug
Components: parquet-mr
Affects Versions: 1.8.0
Reporter: Tim Armstrong
The min-max stats implementation computes the min and max values of an int96
column using the BinaryStatistics class, which uses the lexical ordering of
signed bytes. Other integer types order based on the actual integer value.
The spec does not specify an ordering, but parquet-mr is internally
inconsistent - there is no reason why int96 should be ordered in a completely
different way from int64.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)