Zoltán Borók-Nagy created IMPALA-8498:
-----------------------------------------

             Summary: Write column index for floating types when NaN is not 
present
                 Key: IMPALA-8498
                 URL: https://issues.apache.org/jira/browse/IMPALA-8498
             Project: IMPALA
          Issue Type: Bug
            Reporter: Zoltán Borók-Nagy


IMPALA-7304 disabled column index writing for floating point columns until 
PARQUET-1222 is resolved.

PARQUET-1222 is responsible for defining a total order for floating values, but 
the problematic values are only the NaNs. Therefore we can write the column 
index if NaNs are not present in the data. Parquet-MR also does this, following 
the principles in 
[https://github.com/apache/parquet-format/blob/75eb7a7b84e6e62bfb09668b6d8d40b12597456e/src/main/thrift/parquet.thrift#L827-L834]
 

Impala should follow this behavior, and also when storing zeroes, it should 
store -0.0 as minimum and +0.0 as maximum.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to