[ 
https://issues.apache.org/jira/browse/IMPALA-8498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16951405#comment-16951405
 ] 

ASF subversion and git services commented on IMPALA-8498:
---------------------------------------------------------

Commit f65c2a754fe3011a17aa3bde093c117e57ffedcb in impala's branch 
refs/heads/master from norbert.luksa
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=f65c2a7 ]

IMPALA-8498: Write column index for floating types when NaN is not present

IMPALA-7304 disabled column index writing for floating point columns
until PARQUET-1222 is resolved. However, the only problematic values
are the NaNs. Therefore we can write the column index if NaNs are not
present in the data.

Testing:
  * Added tests that fail if a column index is written for
    a column that contains NaN values.

Change-Id: Ic9d367500243c8ca142a16ebfeef6c841f013434
Reviewed-on: http://gerrit.cloudera.org:8080/14264
Reviewed-by: Impala Public Jenkins <[email protected]>
Tested-by: Impala Public Jenkins <[email protected]>


> Write column index for floating types when NaN is not present
> -------------------------------------------------------------
>
>                 Key: IMPALA-8498
>                 URL: https://issues.apache.org/jira/browse/IMPALA-8498
>             Project: IMPALA
>          Issue Type: Bug
>            Reporter: Zoltán Borók-Nagy
>            Assignee: Norbert Luksa
>            Priority: Major
>              Labels: ramp-up
>
> IMPALA-7304 disabled column index writing for floating point columns until 
> PARQUET-1222 is resolved.
> PARQUET-1222 is responsible for defining a total order for floating-point 
> values, but the only problematic values are the NaNs. Therefore we can write 
> the column index if NaNs are not present in the data. Parquet-MR also does this, 
> following the principles in 
> [https://github.com/apache/parquet-format/blob/75eb7a7b84e6e62bfb09668b6d8d40b12597456e/src/main/thrift/parquet.thrift#L827-L834]
>  
> Impala should follow this behavior. In addition, when a minimum or maximum 
> value is zero, it should store -0.0 as the minimum and +0.0 as the maximum.
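
The rule described above can be sketched as follows. This is a minimal
illustration in Python, not Impala's actual C++ implementation:
page_min_max is a hypothetical helper name, and returning None to mean
"do not write a column index entry" is an assumption made for the example.

```python
import math
from typing import Iterable, Optional, Tuple


def page_min_max(values: Iterable[float]) -> Optional[Tuple[float, float]]:
    """Return (min, max) bounds for a page of floats, or None.

    Hypothetical helper: None signals that no usable bounds exist,
    either because a NaN was seen or because the page is empty
    (the empty case is a simplification for this sketch).
    """
    lo = hi = None
    for v in values:
        if math.isnan(v):
            # NaN has no defined ordering, so min/max would be unreliable.
            return None
        if lo is None or v < lo:
            lo = v
        if hi is None or v > hi:
            hi = v
    if lo is None:
        return None
    # -0.0 == 0.0 compares equal, so normalize the sign explicitly:
    # a zero minimum is stored as -0.0 and a zero maximum as +0.0.
    if lo == 0.0:
        lo = -0.0
    if hi == 0.0:
        hi = 0.0
    return lo, hi
```

Normalizing the zeroes keeps the stored range valid for readers that
treat -0.0 as smaller than +0.0: a page with minimum 0.0 may still
contain -0.0, and a page with maximum -0.0 may still contain +0.0.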



