[ 
https://issues.apache.org/jira/browse/PARQUET-323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15902218#comment-15902218
 ] 

Lars Volker commented on PARQUET-323:
-------------------------------------

We discussed this issue in today's Parquet sync and agreed to deprecate INT96. 
As a replacement for storing timestamps (the most common use of INT96), we will 
encourage all projects that currently use INT96 to switch to INT64 with either 
the TIMESTAMP_MILLIS or the TIMESTAMP_MICROS logical type.
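
For projects planning the switch, here is a minimal sketch of converting one 
INT96 timestamp to an INT64 TIMESTAMP_MICROS value. It assumes the layout 
commonly written by Impala and Hive: 8 little-endian bytes of nanoseconds 
within the day, followed by 4 little-endian bytes holding the Julian day 
number. The function name and constants are illustrative only and are not 
part of any Parquet library.

```python
import struct

JULIAN_DAY_OF_UNIX_EPOCH = 2440588      # Julian day number of 1970-01-01
MICROS_PER_DAY = 86_400 * 1_000_000


def int96_to_timestamp_micros(raw: bytes) -> int:
    """Convert a 12-byte INT96 timestamp to microseconds since the Unix epoch.

    Assumed layout (not taken from this thread): little-endian int64
    nanoseconds-of-day, then little-endian int32 Julian day.
    """
    if len(raw) != 12:
        raise ValueError("an INT96 value must be exactly 12 bytes")
    nanos_of_day, julian_day = struct.unpack("<qi", raw)
    days_since_epoch = julian_day - JULIAN_DAY_OF_UNIX_EPOCH
    # Precision below one microsecond is truncated by the integer division.
    return days_since_epoch * MICROS_PER_DAY + nanos_of_day // 1_000
```

Dividing the result by 1000 would give the corresponding TIMESTAMP_MILLIS 
value, at the cost of dropping the microsecond digits as well.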

We will not fix the ordering issues around INT96 that resulted in parquet-mr 
writing wrong min/max statistics.
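
To illustrate why ordering is a problem for INT96 timestamps, the sketch 
below compares two values on their raw bytes. It assumes the same layout as 
above (nanoseconds-of-day first, Julian day last, both little-endian) and 
uses plain lexicographic byte comparison as a stand-in; it is not the 
comparator parquet-mr actually used. The helper pack_int96 is hypothetical.

```python
import struct


def pack_int96(julian_day: int, nanos_of_day: int) -> bytes:
    # Assumed layout: little-endian int64 nanos-of-day, then int32 Julian day.
    return struct.pack("<qi", nanos_of_day, julian_day)


earlier = pack_int96(2440588, 1)  # 1970-01-01, 1 ns after midnight
later = pack_int96(2440589, 0)    # 1970-01-02, midnight

# Chronological order: compare (julian_day, nanos_of_day).
chronological_min = min(
    [earlier, later], key=lambda b: struct.unpack("<qi", b)[::-1]
)

# Raw byte order: the nanosecond bytes come first, so they dominate.
bytewise_min = min(earlier, later)

print(chronological_min == earlier)  # True
print(bytewise_min == later)         # True: the later timestamp sorts first
```

Under these assumptions, a min/max computed on the raw bytes can pick the 
wrong value, which is the kind of statistic a reader cannot safely use for 
filtering.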

> INT96 should be marked as deprecated
> ------------------------------------
>
>                 Key: PARQUET-323
>                 URL: https://issues.apache.org/jira/browse/PARQUET-323
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-format
>            Reporter: Cheng Lian
>
> As discussed on the mailing list, {{INT96}} is only used to represent 
> nanosecond timestamps in Impala for historical reasons, and should be 
> deprecated. Since nanosecond precision is rarely a real requirement, one 
> simple solution would be to replace {{INT96}} with 
> {{INT64 (TIMESTAMP_MILLIS)}} or {{INT64 (TIMESTAMP_MICROS)}}.
> Several projects (Impala, Hive, Spark, ...) support INT96.
> We need a clear spec of the replacement and the path to deprecation.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
