[ 
https://issues.apache.org/jira/browse/KUDU-1594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15476992#comment-15476992
 ] 

David Alves commented on KUDU-1594:
-----------------------------------

TIMESTAMP's current format was created to match postgresql's timestamp without 
timezone, which Impala was maybe going to match later. If that's not going to 
happen then I'm ambivalent. Impala's type is unique enough that we might just 
want to store it in an INT96 like parquet does instead of calling that the new 
TIMESTAMP, which begs the question what will we call TIMESTAMP then?

> Rename TIMESTAMP type to avoid confusion with other timestamp types
> -------------------------------------------------------------------
>
>                 Key: KUDU-1594
>                 URL: https://issues.apache.org/jira/browse/KUDU-1594
>             Project: Kudu
>          Issue Type: Improvement
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>            Priority: Critical
>
> Kudu aims to be part of the Hadoop ecosystem, and other tools in the Hadoop 
> ecosystem store timestamps differently than Kudu. For example:
> - Parquet has TIMESTAMP_MILLIS which is milliseconds since the Unix epoch.
> - Impala internally stores a {64-bit nanoseconds since midnight, 32-bit 
> Julian day number}, and when storing in Parquet, uses Parquet's INT96 type to 
> store this.
> - Hive internally uses a 32-bit seconds-since-Unix-epoch, plus an optional 
> nanoseconds component
> To avoid adding to the confusion, we should name our time more explicitly (eg 
> UNIX_MICROTIMESTAMP or UNIXTIME_MICROS or somesuch)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to