[jira] [Resolved] (PARQUET-903) C++: Add option to set RPATH to ORIGIN

2017-03-06 Thread Wes McKinney (JIRA)

 [ 
https://issues.apache.org/jira/browse/PARQUET-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney resolved PARQUET-903.
--
   Resolution: Fixed
Fix Version/s: cpp-1.0.0

Issue resolved by pull request 265
[https://github.com/apache/parquet-cpp/pull/265]

> C++: Add option to set RPATH to ORIGIN
> --
>
> Key: PARQUET-903
> URL: https://issues.apache.org/jira/browse/PARQUET-903
> Project: Parquet
>  Issue Type: New Feature
>  Components: parquet-cpp
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
> Fix For: cpp-1.0.0
>
>
> Made this an option as the actual RPATH value differs between OSX and Linux.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (PARQUET-907) Optionally store Page level metadata in the footer to enable predicate pushdowns

2017-03-06 Thread Julien Le Dem (JIRA)
Julien Le Dem created PARQUET-907:
-

 Summary: Optionally store Page level metadata in the footer to 
enable predicate pushdowns
 Key: PARQUET-907
 URL: https://issues.apache.org/jira/browse/PARQUET-907
 Project: Parquet
  Issue Type: Improvement
  Components: parquet-format
Reporter: Julien Le Dem






--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (PARQUET-906) add logical type timestamp with timezone (per SQL)

2017-03-06 Thread Julien Le Dem (JIRA)
Julien Le Dem created PARQUET-906:
-

 Summary: add logical type timestamp with timezone (per SQL)
 Key: PARQUET-906
 URL: https://issues.apache.org/jira/browse/PARQUET-906
 Project: Parquet
  Issue Type: Improvement
  Components: parquet-format
Reporter: Julien Le Dem
Priority: Minor


We need to clarify the spec here.
TODO: validate the following points.
timestamp with timezone (per SQL)
- each value has timezone
- TZ can be different for each value




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (PARQUET-905) Add "Floating Timestamp" logical type

2017-03-06 Thread Julien Le Dem (JIRA)
Julien Le Dem created PARQUET-905:
-

 Summary: Add "Floating Timestamp" logical type
 Key: PARQUET-905
 URL: https://issues.apache.org/jira/browse/PARQUET-905
 Project: Parquet
  Issue Type: Improvement
  Components: parquet-format
Reporter: Julien Le Dem


Unlike current Parquet Timestamp stored in UTC, a "floating timestamp" has no 
timezone, it is up to the reader to interpret the timestamps based on their 
timezone.
This is the behavior of a Timestamp in the sql standard





--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (PARQUET-904) Define INT96 ordering

2017-03-06 Thread Julien Le Dem (JIRA)
Julien Le Dem created PARQUET-904:
-

 Summary: Define INT96 ordering
 Key: PARQUET-904
 URL: https://issues.apache.org/jira/browse/PARQUET-904
 Project: Parquet
  Issue Type: Improvement
  Components: parquet-format
Reporter: Julien Le Dem


Currently int96 binary ordering doesn't match its natural ordering.
We should either specify this or declare int96 not ordered and link to the type 
replacing it.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Updated] (PARQUET-323) INT96 should be marked as deprecated

2017-03-06 Thread Julien Le Dem (JIRA)

 [ 
https://issues.apache.org/jira/browse/PARQUET-323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Julien Le Dem updated PARQUET-323:
--
Description: 
As discussed in the mailing list, {{INT96}} is only used to represent nanosec 
timestamp in Impala for some historical reasons, and should be deprecated. 
Since nanosec precision is rarely a real requirement, one possible and simple 
solution would be replacing {{INT96}} with {{INT64 (TIMESTAMP_MILLIS)}} or 
{{INT64 (TIMESTAMP_MICROS)}}.

Several projects (Impala, Hive, Spark, ...) support INT96.
We need a clear spec of the replacement and the path to deprecation.

  was:As discussed in the mailing list, {{INT96}} is only used to represent 
nanosec timestamp in Impala for some historical reasons, and should be 
deprecated. Since nanosec precision is rarely a real requirement, one possible 
and simple solution would be replacing {{INT96}} with {{INT64 
(TIMESTAMP_MILLIS)}} or {{INT64 (TIMESTAMP_MICROS)}}.


> INT96 should be marked as deprecated
> 
>
> Key: PARQUET-323
> URL: https://issues.apache.org/jira/browse/PARQUET-323
> Project: Parquet
>  Issue Type: Bug
>  Components: parquet-format
>Reporter: Cheng Lian
>
> As discussed in the mailing list, {{INT96}} is only used to represent nanosec 
> timestamp in Impala for some historical reasons, and should be deprecated. 
> Since nanosec precision is rarely a real requirement, one possible and simple 
> solution would be replacing {{INT96}} with {{INT64 (TIMESTAMP_MILLIS)}} or 
> {{INT64 (TIMESTAMP_MICROS)}}.
> Several projects (Impala, Hive, Spark, ...) support INT96.
> We need a clear spec of the replacement and the path to deprecation.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Commented] (PARQUET-890) C++: Support I/O of DATE columns in parquet_arrow

2017-03-06 Thread Uwe L. Korn (JIRA)

[ 
https://issues.apache.org/jira/browse/PARQUET-890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15897158#comment-15897158
 ] 

Uwe L. Korn commented on PARQUET-890:
-

PR: https://github.com/apache/parquet-cpp/pull/266

> C++: Support I/O of DATE columns in parquet_arrow
> -
>
> Key: PARQUET-890
> URL: https://issues.apache.org/jira/browse/PARQUET-890
> Project: Parquet
>  Issue Type: New Feature
>  Components: parquet-cpp
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>
> Currently we can only write Timestamp columns but no DATE columns to/from 
> Arrow. This did not occur earlier as the Pandas<->Parquet tests never pass 
> actuall date columns but Timestamp columns to Parquet.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)