Re: Purpose of isAdjustedToUTC for time type?

2021-06-22 Thread Zoltan Ivanfi
> Just to clarify we only expect "isAdjustedToUTC = false" flags to be > written/read? > > On Mon, Jun 21, 2021 at 2:05 AM Zoltan Ivanfi wrote: > >> Hi, >> >> The UTC normalization mechanism was created primarily with timestamps >> (date + time) in mind. The flag

Re: Purpose of isAdjustedToUTC for time type?

2021-06-21 Thread Zoltan Ivanfi
Hi, The UTC normalization mechanism was created primarily with timestamps (date + time) in mind. The flag was only added to pure times for the sake of consistency. In my opinion implementations should write the correct flag for future proofness and check the flag when reading, but I don't think

[jira] [Assigned] (PARQUET-1353) The random data generator used for tests repeats the same value over and over again

2019-12-02 Thread Zoltan Ivanfi (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi reassigned PARQUET-1353: -- Assignee: (was: Zoltan Ivanfi) > The random data generator used for te

Re: [VOTE] Release Apache Parquet 1.11.0 RC7

2019-11-26 Thread Zoltan Ivanfi
+1 (binding) - I have read through the problem reports in this e-mail thread (one caused by the use of a private method via reflection and another one caused by having mixed versions of the libraries on the classpath) and I am convinced that they do not block the release. - Signature and hash of

Re: release process - using rc tags

2019-10-30 Thread Zoltan Ivanfi
+1 On Wed, Oct 30, 2019 at 2:03 PM Gabor Szadovszky wrote: > > Dear All, > > Our current tagging policy in the release process requires to use the same > tag for all the release candidates which means at RC2 we remove the tag > from RC1 head and adds again to the RC2 head and so on. I think it

[jira] [Assigned] (PARQUET-1337) Current block alignment logic may lead to several row groups per block

2019-09-26 Thread Zoltan Ivanfi (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi reassigned PARQUET-1337: -- Assignee: (was: Zoltan Ivanfi) > Current block alignment logic may l

[jira] [Created] (PARQUET-1628) Accept local timestamps annotated with the legacy timestamp types

2019-07-18 Thread Zoltan Ivanfi (JIRA)
Zoltan Ivanfi created PARQUET-1628: -- Summary: Accept local timestamps annotated with the legacy timestamp types Key: PARQUET-1628 URL: https://issues.apache.org/jira/browse/PARQUET-1628 Project

[jira] [Created] (PARQUET-1627) Update specification so that legacy timestamp logical types can be written for local semantics as well

2019-07-18 Thread Zoltan Ivanfi (JIRA)
Zoltan Ivanfi created PARQUET-1627: -- Summary: Update specification so that legacy timestamp logical types can be written for local semantics as well Key: PARQUET-1627 URL: https://issues.apache.org/jira/browse

Re: Floating point data compression for Apache Parquet

2019-07-16 Thread Zoltan Ivanfi
> Pull request: https://github.com/apache/parquet-format/pull/144 < > > > https://github.com/apache/parquet-format/pull/144> > > > > > > > > > > > > > > > I also have a WIP patch for adding the "BYTE_STREAM_SPLIT" encoding to > >

Re: [VOTE] Parquet Bloom filter spec sign-off

2019-07-15 Thread Zoltan Ivanfi
s. > > > > > > > > On Wed, Jul 10, 2019 at 5:26 PM Zoltan Ivanfi > > wrote: > > > > > > Hi Junjie, > > > > > > Since there are ongoing improvements addressing review comments, I would > > > hold off with the vote for a fe

Re: Forward compatibility issues with TIMESTAMP_MILLIS/MICROS ConvertedType

2019-07-11 Thread Zoltan Ivanfi
Hi Wes, Sounds good to me. Br, Zoltan On Thu, Jul 11, 2019 at 4:14 PM Wes McKinney wrote: > > On Thu, Jul 11, 2019 at 8:17 AM Zoltan Ivanfi > wrote: > > > > Hi Wes, > > > > I did a little bit of testing using pyarrow 0.14.0. I know that this > > is n

Re: Forward compatibility issues with TIMESTAMP_MILLIS/MICROS ConvertedType

2019-07-11 Thread Zoltan Ivanfi
es (this is in the end not really > > related to parquet, but to how pandas gets converted to pyarrow) > > > > Op do 11 jul. 2019 om 09:17 schreef Zoltan Ivanfi > >: > > > >> ... > >> This matched my expectations up until pd_mixed. I was surpr

Re: Forward compatibility issues with TIMESTAMP_MILLIS/MICROS ConvertedType

2019-07-11 Thread Zoltan Ivanfi
allowing other readers to enjoy the default you believe > > is warranted. It seems most straightforward that it would be an additional > > parameter on the TimestampType, but I supposed it could reside in the > > FileMetaData struct (on the assumption that the schema elements, having >

Re: Forward compatibility issues with TIMESTAMP_MILLIS/MICROS ConvertedType

2019-07-11 Thread Zoltan Ivanfi
now whether that is the intended behaviour, but in any case, based on this short manual testing, the new timestamp types written by pyarrow are interoperable with the Java library. Br, Zoltan On Wed, Jul 10, 2019 at 4:30 PM Wes McKinney wrote: > > Correct > > On Wed, Jul 10, 2019 at 9:2

[jira] [Updated] (PARQUET-1222) Specify a well-defined sorting order for float and double types

2019-07-10 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1222: --- Description: Currently parquet-format specifies the sort order for floating point numbers

Re: Forward compatibility issues with TIMESTAMP_MILLIS/MICROS ConvertedType

2019-07-10 Thread Zoltan Ivanfi
issues and others we need to make a new release within the next 7-10 > days. We can point you to nightly Python builds to make testing for > you easier so you don't have to build the project yourself. > > - Wes > > On Wed, Jul 10, 2019 at 9:11 AM Zoltan Ivanfi > wrote: > > >

Re: Forward compatibility issues with TIMESTAMP_MILLIS/MICROS ConvertedType

2019-07-10 Thread Zoltan Ivanfi
Hi, Oh, and one more thing: Before releasing the next Arrow version incorporating the new logical types, we should definitely test that their behaviour matches that of parquet-mr. When is the next release planned to come out? Br, Zoltan On Wed, Jul 10, 2019 at 3:57 PM Zoltan Ivanfi wrote

Re: Forward compatibility issues with TIMESTAMP_MILLIS/MICROS ConvertedType

2019-07-10 Thread Zoltan Ivanfi
> we were doing in the C++ library prior to the introduction of the > LogicalType metadata. > > On Wed, Jul 10, 2019 at 8:20 AM Zoltan Ivanfi > wrote: > > > > Hi Wes, > > > > Both of the semantics are deterministic in one aspect and indeterministic > > in

Re: Forward compatibility issues with TIMESTAMP_MILLIS/MICROS ConvertedType

2019-07-10 Thread Zoltan Ivanfi
Hi Tim, In my opinion the specification of the older timestamp types only allowed UTC-normalized storage, since these types were defined as the number of milli/microseconds elapsed since the Unix epoch. This clearly defines the meaning of the numeric value 0 as 0 seconds after the Unix epoch,

Re: Forward compatibility issues with TIMESTAMP_MILLIS/MICROS ConvertedType

2019-07-10 Thread Zoltan Ivanfi
lly non-existent, whereas I'm hearing about > Spark gotchas all the time. > > On Tue, Jul 9, 2019 at 11:34 AM Zoltan Ivanfi > wrote: > > > > Hi Wes, > > > > The rules for TIMESTAMP forward-compatibility were created based on the > > assumption that TIMESTAMP_MIL

Re: [VOTE] Parquet Bloom filter spec sign-off

2019-07-10 Thread Zoltan Ivanfi
Hi Junjie, Since there are ongoing improvements addressing review comments, I would hold off with the vote for a few more days until the specification settles. Br, Zoltan On Wed, Jul 10, 2019 at 9:32 AM 俊杰陈 wrote: > Hi Parquet committers and developers > > We are waiting for your important

Re: Forward compatibility issues with TIMESTAMP_MILLIS/MICROS ConvertedType

2019-07-09 Thread Zoltan Ivanfi
Hi Wes, The rules for TIMESTAMP forward-compatibility were created based on the assumption that TIMESTAMP_MILLIS and TIMESTAMP_MICROS have only been used in the instant aka. UTC-normalized semantics so far. This assumption was supported by two sources: 1. The specification: parquet-format

Re: Floating point data compression for Apache Parquet

2019-07-08 Thread Zoltan Ivanfi
ND just then we make adjustments to > the Apache Parquet spec? > > > Any piece of advice is welcome! > > > Regards, > > Martin > > > -- > *From:* Zoltan Ivanfi > *Sent:* Friday, July 5, 2019 4:21:39 PM > *To:* Radev, Martin >

Re: Floating point data compression for Apache Parquet

2019-07-05 Thread Zoltan Ivanfi
ithout > any concerns for a license. > > > *> **Are any of the investigated strategies applicable to DECIMAL values?* > The lossy compressors SZ and ZFP do not support that outside of the box. I > could communicate with the SZ developers to come to a decision how this can >

Re: [DISCUSS] Prepare release for parquet-format 2.7.0?

2019-07-05 Thread Zoltan Ivanfi
, Zoltan On Fri, Jul 5, 2019 at 1:33 PM Zoltan Ivanfi wrote: > Hi, > > I just noticed that yesterday I misunderstood that the Bloom filter is a > part of the column chunk metadata, when in fact it is only the offset of it > that is stored there. In this case we definitely need to pay

Re: [DISCUSS] Prepare release for parquet-format 2.7.0?

2019-07-05 Thread Zoltan Ivanfi
. > > > > > > On Fri, Jul 5, 2019 at 5:14 PM 俊杰陈 wrote: > > > > > > > Sure, please see this PR > > > > <https://github.com/apache/parquet-format/pull/140> or update file > > here > > > > < > > > >

Re: [DISCUSS] Prepare release for parquet-format 2.7.0?

2019-07-04 Thread Zoltan Ivanfi
and c++ implementation, and I think that is not hard to > > > handle. So I propose to use xxHash (the XXH64 version) as the default > > > hash strategy and deprecate previous murmur3 hash. > > > > > > I will update vote thread as well to make it clearer to all.

Re: Floating point data compression for Apache Parquet

2019-07-03 Thread Zoltan Ivanfi
ortant to go over. > > > Let me know if you have any questions or concerns. > > > Regards, > > Martin > > > From: Zoltan Ivanfi > Sent: Thursday, June 13, 2019 2:16:56 PM > To: Parquet Dev > Cc: Raoofy, Amir; Karlstetter, Roman > Subject: Re: Flo

Re: [DISCUSS] Prepare release for parquet-format 2.7.0?

2019-07-03 Thread Zoltan Ivanfi
> > On Wed, Jul 3, 2019 at 6:08 PM Zoltan Ivanfi > wrote: > > > > Hi Junjie, > > > > I think the vote is ambiguous in its current form (can people vote on one > > option only or can they vote on both?) and has a low chance of getting > > votes in general

Re: [DISCUSS] Prepare release for parquet-format 2.7.0?

2019-07-03 Thread Zoltan Ivanfi
ers and developers? > > I 'd like to start a vote firstly, you could either provide your input here > or on vote thread. > > > > On Mon, Jul 1, 2019 at 8:20 PM Zoltan Ivanfi > wrote: > > > Hi, > > > > I would like to clarify one point of my previous e-

Re: [DISCUSS] Prepare release for parquet-format 2.7.0?

2019-07-01 Thread Zoltan Ivanfi
Hi, I would like to clarify one point of my previous e-mail: While I reasoned that for compressions and encodings we should avoid picking algorithms superseded by better ones, I also reasoned that for bloom filters we do not necessarily have to be as strict, because a reader with missing

New PMC member: Gabor Szadovszky

2019-06-28 Thread Zoltan Ivanfi
Hi, The Project Management Committee (PMC) for Apache Parquet has invited Gabor Szadovszky to become a member of the PMC and we are pleased to announce that he has accepted. Congratulations, Gabor! Br, Zoltan

Re: [DISCUSS] Prepare release for parquet-format 2.7.0?

2019-06-28 Thread Zoltan Ivanfi
Hi, I think the concern was not about the lack of any specific hash algorithm, but about the choice of the one that got added. Generally for compressions and encodings, we are very picky about which ones to add to specification, because it has to be implemented in every language binding. This is

New committer: Nandor Kollar

2019-06-25 Thread Zoltan Ivanfi
Hi, The Project Management Committee (PMC) for Apache Parquet has invited Nandor Kollar to become a committer and we are pleased to announce that he has accepted. Congratulations and welcome, Nandor! Br, Zoltan

New committer: Fokko Driesprong

2019-06-25 Thread Zoltan Ivanfi
Hi, The Project Management Committee (PMC) for Apache Parquet has invited Fokko Driesprong to become a committer and we are pleased to announce that he has accepted. Congratulations and welcome, Fokko! Br, Zoltan

Re: [DISCUSS] Prepare release for parquet-format 2.7.0?

2019-06-24 Thread Zoltan Ivanfi
Hi, Actually there is a repo at https://github.com/apache/parquet-testing that may be used for making sure that the Java, C++ and other implementations are interoperable. But in the context of a parquet-format release I don't think we need tests for the interoperability of implementations,

Re: Floating point data compression for Apache Parquet

2019-06-13 Thread Zoltan Ivanfi
Hi Martin, Thanks for your interest in improving Parquet. Efficient encodings are really important in a big data file format, so this topic is definitely worth researching and personally I am looking forward to your report. Whether to add any new encodings to Parquet, however, can not be answered

[jira] [Resolved] (PARQUET-1588) Bump Apache Thrift to 0.12.0 in parquet-format

2019-06-12 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi resolved PARQUET-1588. Resolution: Fixed > Bump Apache Thrift to 0.12.0 in parquet-for

[jira] [Updated] (PARQUET-1588) Bump Apache Thrift to 0.12.0 in parquet-format

2019-06-12 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1588: --- Fix Version/s: format-2.7.0 > Bump Apache Thrift to 0.12.0 in parquet-for

[jira] [Commented] (PARQUET-1588) Bump Apache Thrift to 0.12.0 in parquet-format

2019-06-12 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16861932#comment-16861932 ] Zoltan Ivanfi commented on PARQUET-1588: It already existed, just not as "2.7.0" but

[jira] [Reopened] (PARQUET-1588) Bump Apache Thrift to 0.12.0

2019-06-12 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi reopened PARQUET-1588: As we discussed, let's stick to your original approach of separate JIRA-s for parquet-mr

[jira] [Updated] (PARQUET-1588) Bump Apache Thrift to 0.12.0 in parquet-format

2019-06-12 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1588: --- Summary: Bump Apache Thrift to 0.12.0 in parquet-format (was: Bump Apache Thrift

[jira] [Updated] (PARQUET-1590) [parquet-format] Add Java 11 to Travis

2019-06-11 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1590: --- Summary: [parquet-format] Add Java 11 to Travis (was: Build against Java 11) > [parq

[jira] [Reopened] (PARQUET-1590) Build against Java 11

2019-06-11 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi reopened PARQUET-1590: > Build against Java 11 > - > > Key:

[jira] [Updated] (PARQUET-1499) [parquet-mr] Add Java 11 to Travis

2019-06-11 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1499: --- Summary: [parquet-mr] Add Java 11 to Travis (was: Add Java 11 build to the repository

Re: Add support for Java 11

2019-06-11 Thread Zoltan Ivanfi
s://issues.apache.org/jira/browse/PARQUET-1496>. Would be great to fix > all the remaining issues that are blocking for parquet-format, do a release > and then resolve the remaining issues of parquet-format. I'm happy to work > on this as well. > > Cheers, Fokko > > Op d

Re: Add support for Java 11

2019-06-11 Thread Zoltan Ivanfi
Hi Fokko, Have you seen https://issues.apache.org/jira/browse/PARQUET-1551 and its children? There are some more blocking issues mentioned there. Br, Zoltan On Mon, Jun 10, 2019 at 9:19 PM Driesprong, Fokko wrote: > > Hi all, > > I'm working towards making Parquet compatible with Java 11.

Re: [vote] Merge bloom-filter branch to master

2019-06-11 Thread Zoltan Ivanfi
Hi, It has been merged into master but has not been released yet. In fact, I asked for a minor change before releasing it: https://github.com/apache/parquet-format/commit/54839ad5e04314c944fed8aa4bc6cf15e4a58698#r31084264 It may seem like a nit, but I think the naming of the parquet structures is

[jira] [Resolved] (PARQUET-1585) Update old external links in the code base

2019-05-24 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi resolved PARQUET-1585. Resolution: Fixed Fix Version/s: 1.11.0 > Update old external links in the c

[jira] [Created] (PARQUET-1585) Update old external links in the code base

2019-05-24 Thread Zoltan Ivanfi (JIRA)
Zoltan Ivanfi created PARQUET-1585: -- Summary: Update old external links in the code base Key: PARQUET-1585 URL: https://issues.apache.org/jira/browse/PARQUET-1585 Project: Parquet Issue

Re: Writing INT96 timestamp in parquet from either avro/protobuf records

2019-05-21 Thread Zoltan Ivanfi
> I assume once it’s in Hive, similar support will propagate to the original > Presto parquet reader which uses Hive? > > Thanks. > > - > Ying > > On Fri, May 17, 2019 at 4:00 AM Zoltan Ivanfi > wrote: > > > Hi Ying, > > > > Int

Re: Writing INT96 timestamp in parquet from either avro/protobuf records

2019-05-17 Thread Zoltan Ivanfi
Hi Ying, Int64 timestamp support is already in the works for Hive, but merging it into the codebase is blocked on the release of parquet-mr 1.11.0 at this moment. Here are the JIRA-s you can track: - HIVE-21215: Read Parquet INT64 timestamp - HIVE-21216: Write Parquet INT64 timestamp There is

Re: Parquet vs. other Open Source Columnar Formats

2019-05-17 Thread Zoltan Ivanfi
Hi, Regarding available open-source columnar formats, I have also come across https://carbondata.apache.org/ but do not really know anything about it other than it exists. Br, Zoltan On Thu, May 16, 2019 at 11:27 PM Wes McKinney wrote: > hi Brian, > > Anecdotal evidence suggests that Parquet

[jira] [Created] (PARQUET-1572) Clarify the definition of timestamp types

2019-05-09 Thread Zoltan Ivanfi (JIRA)
Zoltan Ivanfi created PARQUET-1572: -- Summary: Clarify the definition of timestamp types Key: PARQUET-1572 URL: https://issues.apache.org/jira/browse/PARQUET-1572 Project: Parquet Issue Type

[jira] [Comment Edited] (PARQUET-1496) [Java] Update Scala for JDK 11 compatibility

2019-05-03 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16801910#comment-16801910 ] Zoltan Ivanfi edited comment on PARQUET-1496 at 5/3/19 3:01 PM

Re: Parquet Sync - Meeting notes

2019-05-02 Thread Zoltan Ivanfi
Hi, I would like to add the following to the notes for topic "1. key signing": - Zoltan brought up the question of whether and how PMC-s from the US could remotely sign the keys of committers/PMC-s located in Europe. - Julien and Ryan commented that for the purpose of signing releases it is not

Key signing (was: [VOTE] Release Apache Parquet 1.11.0 RC6)

2019-04-29 Thread Zoltan Ivanfi
, Zoltan On Mon, Apr 29, 2019 at 7:52 PM Wes McKinney wrote: > > On Mon, Apr 29, 2019 at 12:48 PM Zoltan Ivanfi > wrote: > > > > Hi, > > > > An excerpt from > > https://www.apache.org/dev/release-signing#verifying-signature : "A > > signature is v

Re: [VOTE] Release Apache Parquet 1.11.0 RC6

2019-04-29 Thread Zoltan Ivanfi
llowing? - Our keys have been committed to the central KEYS file using our apache credentials. - We could personally confirm this in the next Parquet sync. - We could even read the key ID-s out loud if needed. Br, Zoltan On Mon, Apr 29, 2019 at 7:11 PM Zoltan Ivanfi wrote: > > Hi Wes, >

Re: [VOTE] Release Apache Parquet 1.11.0 RC6

2019-04-29 Thread Zoltan Ivanfi
ly > > > > > quiet. > > > > > > > > > > Thanks. > > > > > > > > > > On 4/3/19, 10:38 AM, "Andy Grove" wrote: > > > > > > > > > > CAUTION – UNVERIFIED EXTERNAL EMAIL > > > > > >

[jira] [Updated] (PARQUET-1556) Problem with Maven repo specifications in POMs of dependencies in some development environments

2019-04-04 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1556: --- Description: Running {{mvn verify}} based on the instructions in the README results

[jira] [Updated] (PARQUET-1556) Problem with Maven repo specifications in POMs of dependencies in some development environments

2019-04-04 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1556: --- Summary: Problem with Maven repo specifications in POMs of dependencies in some

[jira] [Commented] (PARQUET-1556) Add twitter maven repo to POM for hadoop-lzo dependency

2019-04-04 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16809841#comment-16809841 ] Zoltan Ivanfi commented on PARQUET-1556: I came to the conclusion that the only possible source

[jira] [Comment Edited] (PARQUET-1556) Add twitter maven repo to POM for hadoop-lzo dependency

2019-04-04 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16809841#comment-16809841 ] Zoltan Ivanfi edited comment on PARQUET-1556 at 4/4/19 1:16 PM: I came

[jira] [Updated] (PARQUET-1556) Add twitter maven repo to POM for hadoop-lzo dependency

2019-04-03 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1556: --- Description: Running {{mvn verify}} based on the instructions in the README results

[jira] [Commented] (PARQUET-1556) Add twitter maven repo to POM for hadoop-lzo dependency

2019-04-03 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16808877#comment-16808877 ] Zoltan Ivanfi commented on PARQUET-1556: That's a very good point, thanks for raising it. We

[jira] [Updated] (PARQUET-1556) Add twitter maven repo to POM for hadoop-lzo dependency

2019-04-03 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1556: --- Description: Running mvn verify based on the instructions in the README results

[jira] [Updated] (PARQUET-1556) Add twitter maven repo to POM for hadoop-lzo dependency

2019-04-03 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1556: --- Summary: Add twitter maven repo to POM for hadoop-lzo dependency (was: Instructions

[jira] [Commented] (PARQUET-1556) Instructions are missing for configuring twitter maven repo for hadoop-lzo dependency

2019-04-03 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16808851#comment-16808851 ] Zoltan Ivanfi commented on PARQUET-1556: Now that is strange. If I issue this command: {code

Re: [VOTE] Release Apache Parquet 1.11.0 RC6

2019-04-03 Thread Zoltan Ivanfi
orking on setting up my m2 settings to be able to read from > there, but this is something that really needs to be documented. > > Once I figure it out, I will create a JIRA + PR to update the README. > > Thanks. > > On 4/3/19, 8:55 AM, "Zoltan Ivanfi"

Re: [VOTE] Release Apache Parquet 1.11.0 RC6

2019-04-03 Thread Zoltan Ivanfi
to this jar in maven, so would appreciate some guidance. > > > > Thanks, > > > > Andy, > > > > > > On 3/21/19, 3:40 PM, "Zoltan Ivanfi" wrote: > > > > CAUTION – UNVERIFIED EXTERNAL EMAIL > > > > > > Hi Wes

[jira] [Comment Edited] (PARQUET-1496) [Java] Update Scala for JDK 11 compatibility

2019-03-27 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16803012#comment-16803012 ] Zoltan Ivanfi edited comment on PARQUET-1496 at 3/27/19 4:55 PM

[jira] [Updated] (PARQUET-1496) [Java] Update Scala for JDK 11 compatibility

2019-03-27 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1496: --- Summary: [Java] Update Scala for JDK 11 compatibility (was: [Java] Update Scala to 2.12

[jira] [Commented] (PARQUET-1496) [Java] Update Scala to 2.12

2019-03-27 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16803012#comment-16803012 ] Zoltan Ivanfi commented on PARQUET-1496: According to [https://docs.scala-lang.org/overviews

[jira] [Resolved] (PARQUET-1497) [Java] javax annotations dependency missing for Java 11

2019-03-27 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi resolved PARQUET-1497. Resolution: Fixed Fix Version/s: 1.11.0 > [Java] javax annotations depende

[jira] [Comment Edited] (PARQUET-1496) [Java] Update Scala to 2.12

2019-03-26 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16801910#comment-16801910 ] Zoltan Ivanfi edited comment on PARQUET-1496 at 3/26/19 4:35 PM

[jira] [Commented] (PARQUET-1496) [Java] Update Scala to 2.12

2019-03-26 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16801910#comment-16801910 ] Zoltan Ivanfi commented on PARQUET-1496: There seems to be an unresolvable circular

[jira] [Updated] (PARQUET-1497) [Java] javax annotations dependency missing for Java 11

2019-03-26 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1497: --- Summary: [Java] javax annotations dependency missing for Java 11 (was: [Java] Building

[jira] [Created] (PARQUET-1551) Support Java 11 - top-level JIRA

2019-03-26 Thread Zoltan Ivanfi (JIRA)
Zoltan Ivanfi created PARQUET-1551: -- Summary: Support Java 11 - top-level JIRA Key: PARQUET-1551 URL: https://issues.apache.org/jira/browse/PARQUET-1551 Project: Parquet Issue Type: Task

[jira] [Updated] (PARQUET-1550) CleanUtil does not work in Java 11

2019-03-26 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1550: --- Issue Type: Bug (was: Task) > CleanUtil does not work in Java

[jira] [Created] (PARQUET-1550) CleanUtil does not work in Java 11

2019-03-26 Thread Zoltan Ivanfi (JIRA)
Zoltan Ivanfi created PARQUET-1550: -- Summary: CleanUtil does not work in Java 11 Key: PARQUET-1550 URL: https://issues.apache.org/jira/browse/PARQUET-1550 Project: Parquet Issue Type: Task

Re: [VOTE] Release Apache Parquet 1.11.0 RC6

2019-03-21 Thread Zoltan Ivanfi
PS: Oh, and Java 11 is not supported, only Java 8. You also need to have mvn installed. Zoltan On Thu, Mar 21, 2019 at 10:40 PM Zoltan Ivanfi wrote: > Hi Wes, > > Here is a list of steps (the first part is probably the same as for > parquet-cpp): > > 1. Download the

Re: [VOTE] Release Apache Parquet 1.11.0 RC6

2019-03-21 Thread Zoltan Ivanfi
the end (followed by a few extra lines). Br, Zoltan On Thu, Mar 21, 2019 at 7:58 PM Wes McKinney wrote: > Are there any instructions written down about how to verify this release? > > On Wed, Mar 20, 2019 at 8:50 AM Zoltan Ivanfi > wrote: > > > > +1 (binding)

Re: [VOTE] Release Apache Parquet 1.11.0 RC6

2019-03-20 Thread Zoltan Ivanfi
+1 (binding) signature matches git hash matches the git tag source tarball matches the git tag unit tests and integration tests pass On Tue, Mar 19, 2019 at 3:00 PM Gabor Szadovszky wrote: > Dear Parquet Users and Developers, > > I propose the following RC to be released as the official Apache

[jira] [Resolved] (PARQUET-1543) Execute the TIMESTAMP types roadmap

2019-02-28 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi resolved PARQUET-1543. Resolution: Not A Problem Accidentally opened JIRA for the wrong project. > Exec

[jira] [Created] (PARQUET-1543) Execute the TIMESTAMP types roadmap

2019-02-28 Thread Zoltan Ivanfi (JIRA)
Zoltan Ivanfi created PARQUET-1543: -- Summary: Execute the TIMESTAMP types roadmap Key: PARQUET-1543 URL: https://issues.apache.org/jira/browse/PARQUET-1543 Project: Parquet Issue Type: Task

[jira] [Updated] (PARQUET-1533) TestSnappy() throws OOM exception with Parquet-1485 change

2019-02-19 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1533: --- Description: Parquet-1485 initialize the buffer size(inputBuffer and outputBuffer) from 0

Re: [VOTE] Release Apache Parquet 1.11.0 RC3

2019-02-12 Thread Zoltan Ivanfi
PR to document how to install thrift > > on > > > > OSX: https://github.com/apache/parquet-mr/pull/595 > > > > > > > > Cheers > > > > Uwe > > > > > > > > On Fri, Jan 18, 2019, at 1:55 PM, Anna Szonyi wrote

Re: [VOTE] Release Apache Parquet 1.11.0 RC3

2019-01-17 Thread Zoltan Ivanfi
Hi, Friendly reminder to please vote for the release. We need 2 more binding +1 votes. Thanks, Zoltan On Sat, Jan 12, 2019 at 3:07 AM 俊杰陈 wrote: > +1 (non-binding) > * contents looks good > * unit tests passed > > > Zoltan Ivanfi 于2019年1月11日周五 下午9:31写道:

Adding more timestamp types to on-disk storage formats

2019-01-17 Thread Zoltan Ivanfi
Hi, There is an ongoing effort amongst the SQL engines of the Hadoop stack to support different timestamp semantics. This development has some implications for the low-level timestamp types as well. The new timestamp types added to the different SQL engines will rely on the decisions of the lower

Re: [Discussion] How to build bloom filter in parquet

2019-01-17 Thread Zoltan Ivanfi
Hi, I like the idea of specifying the maximum acceptable size of the bloom filter bit vector. I think it would be much better than specifying the expected number of distinct values (which we can not expect from the API consumer in my opinion). The desired false positives probability could still

[jira] [Resolved] (PARQUET-1491) Conditional debug logging in InternalParquetRecordReader to reduce GC

2019-01-14 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi resolved PARQUET-1491. Resolution: Not A Problem > Conditional debug logging in InternalParquetRecordRea

Re: [VOTE] Release Apache Parquet 1.11.0 RC3

2019-01-11 Thread Zoltan Ivanfi
> > +1 (non-binding) > > Cheers, > Gabor > > On Wed, Jan 9, 2019 at 4:51 PM Zoltan Ivanfi > wrote: > > > Dear Parquet Users and Developers, > > > > I propose the following RC to be released as the official Apache > > Parquet 1.11.0 release: >

Re: Deploy parquet-format snapshot to maven repo

2019-01-10 Thread Zoltan Ivanfi
ne' and' mvn install' to > before_install section in .travis.yml the build move forward to build. Also > I agree with you about never depend on SNAPSHOT jar. > > > Zoltan Ivanfi 于2019年1月7日周一 下午10:38写道: >> >> Hi Junjie, >> >> There seems to be some pr

[jira] [Updated] (PARQUET-1490) Add branch-specific Travis steps

2019-01-09 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1490: --- Description: The script for the main branch has to make sure that POM files in the master

[jira] [Created] (PARQUET-1490) Add branch-specific Travis steps

2019-01-09 Thread Zoltan Ivanfi (JIRA)
Zoltan Ivanfi created PARQUET-1490: -- Summary: Add branch-specific Travis steps Key: PARQUET-1490 URL: https://issues.apache.org/jira/browse/PARQUET-1490 Project: Parquet Issue Type

[VOTE] Release Apache Parquet 1.11.0 RC3

2019-01-09 Thread Zoltan Ivanfi
Dear Parquet Users and Developers, I propose the following RC to be released as the official Apache Parquet 1.11.0 release: The commit id is 8be767d12cca295cf9858a521725fc440b0c6f93 * This corresponds to the tag: apache-parquet-1.11.0 *

[jira] [Resolved] (PARQUET-1461) Third party code does not compile after parquet-mr minor version update

2019-01-09 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi resolved PARQUET-1461. Resolution: Fixed > Third party code does not compile after parquet-mr minor vers

[jira] [Created] (PARQUET-1487) Do not write original type for timezone-agnostic timestamps

2019-01-07 Thread Zoltan Ivanfi (JIRA)
Zoltan Ivanfi created PARQUET-1487: -- Summary: Do not write original type for timezone-agnostic timestamps Key: PARQUET-1487 URL: https://issues.apache.org/jira/browse/PARQUET-1487 Project: Parquet

Re: [VOTE] Release Apache Parquet 1.11.0 RC2

2019-01-03 Thread Zoltan Ivanfi
able to vote, but probably not until after the holidays. I > > presume many others got very busy with end of year and are in the same > > boat. > > > > - Wes > > > > On Mon, Dec 17, 2018 at 6:07 AM Zoltan Ivanfi > > wrote: > > > > > > Hi, > >

[jira] [Updated] (PARQUET-1478) Can't read spec compliant, 3-level lists via parquet-proto

2019-01-03 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1478: --- Affects Version/s: 1.11.0 > Can't read spec compliant, 3-level lists via parquet-pr
