Parquet sync notes for 2018-12-18

2018-12-21 Thread Ryan Blue
Hi everyone, Here are the notes from Tuesday’s sync. Sorry they’re a bit late! *Attendees and topics*: - Zoltan Ivanfi (Cloudera) - New release candidate - Anna Szonyi (Cloudera) - New release candidate - Gabor Szadovszky (Cloudera) - New release candidate - Gidon Gershinsky (IBM) -

parquet-sync notes December 5 2018

2018-12-05 Thread Julien Le Dem
Deepak: encryption, column statistics Zoltan: vote on the release Nandor: Ryan (netflix): release candidate, validation of the release, encryption Gidon (IBM): update encryption Lars (Cloudera Impala): Qinghui (Criteo): PR in parquet-proto, next release. Replace current proto compiler:

Re: parquet sync notes

2018-10-15 Thread Aniket Mokashi
I would like to attend the next sync. Where do I find instructions to join this meeting? On Tue, Oct 9, 2018 at 10:13 AM Julien Le Dem wrote: > Gabor (Cloudera): column index, benchmark, nested types (filter, indexes) > Anna (Cloudera): process, feature branches, etiquette of waiting for >

parquet sync notes

2018-10-09 Thread Julien Le Dem
Gabor (Cloudera): column index, benchmark, nested types (filter, indexes) Anna (Cloudera): process, feature branches, etiquette of waiting for someone? Blocked Zoltan (Cloudera): Feature branches? When to review them? Nandor (Cloudera) parquet file with multiple row groups, schema evolution

Re: parquet sync notes

2018-09-27 Thread Zoltan Ivanfi
Hi, I have created the feature branches: - https://github.com/apache/parquet-mr/tree/bloom-filter - https://github.com/apache/parquet-format/tree/bloom-filter - https://github.com/apache/parquet-mr/tree/encryption - https://github.com/apache/parquet-format/tree/encryption I have also

Re: parquet sync notes

2018-09-26 Thread 俊杰陈
Hi Zoltan PR #62 contains some rebase info which is not relate to change itself so I created PR#99. Actually it only contains one file change now, I will add another document file later. Zoltan Ivanfi 于2018年9月26日周三 下午3:19写道: > Hi, > > It seems to me that PR #99 does not supersede PR #62, as

Re: parquet sync notes

2018-09-26 Thread Zoltan Ivanfi
Hi, It seems to me that PR #99 does not supersede PR #62, as the latter affects 16 files but the former only modifies a single one. Or has the rest of the changes been already merged to the codebase from another PR? I checked the history and I don't see anything related. Thanks, Zoltan On Wed,

Re: parquet sync notes

2018-09-25 Thread 俊杰陈
Hi the pr28 and pr62 of parquet-format was closed. Will we create a feature branch for bloom filter on parquet-mr as well? Julien Le Dem 于2018年9月26日周三 上午12:48写道: > Lars (Cloudera Impala): listen in. > Zoltan, Gabor and Nandor (Cloudera): > >- feature branch reviewed and merged >-

parquet sync notes

2018-09-25 Thread Julien Le Dem
Lars (Cloudera Impala): listen in. Zoltan, Gabor and Nandor (Cloudera): - feature branch reviewed and merged - Parquet-format release - - Define scope Ryan (Netflix) Junjie (tencent): bloom filter Jim Apple (cloud service): bloom filter in parquet-mr? Since they got in parquet-cpp

Re: Parquet sync notes

2018-06-12 Thread Gidon Gershinsky
There are four PRs, each dependent on its predecessor. Please review in this order: 1) #94 in parquet-format: Thrift additions (crypto structures) 2) #95 in parquet-format: encryption/decryption of footer, headers and column metadata - via cipher interfaces 3) #471 in parquet-mr: crypto

Parquet sync notes

2018-06-12 Thread Julien Le Dem
QingHui (Criteo): parquet-protobuf Lars (impala), Jim (Cloudera): Bloom filter benchmarks Ryan (Netflix): JunJie (Intel): Bloomfilter and dictionary comparison benchmarks Gidon (IBM): Encryption, feedback Xinli Shang (Uber): Encryption Bloomfilter and dictionary comparison benchmarks: -

Parquet sync notes

2018-06-07 Thread Julien Le Dem
Attendees / Agenda: Gidon (IBM): Parquet encryption. Uber, Vertica, Amazon Anna, Gabor, Nandor (Cloudera): Review for column indexing Junjie (tencent): Bloom filter Lars (Cloudera impala) Jim (Cloudera): Bloom filter Deepak (Vertica): Encryption Qinghui, Benoit (Criteo): parquet protobuf. Parquet