There are four PRs, each dependent on its predecessor. Please review in
this order:

1) #94 in parquet-format: Thrift additions (crypto structures)

2) #95 in parquet-format: encryption/decryption of footer, headers and
column metadata - via cipher interfaces

3) #471 in parquet-mr: crypto package: implementation of cipher interfaces,
AES-* algorithms, configuration and API of Parquet encryption

4) #472 in parquet-mr: utilization of crypto package in existing Parquet
classes for encryption/decryption of pages, and passing an encryptor object
to ParquetWriter/Reader.



On Tue, Jun 12, 2018 at 7:50 PM, Julien Le Dem <julien.le...@gmail.com>
wrote:

>  QingHui (Criteo): parquet-protobuf
> Lars (impala), Jim (Cloudera): Bloom filter benchmarks
> Ryan (Netflix):
> JunJie (Intel): Bloomfilter and dictionary comparison benchmarks
> Gidon (IBM): Encryption, feedback
> Xinli Shang (Uber): Encryption
>
> Bloomfilter and dictionary comparison benchmarks:
>
>    - PARQUET-41
>    - Feedback to find the number of distinct values for which bloom filter
>    outperforms filter based search
>    - Action: JunJie to share code and update benchmark
>
> Encryption:
>
>    - Progress on multi-key design: need review
>    - Need review on PR as well
>    - Discussion on how to pass parameters down to Parquet to specify what
>    to encrypt
>    - Action: Gidon to share PR again and others to review.
>

Reply via email to