Deepak: encryption, column statistics

Zoltan: vote on the release

Nandor:

Ryan (Netflix): release candidate, validation of the release, encryption

Gidon (IBM): update encryption

Lars (Cloudera Impala):

Qinghui (Criteo): PR in parquet-proto, next release.

Replace current proto compiler: maven-protoc plugin: more portable

Support Enum in protobuf in a backward-compatible way

Steven (Yelp)

Protobuf:

   - 2 PRs:
      - More portable proto plugin
      - Enum support. PARQUET-1455, https://github.com/apache/parquet-mr/pull/561
         - Makes it consistent with protobuf behavior.
   - Actions:
      - Merge PR with the new proto plugin.
      - Have a committer familiar with proto (Benoit) review #561


Encryption:

   - Mention of order-preserving encryption. Can be used to compare
     encrypted statistics
   - Finalizing the spec
      - No more technical changes
      - Just clarifying the details
      - Goal to merge it before the end of the year.
   - Action:
      - Gidon to send an updated version in a few days
      - Will start a thread this week
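
Illustration of the order-preserving point above: a toy sketch, not the scheme
discussed in the spec. It assumes small integer values and uses a seeded,
non-cryptographic PRNG; the function name `build_order_preserving_table` and
all values are made up for this example. It only shows why min/max statistics
stay comparable after such a mapping.

```python
import random
from typing import Dict


def build_order_preserving_table(key: int, domain_size: int) -> Dict[int, int]:
    """Map 0..domain_size-1 to strictly increasing pseudo-random integers.

    Toy construction for illustration only: random.Random is NOT a
    cryptographic primitive, and this is not a secure encryption scheme.
    """
    rng = random.Random(key)  # the "key" is just a PRNG seed here
    table = {}
    cipher = 0
    for plain in range(domain_size):
        cipher += rng.randint(1, 1000)  # strictly positive gap preserves order
        table[plain] = cipher
    return table


table = build_order_preserving_table(key=42, domain_size=256)

# Hypothetical "encrypted" min/max statistics of a column chunk.
enc_min, enc_max = table[10], table[200]

# Hypothetical "encrypted" filter value supplied by a query.
enc_probe = table[57]

# A reader without the key can still tell whether the probe falls in range,
# because a < b in plaintext implies table[a] < table[b].
probe_in_range = enc_min <= enc_probe <= enc_max
```

The trade-off is that anyone who sees the transformed values also learns their
relative order, which is the property that makes the comparison possible.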


Release candidate:

   - Need to vote on the release
   - What tests have been done?
      - Unit tests
      - Benchmark for perf
      - Cloudera internal integration tests.
         - Some updates to that integration pipeline due to parquet changes
           (proto, shaded avro)
         - Hive -> Hive, Hive -> Impala, Impala -> Hive tested
         - Ran Spark unit tests
      - Not run:
         - Spark benchmark
      - parquet-cpp lagging behind
      - Action:
         - Gabor, Zoltan: Produce a summary
         - Write a validator that verifies the contract of statistics (e.g.
           all values greater than or equal to the min)
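
A minimal sketch of what such a statistics validator could check, assuming
integer values already decoded into memory. `ColumnChunkStats` and
`validate_statistics` are hypothetical names for this illustration and are not
part of any Parquet API; a real validator would read values and statistics
from the file and respect the column's sort order.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class ColumnChunkStats:
    """Hypothetical stand-in for the statistics stored with a column chunk."""
    min_value: Optional[int]
    max_value: Optional[int]
    null_count: int


def validate_statistics(values: Sequence[Optional[int]],
                        stats: ColumnChunkStats) -> List[str]:
    """Return the contract violations found; an empty list means none."""
    problems = []
    non_null = [v for v in values if v is not None]

    # The recorded null count must match what was actually read.
    actual_nulls = len(values) - len(non_null)
    if stats.null_count != actual_nulls:
        problems.append(
            f"null_count is {stats.null_count}, but {actual_nulls} nulls were read")

    # No value may be smaller than the recorded min.
    if stats.min_value is not None:
        below = [v for v in non_null if v < stats.min_value]
        if below:
            problems.append(
                f"{len(below)} value(s) below min {stats.min_value}")

    # No value may be larger than the recorded max.
    if stats.max_value is not None:
        above = [v for v in non_null if v > stats.max_value]
        if above:
            problems.append(
                f"{len(above)} value(s) above max {stats.max_value}")

    return problems


# Usage: one chunk whose statistics hold, and one with a value below the min.
ok_report = validate_statistics([3, None, 7], ColumnChunkStats(3, 7, 1))
bad_report = validate_statistics([3, 2, 7], ColumnChunkStats(3, 7, 0))
```

Returning a list of messages rather than raising on the first failure lets one
run report every broken invariant for a chunk.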
