[
https://issues.apache.org/jira/browse/PARQUET-1659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16935610#comment-16935610
]
Gidon Gershinsky commented on PARQUET-1659:
-------------------------------------------
Sounds good, we are basically in sync on this. Lets understand the technical
details behind the observed difference. For each encrypted module type (pages,
page headers, columnmetadata, etc) could you find the number of modules and the
total size of each type in your files (by simply adding a debug code in the
right places).
> Add AES-CTR to Parquet Encryption
> ----------------------------------
>
> Key: PARQUET-1659
> URL: https://issues.apache.org/jira/browse/PARQUET-1659
> Project: Parquet
> Issue Type: Improvement
> Components: parquet-cpp, parquet-format, parquet-mr
> Affects Versions: format-2.6.0
> Reporter: Xinli Shang
> Priority: Minor
> Labels: pull-request-available
>
> AES-GCM-CTR perform GCM encryption on metadata and CTR encryption on data.
> AES-CTR would perform CTR encryption on both.
> During Perf testing, we found AES-CTR can improve read/write performance by
> ~10% comparing with AES-GCM-CTR.
>
> I checked with Gidon and the initial assumption was that AES-GCM-CTR would
> have similar performance as AES-CTR. But with recent performance
> benchmarking, we found it is worthy to introduce AES-CTR. Since many
> companies strive for parquet performance improvement.
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)