[
https://issues.apache.org/jira/browse/PARQUET-1178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17035421#comment-17035421
]
Gidon Gershinsky commented on PARQUET-1178:
-------------------------------------------
[~jasbru] Currently, this can't be run on a Databricks cluster - besides the
Thrift structures in parquet-format-2.7.0, it will also require a Java
implementation of parquet encryption and key management libraries (not merged
yet, but we're working on this).
> Parquet modular encryption
> --------------------------
>
> Key: PARQUET-1178
> URL: https://issues.apache.org/jira/browse/PARQUET-1178
> Project: Parquet
> Issue Type: New Feature
> Reporter: Gidon Gershinsky
> Assignee: Gidon Gershinsky
> Priority: Major
>
> A mechanism for modular encryption and decryption of Parquet files. Allows to
> keep data fully encrypted in the storage - while enabling efficient analytics
> on the data, via reader-side extraction / authentication / decryption of data
> subsets required by columnar projection and predicate push-down.
> Enables fine-grained access control to column data by encrypting different
> columns with different keys.
> Supports a number of encryption algorithms, to account for different security
> and performance requirements.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)