Here <https://docs.google.com/document/d/1Q-d98Os_aJahUynznPrWvXwWQeN0aFDRhZj3hXt_JOM> is the link for the Cell-Level encryption pre-design. Feel free to share the feedback in the file directly by adding comments.
On Wed, Jan 26, 2022 at 9:51 AM Xinli shang <[email protected]> wrote: > 1/26/2022 > > Attendees: Xinli Shang, Gidon Gershinsky, Pavi Subenderan, Jason Zhang > > 1. > > Data masking > 1. > > Pavi: Will create a PR by next week > 2. > > PARQUET-2062 <https://issues.apache.org/jira/browse/PARQUET-2062> > 3. > > Will have a high-level design sent out soon > 2. > > Cell level encryption > 1. > > Xinli: Will send out the draft design soon > 2. > > Key questions: Should we have the same key for all the cells in the > same column? It could generate millions of keys if we do it. > 3. > > There are two options explored: 1)Use FPE to encrypt in place, 2) > add extra columns to utilize existing modular encryption. Will have > them in > the design. > 3. > > Release of 1.13.0 > 1. > > Data masking(null) > 1. > > PARQUET-2062 <https://issues.apache.org/jira/browse/PARQUET-2062> > will be done in a few weeks. > 2. > > ID resolution instead of name > 1. > > PARQUET-2006 <https://issues.apache.org/jira/browse/PARQUET-2062>, > need to see if it needs specification change and the scope of the > change > and ETA. We will decide should we include it in 1.13.0. > > > > Xinli Shang > Apache Parquet PMC Chair > Teach Lead Manager at Uber Data Infra > > > -- Xinli Shang
