1/26/2022
Attendees: Xinli Shang, Gidon Gershinsky, Pavi Subenderan, Jason Zhang
1.
Data masking
1.
Pavi: Will create a PR by next week
2.
PARQUET-2062 <https://issues.apache.org/jira/browse/PARQUET-2062>
3.
Will have a high-level design sent out soon
2.
Cell level encryption
1.
Xinli: Will send out the draft design soon
2.
Key questions: Should we have the same key for all the cells in the
same column? It could generate millions of keys if we do it.
3.
There are two options explored: 1)Use FPE to encrypt in place, 2) add
extra columns to utilize existing modular encryption. Will have
them in the
design.
3.
Release of 1.13.0
1.
Data masking(null)
1.
PARQUET-2062 <https://issues.apache.org/jira/browse/PARQUET-2062>
will be done in a few weeks.
2.
ID resolution instead of name
1.
PARQUET-2006 <https://issues.apache.org/jira/browse/PARQUET-2062>,
need to see if it needs specification change and the scope of
the change
and ETA. We will decide should we include it in 1.13.0.
Xinli Shang
Apache Parquet PMC Chair
Teach Lead Manager at Uber Data Infra