Xinli Shang created PARQUET-1792:
------------------------------------
Summary: Add 'mask' command to parquet-tools/parquet-cli
Key: PARQUET-1792
URL: https://issues.apache.org/jira/browse/PARQUET-1792
Project: Parquet
Issue Type: Improvement
Components: parquet-mr
Affects Versions: 1.12.0
Reporter: Xinli Shang
Assignee: Xinli Shang
Fix For: 1.12.0
Some personal data columns need to be masked instead of being
pruned(Parquet-1791). We need a tool to replace the raw data columns with
masked value. The masked value could be hash, null, redact etc. For the
unchanged columns, they should be moved as a whole like 'merge', 'prune'
command in Parquet-tools.
Implementing this feature in file format is 10X faster than doing it by
rewriting the table data in the query engine.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)