[ https://issues.apache.org/jira/browse/PARQUET-2223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jiashen Zhang updated PARQUET-2223: ----------------------------------- Description: h1. Background h2. What is Data Masking? Data masking is the process of obfuscating sensitive data. Instead of revealing PII data, masking allows us to return NULLs, hashes or redacted data in its place. With data masking, users who are in the correct permission groups can retrieve the original data and users without permissions will receive masked data. h2. Why do we need it? * Fined-Grained Access Control h2. Why do we want to enhance data masking? Users might not have all permissions for all columns, existing code doesn’t have support for us to skip columns that users don’t have permissions to access. This enhancement will add this support so that users can decide to skip some columns to avoid decryption error. h1. Design Requirements # Users can skip some columns with a configuration h1. Proposed solution Key idea is to modify the request schema by removing skipped columns from the schema. > Parquet Data Masking for Column Encryption > ------------------------------------------ > > Key: PARQUET-2223 > URL: https://issues.apache.org/jira/browse/PARQUET-2223 > Project: Parquet > Issue Type: Task > Reporter: Jiashen Zhang > Priority: Minor > > h1. Background > h2. What is Data Masking? > Data masking is the process of obfuscating sensitive data. Instead of > revealing PII data, masking allows us to return NULLs, hashes or redacted > data in its place. With data masking, users who are in the correct permission > groups can retrieve the original data and users without permissions will > receive masked data. > h2. Why do we need it? > * Fined-Grained Access Control > h2. Why do we want to enhance data masking? > > Users might not have all permissions for all columns, existing code doesn’t > have support for us to skip columns that users don’t have permissions to > access. This enhancement will add this support so that users can decide to > skip some columns to avoid decryption error. > h1. Design Requirements > # Users can skip some columns with a configuration > h1. Proposed solution > Key idea is to modify the request schema by removing skipped columns from the > schema. -- This message was sent by Atlassian Jira (v8.20.10#820010)