Quanlong Huang created HIVE-22823:
-------------------------------------

             Summary: Support column masking policy on nested columns
                 Key: HIVE-22823
                 URL: https://issues.apache.org/jira/browse/HIVE-22823
             Project: Hive
          Issue Type: New Feature
            Reporter: Quanlong Huang


Hive currently ignores column masking policies on nested columns. Only masking 
policies on top level columns are recognized.

For example, create a table with nested columns as:
{code:sql}
CREATE TABLE customers(
  id int,   name string, 
  email_preferences 
struct<email_format:string,frequency:string,categories:struct<promos:boolean,surveys:boolean>>,
 
 addresses 
map<string,struct<street_1:string,street_2:string,city:string,state:string,zip_code:string>>,
 
  orders 
array<struct<order_id:string,order_date:string,items:array<struct<product_id:int,sku:string,name:string,price:double,qty:int>>>>
) stored as ORC;
{code}

In Ranger, create a column masking policy on a nested column 
"email_preferences.email_format" to nullify the values. The policy won't be 
applied in queries  on this table, neither query like "select 
email_preferences.email_format from customers" nor "select email_preferences 
from customers".

It's good to support column masking policies on nested columns.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to