Fang-Yu Rao created IMPALA-9597:
-----------------------------------
Summary: Deduplicate Ranger audits when a column is referenced
multiple times
Key: IMPALA-9597
URL: https://issues.apache.org/jira/browse/IMPALA-9597
Project: IMPALA
Issue Type: Improvement
Components: Frontend
Reporter: Fang-Yu Rao
Assignee: Fang-Yu Rao
After [IMPALA-9350|https://issues.apache.org/jira/browse/IMPALA-9350], Impala
is able to produce the corresponding Ranger audits when a query involves
policies of column masking. However, duplicate audit events will be produced if
a column is referenced multiple times in a query.
For instance, since the following query would result in 2 calls to
{{SelectStmt#analyze()}} on the same table, given that there is a column
masking policy for the column of {{string_col}}, we will see 2 duplicate audit
events for this column.
{noformat}
with iv as (select id, bool_col, string_col from functional.alltypestiny)
select * from iv;
{noformat}
We should thus deduplicate the audits in the case described above.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)