[ 
https://issues.apache.org/jira/browse/IMPALA-14961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18081557#comment-18081557
 ] 

ASF subversion and git services commented on IMPALA-14961:
----------------------------------------------------------

Commit e1ca23d627532bb17228e3d455c55a03b3e28f49 in impala's branch 
refs/heads/master from Gokul Kolady
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=e1ca23d62 ]

IMPALA-14961: Query Profile Redaction

The query profile redaction component protects sensitive profile content
by replacing user, host, IP, SQL, table, and column identifiers with
deterministic aliases while preserving enough structure for analysis.
Table and column identifiers are targeted for redaction when they
appear in the select columns, where columns, join columns, or analyzed
query sections of the profile. The component also provides a reverse
unredaction path that restores the aliases in a string to their
original identifiers using the captured alias map.
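
As a rough illustration of the deterministic aliasing and unredaction
described above, a minimal sketch follows. It is not the code from this
commit: the AliasMap class, its method names, and the "table_1" /
"column_1" alias format are all assumptions made for the example.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <map>
#include <string>
#include <utility>

// Hypothetical helper (not Impala's actual class): assigns each distinct
// identifier a stable alias such as "table_1" and records the reverse mapping.
class AliasMap {
 public:
  // Returns the alias for (kind, original), creating one on first use so that
  // repeated occurrences of the same identifier always get the same alias.
  std::string AliasFor(const std::string& kind, const std::string& original) {
    const auto key = std::make_pair(kind, original);
    auto it = forward_.find(key);
    if (it != forward_.end()) return it->second;
    std::string alias = kind + "_" + std::to_string(++counters_[kind]);
    forward_[key] = alias;
    reverse_[alias] = original;
    return alias;
  }

  // Replaces every known alias in `text` with its original identifier.
  // Matching is done on whole [A-Za-z0-9_] tokens, so "table_1" is never
  // mistaken for a prefix of "table_10".
  std::string Unredact(const std::string& text) const {
    std::string out;
    std::size_t i = 0;
    while (i < text.size()) {
      if (!IsTokenChar(text[i])) {
        out += text[i++];
        continue;
      }
      std::size_t j = i;
      while (j < text.size() && IsTokenChar(text[j])) ++j;
      const std::string token = text.substr(i, j - i);
      auto it = reverse_.find(token);
      if (it != reverse_.end()) {
        out += it->second;
      } else {
        out += token;
      }
      i = j;
    }
    return out;
  }

 private:
  static bool IsTokenChar(char c) {
    return std::isalnum(static_cast<unsigned char>(c)) != 0 || c == '_';
  }

  std::map<std::pair<std::string, std::string>, std::string> forward_;
  std::map<std::string, std::string> reverse_;
  std::map<std::string, int> counters_;
};

// Usage example: redact two identifiers, then restore them from the alias map.
inline void RoundTripExample() {
  AliasMap aliases;
  const std::string redacted = "SELECT " + aliases.AliasFor("column", "ss_item_sk") +
                               " FROM " + aliases.AliasFor("table", "store_sales");
  assert(aliases.Unredact(redacted) == "SELECT ss_item_sk FROM store_sales");
}
```

The sketch assumes the original text never contains a string that looks
like an alias. A real implementation would have to handle that case and
would walk the JSON profile rather than a flat string.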

Tests added: query-profile-redaction-test.cc with golden-file coverage
for a TPC-DS profile redaction output, validation of alias-based
unredaction behavior, and negative tests ensuring non-JSON profile input
is rejected for both redaction and unredaction paths.

Code Generated-by: Cursor AI

Change-Id: If0c5b4911a64888f319f212155df6e08c1800b32
Reviewed-on: http://gerrit.cloudera.org:8080/24282
Reviewed-by: Impala Public Jenkins <[email protected]>
Tested-by: Impala Public Jenkins <[email protected]>


> Query Profile Redactor
> ----------------------
>
>                 Key: IMPALA-14961
>                 URL: https://issues.apache.org/jira/browse/IMPALA-14961
>             Project: IMPALA
>          Issue Type: Task
>            Reporter: Gokul Kolady
>            Assignee: Gokul Kolady
>            Priority: Major
>
> Add a query profile redactor that ingests a JSON query profile. It should 
> redact user names, host names, IP addresses, and any table/column names that 
> appear in the select columns, where columns, join columns, and analyzed query 
> sections of the profile. We should maintain consistent aliasing in the 
> redacted version of the profile and include functionality to un-redact text 
> that includes aliases from the redaction process.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
