[
https://issues.apache.org/jira/browse/HIVE-18023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Aihua Xu updated HIVE-18023:
----------------------------
Resolution: Fixed
Fix Version/s: 3.0.0
Status: Resolved (was: Patch Available)
Pushed to master. Thanks, Yongzhi, for reviewing the code.
> Redact the expression in lineage info
> -------------------------------------
>
> Key: HIVE-18023
> URL: https://issues.apache.org/jira/browse/HIVE-18023
> Project: Hive
> Issue Type: Improvement
> Components: Logging
> Affects Versions: 2.1.0
> Reporter: Aihua Xu
> Assignee: Aihua Xu
> Priority: Trivial
> Fix For: 3.0.0
>
> Attachments: HIVE-18023.1.patch
>
>
> The query redactor redacts the query text itself, but the expression shown
> in the lineage info is not redacted, so it may still expose sensitive info.
> For example, the query
> {{select customers.id, customers.name from customers where
> customers.addresses['shipping'].zip_code ='1234-5678-1234-5678';}} produces
> the lineage log entry below, where {{queryText}} is redacted but the
> PREDICATE edge's {{expression}} still contains the literal value. The
> expression should also be redacted.
> {noformat}
> [HiveServer2-Background-Pool: Thread-43]:
> {"version":"1.0","user":"hive","timestamp":1510179280,"duration":40747,"jobIds":["job_1510150684172_0006"],"engine":"mr","database":"default","hash":"a2b4721a0935e3770d81649d24ab1cd4","queryText":"select
> customers.id, customers.name from customers where
> customers.addresses['shipping'].zip_code
> ='XXXX-XXXX-XXXX-XXXX'","edges":[{"sources":[2],"targets":[0],"edgeType":"PROJECTION"},{"sources":[3],"targets":[1],"edgeType":"PROJECTION"},{"sources":[],"targets":[0,1],"expression":"(addresses['shipping'].zip_code
> =
> '1234-5678-1234-5678')","edgeType":"PREDICATE"}],"vertices":[{"id":0,"vertexType":"COLUMN","vertexId":"customers.id"},{"id":1,"vertexType":"COLUMN","vertexId":"customers.name"},{"id":2,"vertexType":"COLUMN","vertexId":"default.customers.id"},{"id":3,"vertexType":"COLUMN","vertexId":"default.customers.name"}]}
> {noformat}
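To illustrate the requested behavior, here is a minimal Python sketch that applies one redaction rule to both the queryText field and every edge's expression field of a lineage entry. It is not the HIVE-18023 patch and does not use Hive's redactor API. The pattern, the mask, and the names SENSITIVE, MASK, redact and redact_lineage are assumptions made up for this example; only the field names and sample values come from the log entry above.

```python
import json
import re

# Hypothetical rule: four dash-separated groups of four digits,
# matching the literal used in the example query.
SENSITIVE = re.compile(r"\b\d{4}-\d{4}-\d{4}-\d{4}\b")
MASK = "XXXX-XXXX-XXXX-XXXX"


def redact(text):
    """Replace every match of the sensitive pattern with the mask."""
    return SENSITIVE.sub(MASK, text)


def redact_lineage(entry):
    """Return a copy of a lineage entry with queryText and all
    edge expressions redacted; the input entry is left unchanged."""
    result = dict(entry)
    if "queryText" in result:
        result["queryText"] = redact(result["queryText"])
    edges = []
    for edge in entry.get("edges", []):
        edge = dict(edge)  # copy so the original edge is not mutated
        if "expression" in edge:
            edge["expression"] = redact(edge["expression"])
        edges.append(edge)
    result["edges"] = edges
    return result


# Trimmed-down version of the lineage entry from the issue description.
entry = {
    "queryText": "select customers.id, customers.name from customers where "
                 "customers.addresses['shipping'].zip_code ='1234-5678-1234-5678'",
    "edges": [
        {"sources": [2], "targets": [0], "edgeType": "PROJECTION"},
        {"sources": [], "targets": [0, 1],
         "expression": "(addresses['shipping'].zip_code = '1234-5678-1234-5678')",
         "edgeType": "PREDICATE"},
    ],
}

redacted = redact_lineage(entry)
print(json.dumps(redacted["edges"][1]))
```

With this rule the PREDICATE edge's expression carries the same mask as queryText, so the literal no longer appears anywhere in the serialized entry.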
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)