[
https://issues.apache.org/jira/browse/HIVE-26524?focusedWorklogId=814019&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-814019
]
ASF GitHub Bot logged work on HIVE-26524:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 05/Oct/22 20:02
Start Date: 05/Oct/22 20:02
Worklog Time Spent: 10m
Work Description: kasakrisz commented on code in PR #3588:
URL: https://github.com/apache/hive/pull/3588#discussion_r985669610
##########
ql/src/test/results/clientpositive/llap/masking_10.q.out:
##########
@@ -137,9 +136,7 @@ STAGE PLANS:
Tez
#### A masked pattern was here ####
Edges:
- Reducer 2 <- Map 1 (CUSTOM_SIMPLE_EDGE), Reducer 4 (CUSTOM_SIMPLE_EDGE)
- Reducer 3 <- Map 1 (SIMPLE_EDGE), Reducer 2 (SIMPLE_EDGE)
- Reducer 4 <- Map 1 (SIMPLE_EDGE)
+ Reducer 2 <- Map 1 (SIMPLE_EDGE), Map 3 (SIMPLE_EDGE)
Review Comment:
This is the query after the masking is applied:
```
select `alias01`.`key`, `alias01`.`value`, `alias02`.`a`, `alias02`.`value`,
`alias03`.`key`, `alias03`.`value` from
(SELECT `key`, CAST(reverse(value) AS string) AS `value`,
BLOCK__OFFSET__INSIDE__FILE, INPUT__FILE__NAME, ROW__ID, ROW__IS__DELETED FROM
`default`.`masking_test` WHERE key % 2 = 0 and key < 10)`alias01`
left join
(
select 2017 as `a`, `value` from (SELECT `key`, CAST(reverse(value) AS
string) AS `value`, BLOCK__OFFSET__INSIDE__FILE, INPUT__FILE__NAME, ROW__ID,
ROW__IS__DELETED FROM `default`.`masking_test` WHERE key % 2 = 0 and key <
10)`masking_test` group by 1, 2
) `alias02`
on `alias01`.key = `alias02`.`a`
left join
(SELECT `key`, CAST(reverse(value) AS string) AS `value`,
BLOCK__OFFSET__INSIDE__FILE, INPUT__FILE__NAME, ROW__ID, ROW__IS__DELETED FROM
`default`.`masking_test` WHERE key % 2 = 0 and key < 10)`alias03`
on `alias01`.key = `alias03`.key
```
The first join has the condition `alias01.key = alias02.a`.
In the left branch there is a Filter on `key`: `key % 2 = 0 and key < 10`.
In the right branch `a` is the constant `2017`, so the join condition always
evaluates to `false` (a `key` below 10 can never equal 2017) and the left join
is replaced by its left branch.
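The equivalence can be checked on a small stand-in for the test table. The following is a minimal sketch, not Hive's implementation: it uses Python's built-in `sqlite3` module instead of Hive, the rows in `masking_test` are invented, the `reverse(value)` masking expression is left out because it does not affect the join, and `SELECT DISTINCT` stands in for `group by 1, 2`. It only shows that a left join whose condition can never be true returns exactly the rows of its left branch, with the right-hand columns as NULL.

```python
import sqlite3

# In-memory stand-in for `default`.`masking_test`; the rows are made up.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE masking_test (key INTEGER, value TEXT)")
conn.executemany(
    "INSERT INTO masking_test VALUES (?, ?)",
    [(k, f"val_{k}") for k in range(20)],
)

# Row filter taken from the masked query (value masking omitted).
masked = "SELECT key, value FROM masking_test WHERE key % 2 = 0 AND key < 10"

# Left join against a branch whose join column is the constant 2017.
with_join = conn.execute(f"""
    SELECT alias01.key, alias01.value, alias02.a, alias02.value
    FROM ({masked}) AS alias01
    LEFT JOIN (
        SELECT DISTINCT 2017 AS a, value FROM ({masked}) AS m
    ) AS alias02
      ON alias01.key = alias02.a
    ORDER BY alias01.key
""").fetchall()

# The same result without the join: left branch plus NULL right-hand columns.
without_join = conn.execute(f"""
    SELECT key, value, NULL AS a, NULL AS value2
    FROM ({masked}) AS alias01
    ORDER BY key
""").fetchall()

print(with_join == without_join)
```

Because every `key` that passes the filter is below 10, no row can match `a = 2017`, which is why the two result sets agree for any table contents.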
##########
ql/src/test/results/clientpositive/llap/ppd_udf_col.q.out:
##########
@@ -80,22 +80,9 @@ STAGE DEPENDENCIES:
STAGE PLANS:
Stage: Stage-0
Fetch Operator
- limit: -1
+ limit: 0
Processor Tree:
- TableScan
- alias: src
- filterExpr: (UDFToDouble(key) = 100.0D) (type: boolean)
- Filter Operator
- predicate: (UDFToDouble(key) = 100.0D) (type: boolean)
- Limit
- Number of rows: 0
- Select Operator
- expressions: key (type: string)
- outputColumnNames: _col0
- Select Operator
- expressions: _col0 (type: string), rand() (type: double),
'4' (type: string)
- outputColumnNames: _col0, _col1, _col2
- ListSink
+ ListSink
Review Comment:
This is the empty plan:
```
STAGE PLANS:
Stage: Stage-0
Fetch Operator
limit: 0
Processor Tree:
ListSink
```
Issue Time Tracking
-------------------
Worklog Id: (was: 814019)
Time Spent: 5h 40m (was: 5.5h)
> Use Calcite to remove sections of a query plan known never produces rows
> ------------------------------------------------------------------------
>
> Key: HIVE-26524
> URL: https://issues.apache.org/jira/browse/HIVE-26524
> Project: Hive
> Issue Type: Improvement
> Components: CBO
> Reporter: Krisztian Kasa
> Assignee: Krisztian Kasa
> Priority: Major
> Labels: pull-request-available
> Time Spent: 5h 40m
> Remaining Estimate: 0h
>
> Calcite has a set of rules to remove sections of a query plan that are known
> never to produce any rows. In some cases the whole plan can be removed. Such
> plans are represented with a single {{Values}} operator with no tuples, e.g.:
> {code:java}
> select y + 1 from (select a1 y, b1 z from t1 where b1 > 10) q WHERE 1=0
> {code}
> {code:java}
> HiveValues(tuples=[[]])
> {code}
> In other cases, when the plan has an outer join or set operators, some
> branches can be replaced with empty values. Going further, in some cases the
> join/set operator itself can be removed:
> {code:java}
> select a2, b2 from t2 where 1=0
> union
> select a1, b1 from t1
> {code}
> {code:java}
> HiveAggregate(group=[{0, 1}])
> HiveTableScan(table=[[default, t1]], table:alias=[t1])
> {code}
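The union example from the description can be checked the same way. This is a minimal sketch under stated assumptions: it runs on Python's built-in `sqlite3` module rather than Hive and Calcite, the column types and rows of `t1` and `t2` are invented, and the `GROUP BY` query is a hand-written counterpart of the `HiveAggregate` over `HiveTableScan` plan shown above, not output produced by the optimizer.

```python
import sqlite3

# Tables named as in the issue description; types and rows are made up.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t1 (a1 INTEGER, b1 TEXT)")
conn.execute("CREATE TABLE t2 (a2 INTEGER, b2 TEXT)")
conn.executemany("INSERT INTO t1 VALUES (?, ?)", [(1, "x"), (1, "x"), (2, "y")])
conn.executemany("INSERT INTO t2 VALUES (?, ?)", [(9, "z")])

# Original query: the first branch can never produce rows.
union_rows = conn.execute("""
    SELECT a2, b2 FROM t2 WHERE 1 = 0
    UNION
    SELECT a1, b1 FROM t1
""").fetchall()

# Pruned form: only the duplicate elimination over t1 remains.
pruned_rows = conn.execute("""
    SELECT a1, b1 FROM t1 GROUP BY a1, b1
""").fetchall()

# UNION does not guarantee an order, so compare as sorted lists.
print(sorted(union_rows) == sorted(pruned_rows))
```

The `UNION` still has to remove duplicates from `t1`, which is why an aggregate remains in the plan after the empty branch is dropped.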