[ 
https://issues.apache.org/jira/browse/CALCITE-4683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated CALCITE-4683:
------------------------------------
    Labels: pull-request-available  (was: )

> In-list to join with new project generated cause replaced root leaves 
> property to be missing
> --------------------------------------------------------------------------------------------
>
>                 Key: CALCITE-4683
>                 URL: https://issues.apache.org/jira/browse/CALCITE-4683
>             Project: Calcite
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 1.26.0, 1.27.0
>         Environment: jdk8
>  
>            Reporter: yanjing.wang
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> When Calcite converts a in list to a join, the original relational algebra 
> root may be replaced by a new project, for example
>  
> {code:java}
> SELECT * FROM (SELECT '20210101' AS dt, deptno, max(cast(deptno2 as
> varchar(200))) as m FROM (SELECT emp.deptno as deptno, dept.deptno as deptno2 
> FROM emp
> JOIN dept on emp.deptno = dept.deptno) tmp GROUP BY deptno) WHERE cast(deptno 
> as
> varchar) in ('1', '3', '5')
> {code}
> When we set InSubQueryThreshold value to 2, then the relational algebra tree 
> will be converted from 
> {code:java}
> LogicalProject(DT=['20210101'], DEPTNO=[$0], M=[$1])
>   LogicalAggregate(group=[{0}], M=[MAX($1)])
>     LogicalProject(DEPTNO=[$7], $f1=[CAST($9):VARCHAR(200) NOT NULL])
>       LogicalJoin(condition=[=($7, $9)], joinType=[inner])
>         LogicalTableScan(table=[[CATALOG, SALES, EMP]])
>         LogicalTableScan(table=[[CATALOG, SALES, DEPT]])
> {code}
> to
> {code:java}
> LogicalJoin(condition=[=($3, $4)], joinType=[inner])
>   LogicalProject(DT=['20210101'], DEPTNO=[$0], M=[$1], 
> DEPTNO0=[CAST($0):VARCHAR NOT NULL])
>     LogicalAggregate(group=[{0}], M=[MAX($1)])
>       LogicalProject(DEPTNO=[$7], $f1=[CAST($9):VARCHAR(200) NOT NULL])
>         LogicalJoin(condition=[=($7, $9)], joinType=[inner])
>           LogicalTableScan(table=[[CATALOG, SALES, EMP]])
>           LogicalTableScan(table=[[CATALOG, SALES, DEPT]])
>   LogicalAggregate(group=[{0}])
>     LogicalValues(tuples=[[{ '1' }, { '3' }, { '5' }]])
> {code}
> We can see that LogicalProject(DT=['20210101'], DEPTNO=[$0], M=[$1]) with 
> leaves being true has been replaced by LogicalProject(DT=['20210101'], 
> DEPTNO=[$0], M=[$1], DEPTNO0=[CAST($0):VARCHAR NOT NULL]) with leaves being 
> false. 
> Finally, the query results java.lang.AssertionError: Conversion to relational 
> algebra failed to preserve datatypes. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to