[ 
https://issues.apache.org/jira/browse/CALCITE-1507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15723361#comment-15723361
 ] 

Maryann Xue commented on CALCITE-1507:
--------------------------------------

Could you please kindly review my PR for this issue 
(https://github.com/apache/calcite/pull/333)?
In order to verify the count-preserving case in {{testSortJoinTranspose6()}}, I 
added unique keys into SALES tables. As a result, AggregateFilterTransposeRule 
and AggregateJoinTransposeRule were not fired in a few tests in RelOptRuleTest, 
since they bail out for already unique group keys. Also, change of uniqueness 
will affect row count number in RelMetadataTest. So I made some adjustment to 
the affected tests by replacing unique key columns with non-unique key columns.

> OFFSET cannot be pushed through a JOIN if the non-preserved side of outer 
> join is not count-preserving
> ------------------------------------------------------------------------------------------------------
>
>                 Key: CALCITE-1507
>                 URL: https://issues.apache.org/jira/browse/CALCITE-1507
>             Project: Calcite
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 1.10.0
>            Reporter: Maryann Xue
>            Assignee: Maryann Xue
>            Priority: Minor
>
> If the non-preserved side of the outer join is not count-preserving, for each 
> row from the preserved side, there can be zero, one or multiple matches from 
> the non-preserved side, which means the join can produce one or multiple 
> rows. So it is safe to push a LIMIT through, but it is invalid to push an 
> OFFSET through.
> Take this query as an example:
> {code}
> select d.deptno, empno
>     from sales.dept d
>     left join sales.emp e using (deptno)
>     order by d.deptno offset 1
> {code}
> And rows from "dept" and "emp" tables are like:
> {code}
> "dept"
>   deptno
>   10
>   20
>   30
> "emp"
>   empno    deptno
>   101      10
>   102      10
>   105      30
> {code}
> The expected output is:
> {code}
> d.deptno    e.empno
> 10          102
> 20          null
> 30          105
> {code}
> While after applying SortJoinTransposeRule, the rel becomes:
> {code}
> LogicalProject(DEPTNO=[$0], EMPNO=[$2])
>   LogicalSort(sort0=[$0], dir0=[ASC], offset=[1])
>     LogicalJoin(condition=[=($0, $9)], joinType=[left])
>       LogicalSort(sort0=[$0], dir0=[ASC], offset=[1])
>         LogicalTableScan(table=[[CATALOG, SALES, DEPT]])
>       LogicalTableScan(table=[[CATALOG, SALES, EMP]])
> {code}
> And the output will now be:
> {code}
> d.deptno    e.empno
> 20          null
> 30          105
> {code}
> because deptno "10" has been skipped from the left relation by the pushed 
> through Sort node.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to