[
https://issues.apache.org/jira/browse/CALCITE-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17748376#comment-17748376
]
JingDas edited comment on CALCITE-5756 at 7/28/23 3:00 AM:
-----------------------------------------------------------
This [PR|https://github.com/apache/calcite/pull/3264] contains two features,
one is
[RelMetadataQuery#getForeignKeys|https://github.com/JingDas/calcite/blob/f1c55fcde9f6a1c25529dd00ebc679e87f57a39f/core/src/main/java/org/apache/calcite/rel/metadata/RelMetadataQuery.java#L526]
which is get foreign key metadata from current relNode
the other is expanding
[ProjectJoinRemoveRule|https://github.com/JingDas/calcite/blob/f1c55fcde9f6a1c25529dd00ebc679e87f57a39f/core/src/main/java/org/apache/calcite/rel/rules/ProjectJoinRemoveRule.java#L79]
to support inner join which depends on feature one.
I notice that the code in the PR is too much which mostly is test code, should
I split the RP into two PRs
one is for feature one and the other is for feature two, or maintain the
current situation. If you hava any ideas, please let me know. Thanks.
was (Author: JIRAUSER292370):
This [PR|https://github.com/apache/calcite/pull/3264] contains two features,
one is
[RelMetadataQuery#getForeignKeys|https://github.com/JingDas/calcite/blob/f1c55fcde9f6a1c25529dd00ebc679e87f57a39f/core/src/main/java/org/apache/calcite/rel/metadata/RelMetadataQuery.java#L526]
which is get foreign key metadata from current relNode
the other is expanding ProjectJoinRemoveRule to support inner join which
depends on feature one.
I notice that the code in the PR is too much which mostly is test code, should
I split the RP into two PRs
one is for feature one and the other is for feature two, or maintain the
current situation. If you hava any ideas, please let me know. Thanks.
> Expand ProjectJoinRemoveRule to support inner join removal by using the
> foreign-unique constraints
> --------------------------------------------------------------------------------------------------
>
> Key: CALCITE-5756
> URL: https://issues.apache.org/jira/browse/CALCITE-5756
> Project: Calcite
> Issue Type: Improvement
> Components: core
> Reporter: JingDas
> Assignee: JingDas
> Priority: Major
> Labels: pull-request-available
>
> Join elimination is a useful optmize improvement.
> Consider a query that joins the two tables but does not make use of the Dept
> columns:
> {code:java}
> SELECT Emp.name, Emp.salary
> FROM Emp, Dept
> WHERE Emp.deptno = Dept.dno {code}
> Assuming Emp.deptno is the foreign-key and is non-null, Dept.dno is the
> unique-key. The sql above can be rewritten as following. remove the Dept
> table without affecting the resultset.
> {code:java}
> SELECT Emp.name, Emp.salary
> FROM Emp {code}
> Without redundant join elimination, this query execution may perform poorly.
> The optimize improvement is also available in SQL Server, Oracle and
> Snowflake and so on.
> In Calcite, i think that is also useful. The infrastructure that join
> elimination depend on is already available.
> The main steps are as follows:
> 1. Analyse the column used by project, and then split them to left and right
> side.
> 2. Acccording to the project info above and outer join type, bail out in some
> scene.
> 3. Get join info such as join keys.
> 4. For inner join check foreign and unique keys, these may use
> RelMetadataQuery#getForeignKeys(newly add, similar to
> RelMetadataQuery#getUniqueKeys),
> RelOptTable#getReferentialConstraints.
> 5. Check removing side join keys are areColumnsUnique both for outer join and
> inner join.
> 6. If all done, calculate the fianl project and transform.
> Please help me to check the improvement whether is useful or not.
> And i would like to add this improvement to Calcite.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)