[jira] [Commented] (CALCITE-5894) Add SortRemoveRedundantRule to remove redundant sort fields if they are functionally dependent by other sort fields

JingDas (Jira) Tue, 08 Aug 2023 20:10:05 -0700


    [ 
https://issues.apache.org/jira/browse/CALCITE-5894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17752235#comment-17752235
 ]


JingDas commented on CALCITE-5894:
----------------------------------

As said in the book [《Exploiting Functional Dependence in Query 
Optimization》|[https://cs.uwaterloo.ca/research/tr/2000/11/CS-2000-11.thesis.pdf]|https://cs.uwaterloo.ca/research/tr/2000/11/CS-2000-11.thesis.pdf],]

Functional dependency analysis is a valuable and challenging work. I think 
support order optimization
(in particular sort avoidance) by the relevant functional dependency metadata 
firstly, and the get complete functional dependency function step by step.

What do you think?

> Add SortRemoveRedundantRule to remove redundant sort fields if they are 
> functionally dependent by other sort fields
> -------------------------------------------------------------------------------------------------------------------
>
>                 Key: CALCITE-5894
>                 URL: https://issues.apache.org/jira/browse/CALCITE-5894
>             Project: Calcite
>          Issue Type: New Feature
>            Reporter: JingDas
>            Assignee: JingDas
>            Priority: Minor
>
> In some scene, Sort fields can be reduct, if sort fields contain unique key
> For example
> {code:java}
> SELECT ename, salary FROM Emp
> order by empno, ename{code}
> where `empno` is a key,  `ename` is redundant since `empno` alone is 
> sufficient to determine the order of any two records.
> So the SQL can be optimized as following:
> {code:java}
> SELECT name, Emp.salary FROM Emp
> order by empno{code}
> For another example:
> {code:java}
> SELECT e_agg.c, e_agg.ename
> FROM
> (SELECT count(*) as c, ename, job FROM Emp GROUP BY ename, job) AS e_agg
> ORDER BY e_agg.ename, e_agg.c {code}
> Although `e_agg.ename` is not a key but field `ename` is unique and not null, 
> it can be optimized as following:
> {code:java}
> SELECT e_agg.c, e_agg.ename
> FROM (SELECT count(*) as c, ename, job FROM Emp GROUP BY ename, job) AS e_agg
> ORDER BY e_agg.ename{code}
> Sorting is an expensive operation, however. Therefore, it is imperative that 
> sorting
> is optimized to avoid unnecessary sort field.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Commented] (CALCITE-5894) Add SortRemoveRedundantRule to remove redundant sort fields if they are functionally dependent by other sort fields

Reply via email to