[jira] [Commented] (CALCITE-5894) Add SortRemoveRedundantRule to remove redundant sort fields if they are functionally dependent by other sort fields

LakeShen (Jira) Tue, 08 Aug 2023 21:44:04 -0700


    [ 
https://issues.apache.org/jira/browse/CALCITE-5894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17752258#comment-17752258
 ]


LakeShen commented on CALCITE-5894:
-----------------------------------

FunctionalDependency is a very important optimization for an optimizer, and I 
think it would be an exciting feature if calcite could support functional 
dependencies. 

Functional dependency can be used in many aspects of optimization. Including 
order by, aggregate,selectivity estimation and so on.

We could create a umbrella jira to track FunctionalDependency feature,at the 
same time, under this umbrella jira, create multiple small task jiras, so that 
others can also contribute to the development.I would like to participate in 
the development of this Feature.

[~julianhyde] [~libenchao] [~jingda]  WDYT?

> Add SortRemoveRedundantRule to remove redundant sort fields if they are 
> functionally dependent by other sort fields
> -------------------------------------------------------------------------------------------------------------------
>
>                 Key: CALCITE-5894
>                 URL: https://issues.apache.org/jira/browse/CALCITE-5894
>             Project: Calcite
>          Issue Type: New Feature
>            Reporter: JingDas
>            Assignee: JingDas
>            Priority: Minor
>
> In some scene, Sort fields can be reduct, if sort fields contain unique key
> For example
> {code:java}
> SELECT ename, salary FROM Emp
> order by empno, ename{code}
> where `empno` is a key,  `ename` is redundant since `empno` alone is 
> sufficient to determine the order of any two records.
> So the SQL can be optimized as following:
> {code:java}
> SELECT name, Emp.salary FROM Emp
> order by empno{code}
> For another example:
> {code:java}
> SELECT e_agg.c, e_agg.ename
> FROM
> (SELECT count(*) as c, ename, job FROM Emp GROUP BY ename, job) AS e_agg
> ORDER BY e_agg.ename, e_agg.c {code}
> Although `e_agg.ename` is not a key but field `ename` is unique and not null, 
> it can be optimized as following:
> {code:java}
> SELECT e_agg.c, e_agg.ename
> FROM (SELECT count(*) as c, ename, job FROM Emp GROUP BY ename, job) AS e_agg
> ORDER BY e_agg.ename{code}
> Sorting is an expensive operation, however. Therefore, it is imperative that 
> sorting
> is optimized to avoid unnecessary sort field.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Commented] (CALCITE-5894) Add SortRemoveRedundantRule to remove redundant sort fields if they are functionally dependent by other sort fields

Reply via email to