[ https://issues.apache.org/jira/browse/FLINK-12173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17906203#comment-17906203 ]
Jim Hughes commented on FLINK-12173:
------------------------------------
Hi [~lincoln.86xy], we have not done any benchmarking yet. I took a quick look
at Nexmark, and I do not believe there are queries of this form.
Is using that test harness the easiest/fastest way to benchmark things?
> Optimize "SELECT DISTINCT" into Deduplicate with keep first row
> ---------------------------------------------------------------
>
> Key: FLINK-12173
> URL: https://issues.apache.org/jira/browse/FLINK-12173
> Project: Flink
> Issue Type: Improvement
> Components: Table SQL / Planner
> Reporter: Jark Wu
> Assignee: Yiyu Tian
> Priority: Major
> Labels: pull-request-available
>
> The following DISTINCT query can be optimized into a Deduplicate on the keys
> "a, b, c, d" that keeps the first row.
> {code:sql}
> SELECT DISTINCT a, b, c, d FROM MyTable; -- MyTable is a placeholder table name
> {code}
> We can optimize this query into Deduplicate to get better performance than
> GroupAggregate.
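>
> As a rough sketch of the intended rewrite, the DISTINCT query above should be
> semantically equivalent to Flink SQL's deduplication pattern shown below. The
> table name {{MyTable}} and the processing-time attribute {{proctime}} are
> assumptions made for illustration; they do not come from this issue.
> {code:sql}
> -- Keep only the first row seen for each distinct (a, b, c, d).
> -- MyTable and proctime are hypothetical names.
> SELECT a, b, c, d
> FROM (
>   SELECT a, b, c, d,
>          ROW_NUMBER() OVER (PARTITION BY a, b, c, d ORDER BY proctime ASC) AS row_num
>   FROM MyTable
> )
> WHERE row_num = 1;
> {code}
> Since later duplicates of a key are simply dropped, a keep-first-row operator
> only has to remember that a key was already seen, which is presumably where
> the gain over maintaining a per-group aggregate comes from.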
--
This message was sent by Atlassian Jira
(v8.20.10#820010)