[
https://issues.apache.org/jira/browse/FLINK-6428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16063134#comment-16063134
]
ASF GitHub Bot commented on FLINK-6428:
---------------------------------------
Github user sunjincheng121 commented on the issue:
https://github.com/apache/flink/pull/3817
Sure.
> Add support DISTINCT in dataStream SQL
> --------------------------------------
>
> Key: FLINK-6428
> URL: https://issues.apache.org/jira/browse/FLINK-6428
> Project: Flink
> Issue Type: New Feature
> Components: Table API & SQL
> Reporter: sunjincheng
> Assignee: sunjincheng
>
> Add support DISTINCT in dataStream SQL as follow:
> DATA:
> {code}
> (name, age)
> (kevin, 28),
> (sunny, 6),
> (jack, 6)
> {code}
> SQL:
> {code}
> SELECT DISTINCT age FROM MyTable"
> {code}
> RESULTS:
> {code}
> 28, 6
> {code}
> To DataStream:
> {code}
> inputDS
> .keyBy() // KeyBy on all fields
> .flatMap() // Eliminate duplicate data
> {code}
> [~fhueske] do we need this feature?
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)