[
https://issues.apache.org/jira/browse/FLINK-17074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17264243#comment-17264243
]
Theo Diefenthal commented on FLINK-17074:
-----------------------------------------
What would now be the preferred approach to select multiple fields from a POJO
as key? Say I have a POJO with 10 fields and I want to use three of them (with
different types) as key. With the old API, I just specified the three field
names. Now I need to build up a KeySelector, create a tuple and put my fields
in there and build up proper field types deduced from the POJO field types
myself? Is there a utility method somewhere?
> Deprecate DataStream.keyBy() that use tuple/expression keys
> -----------------------------------------------------------
>
> Key: FLINK-17074
> URL: https://issues.apache.org/jira/browse/FLINK-17074
> Project: Flink
> Issue Type: Improvement
> Components: API / DataStream, API / Scala
> Reporter: Aljoscha Krettek
> Assignee: Etienne Chauchot
> Priority: Major
> Labels: pull-request-available, starter
> Fix For: 1.11.0
>
>
> Currently you can either specify a {{KeySelector}}, tuple positions, and
> expression keys? I think {{KeySelectors}} are strictly superior and with
> lambdas (or function references) quite easy to use. Tuple/expression keys
> use reflection underneath to do the field accesses, so performance is
> strictly worse. Also, when using a {{KeySelector}} you will have a meaningful
> key type {{KEY}} in your operations while for tuple/expression keys the key
> type is simply {{Tuple}}.
> Tuple/expression keys were introduced before Java got support for lambdas in
> Java 8 and before we added the Table API. Nowadays, using a lambda is little
> more typing than using an expression key but is (possibly) faster and more
> type safe. The Table API should be used for these more
> expression-based/relational use cases.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)