[
https://issues.apache.org/jira/browse/FLINK-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Flink Jira Bot closed FLINK-1267.
---------------------------------
Resolution: Auto Closed
This issue was labeled "stale-minor" 7 ago and has not received any updates so
I have gone ahead and closed it. If you are still affected by this or would
like to raise the priority of this ticket please re-open, removing the label
"auto-closed" and raise the ticket priority accordingly.
> Add crossGroup operator
> -----------------------
>
> Key: FLINK-1267
> URL: https://issues.apache.org/jira/browse/FLINK-1267
> Project: Flink
> Issue Type: New Feature
> Components: API / DataSet, Runtime / Task
> Affects Versions: 0.7.0-incubating
> Reporter: Fabian Hueske
> Assignee: pietro pinoli
> Priority: Minor
> Labels: auto-closed, stale-assigned
>
> A common operator is to pair-wise compare or combine all elements of a group
> (there were two questions about this on the user mailing list, recently).
> Right now, this can be done in two ways:
> 1. {{groupReduce}}: consume and store the complete iterator in memory and
> build all pairs
> 2. do a self-{{Join}}: the engine builds all pairs of the full symmetric
> Cartesian product.
> Both approaches have drawbacks. The {{groupReduce}} variant requires that the
> full group fits into memory and is more cumbersome to implement for the user,
> but pairs can be arbitrarily built. The self-{{Join}} approach pushes most of
> the work into the system, but the execution strategy does not treat the
> self-join different from a regular join (both identical inputs are shuffled,
> etc.) and always builds the full symmetric Cartesian product.
> I propose to add a dedicated {{crossGroup()}} operator, that offers this
> functionality in a proper way.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)