[ 
https://issues.apache.org/jira/browse/FLINK-100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Fabian Hueske updated FLINK-100:
--------------------------------
    Priority: Minor  (was: Major)

> Pact API Proposal: Add keyless CoGroup (send all to a single group)
> -------------------------------------------------------------------
>
>                 Key: FLINK-100
>                 URL: https://issues.apache.org/jira/browse/FLINK-100
>             Project: Flink
>          Issue Type: Improvement
>            Reporter: GitHub Import
>            Priority: Minor
>              Labels: github-import
>             Fix For: pre-apache
>
>
> I propose to add a keyless version of CoGroup that groups both inputs in a 
> single group, analogous to the keyless Reducer version that was added in 
> https://github.com/dimalabs/ozone/pull/61
> ```
> CoGroupContract myCoGroup = CoGroupContract.builder(MyUdf.class)
>     .input1(contractA)
>     .input2(contractB)
>     .build();
> ```
> I have a use case where I need to process the output of two contracts in a 
> single udf and I currently have to use the workaround to add a constant field 
> and use this as grouping key.
> Adding a keyless version would reduce the overhead (network traffic, 
> serialization and code-writing) and give the compiler additional knowledge 
> (The compiler knows that there will be only a single group and a single udf 
> call. If setAvgRecordsEmittedPerStubCall is set, it could infer the output 
> cardinality)
> Furthermore I think this would be consequent, because CoGroup is like Reduce 
> for multiple inputs.
> ---------------- Imported from GitHub ----------------
> Url: https://github.com/stratosphere/stratosphere/issues/100
> Created by: [andrehacker|https://github.com/andrehacker]
> Labels: enhancement, 
> Created at: Sat Sep 14 23:15:59 CEST 2013
> State: open



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to