[ 
https://issues.apache.org/jira/browse/BEAM-12?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15146869#comment-15146869
 ] 

bakeypan commented on BEAM-12:
------------------------------

Hi Daniel,
   Thank you so much for such a detailed explanation.
   I have also built a Dataflow-like framework, and in my experience 
removing one ParDo transform can help performance and simplify the 
program.
   I think I understand your point now: the design principles of Dataflow 
are more important. That clears things up for me.
   I think this issue can be closed.
   Thanks very much to Frances and Daniel for the clarification.

> Apply GroupByKey transforms on PCollection of normal type other than KV
> -----------------------------------------------------------------------
>
>                 Key: BEAM-12
>                 URL: https://issues.apache.org/jira/browse/BEAM-12
>             Project: Beam
>          Issue Type: Improvement
>          Components: sdk-java-core
>            Reporter: bakeypan
>            Assignee: Frances Perry
>            Priority: Trivial
>
> Currently the GroupByKey transform can only be applied to a 
> PCollection<KV<K,V>>, so I have to transform a PCollection<T> into a 
> PCollection<KV<K,V>> before I can apply GroupByKey.
> I think we can do better by allowing GroupByKey to be applied to a 
> PCollection of an ordinary type other than KV. The user could supply a 
> custom key-extraction function, or we could provide a default one, like this:
> PCollection<T> input = ...
> PCollection<KV<K,Iterable<V>>> result =
>     input.apply(GroupByKey.<K, V>create(new ExtractFn()));
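
For readers of the quoted proposal, here is a minimal sketch of the semantics it asks for: extract a key from each element, then group the elements by that key. It uses plain Java collections rather than the Beam API, so the class name GroupByExtractedKey, the helper groupByExtractedKey, and the sample data are all hypothetical. One assumption made here is that the grouped values are the original elements themselves.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

public class GroupByExtractedKey {

    // Hypothetical helper (not a Beam API): models "key each element, then
    // group by key" on an in-memory list instead of a PCollection.
    static <T, K> Map<K, List<T>> groupByExtractedKey(
            List<T> input, Function<T, K> extractFn) {
        // LinkedHashMap keeps keys in first-seen order for readable output.
        Map<K, List<T>> grouped = new LinkedHashMap<>();
        for (T element : input) {
            // Step 1: the explicit keying step the issue wants to fold away.
            K key = extractFn.apply(element);
            // Step 2: the grouping step, collecting elements that share a key.
            grouped.computeIfAbsent(key, k -> new ArrayList<>()).add(element);
        }
        return grouped;
    }

    public static void main(String[] args) {
        // Illustrative data: group words by their first letter.
        List<String> words =
                Arrays.asList("apple", "avocado", "banana", "blueberry", "cherry");
        Map<Character, List<String>> byFirstLetter =
                groupByExtractedKey(words, w -> w.charAt(0));
        System.out.println(byFirstLetter);
    }
}
```

The helper takes the key-extraction function as a parameter, which mirrors the ExtractFn argument in the proposal. In a pipeline the same two steps would remain separate transforms, which is the design the comment above accepts.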



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)