[ 
https://issues.apache.org/jira/browse/SPARK-2278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14060691#comment-14060691
 ] 

Hans Uhlig commented on SPARK-2278:
-----------------------------------

I don't think I am communicating this very well. Perhaps I am looking for a 
different function entirely:

JavaRDD<K,V> JavaRDD.collectBy<K,V>(
    Iterable<Vout> Function<Vin,Vout> func(Iterable<Vin>),
    Comparator comp,
    Partitioner partitioner,
    int numPartitions)

JavaPairRDD<K,V> JavaPairRDD.collectByKey<K,V>(
    Iterable<Tuple<Kout,Vout>> Function<Kin,Vin,Kin,Kout> func(Kin,Iterable<Vin>),
    Comparator comp,
    Partitioner partitioner,
    int numPartitions)
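
To make the intended semantics concrete, here is a minimal plain-Java sketch 
with no Spark dependency. The class and method names are hypothetical and are 
not part of any Spark API. A TreeMap built on the comparator stands in for the 
grouping step, so two keys land in the same group whenever the comparator 
returns 0 for them, regardless of equals()/hashCode().

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only; this is not Spark code.
class GroupByComparatorSketch {

    // Groups values under keys that the comparator treats as equal
    // (compare(a, b) == 0). The first key seen for a group is kept as
    // that group's representative key.
    static <K, V> Map<K, List<V>> groupByKey(List<Map.Entry<K, V>> pairs,
                                             Comparator<? super K> comp) {
        Map<K, List<V>> groups = new TreeMap<>(comp);
        for (Map.Entry<K, V> pair : pairs) {
            groups.computeIfAbsent(pair.getKey(), k -> new ArrayList<>())
                  .add(pair.getValue());
        }
        return groups;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        pairs.add(new SimpleEntry<String, Integer>("Apple", 1));
        pairs.add(new SimpleEntry<String, Integer>("apple", 2));
        pairs.add(new SimpleEntry<String, Integer>("Pear", 3));

        // Case-insensitive key equality: "Apple" and "apple" share a group.
        Map<String, List<Integer>> groups =
            groupByKey(pairs, String.CASE_INSENSITIVE_ORDER);
        System.out.println(groups); // prints {Apple=[1, 2], Pear=[3]}
    }
}
```

In a distributed setting the Partitioner would also have to send all keys 
that compare as equal to the same partition, which is presumably why the 
signatures above take both a comparator and a partitioner.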

> groupBy & groupByKey should support custom comparator
> -----------------------------------------------------
>
>                 Key: SPARK-2278
>                 URL: https://issues.apache.org/jira/browse/SPARK-2278
>             Project: Spark
>          Issue Type: New Feature
>          Components: Java API
>    Affects Versions: 1.0.0
>            Reporter: Hans Uhlig
>
> To maintain parity with MapReduce you should be able to specify a custom key 
> equality function in groupBy/groupByKey similar to sortByKey. 



