Re: Sorting partitions in Java

2014-05-20 Thread Madhu
wse/SPARK-983>Andrew mentioned covers the rdd.sortPartitions() use case. Can someone comment on the scope of SPARK-983? Thanks!

--
Madhu
https://www.linkedin.com/in/msiddalingaiah
--
View this message in context: http://apache-spark-developers-list.1001551.n3.nabble.com/Sorting-pa

Re: Sorting partitions in Java

2014-05-20 Thread Sean Owen
On Tue, May 20, 2014 at 6:10 PM, Madhu wrote:
> What you suggest looks like an in-memory sort, which is fine if each partition is
> small enough to fit in memory. Is it true that rdd.sortByKey(...) requires
> partitions to fit in memory? I wasn't sure if there was some magic behind
> the scenes that su
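For readers of the archive, the in-memory approach discussed above can be sketched in plain Java. This is only an illustration under stated assumptions: `InMemoryPartitionSort` and `sortPartition` are hypothetical names, not Spark API, and a partition is modeled as an ordinary `Iterator`. It buffers the whole partition before sorting, so it works only when each partition fits in memory.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.Iterator;
import java.util.List;

// Hypothetical helper, not part of Spark: sorts one partition's records in memory.
class InMemoryPartitionSort {

    // Drains the partition iterator into a list, sorts the list, and returns an
    // iterator over the sorted records. The whole partition must fit in memory.
    static <T> Iterator<T> sortPartition(Iterator<T> partition, Comparator<? super T> order) {
        List<T> buffer = new ArrayList<>();
        while (partition.hasNext()) {
            buffer.add(partition.next());
        }
        buffer.sort(order);
        return buffer.iterator();
    }

    public static void main(String[] args) {
        // Two stand-in "partitions"; each one is sorted independently of the other.
        List<List<Integer>> partitions =
                Arrays.asList(Arrays.asList(3, 1, 2), Arrays.asList(9, 7, 8));
        for (List<Integer> p : partitions) {
            Iterator<Integer> sorted =
                    sortPartition(p.iterator(), Comparator.<Integer>naturalOrder());
            StringBuilder line = new StringBuilder();
            while (sorted.hasNext()) {
                line.append(sorted.next()).append(' ');
            }
            System.out.println(line.toString().trim());
        }
    }
}
```

An iterator-in, iterator-out function of this shape is roughly what one would hand to `mapPartitions` on an RDD, although the exact function interface depends on the Spark version in use.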

Re: Sorting partitions in Java

2014-05-20 Thread Andrew Ash
> > > > don't fit in memory, sorting those partitions requires more work. For these
> > > > cases, I think there is value in having a robust partition sorting method
> > > > that deals with it efficiently and reliably.
> > > >
> > > > Is th
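For partitions that do not fit in memory, one standard technique is an external merge sort: sort bounded chunks, spill each chunk to disk as a sorted run, then merge the runs. The sketch below is not the approach proposed in this thread or in SPARK-983, and none of it is Spark API. `ExternalPartitionSort`, `maxInMemory`, and the `sink` callback are hypothetical names, and records are assumed to be strings without embedded newlines.

```java
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Iterator;
import java.util.List;
import java.util.PriorityQueue;
import java.util.function.Consumer;

// Hypothetical helper, not part of Spark: external merge sort over one partition.
class ExternalPartitionSort {

    // One sorted run on disk plus the next unread record from it.
    private static final class Run implements Comparable<Run> {
        final BufferedReader reader;
        String head;

        Run(BufferedReader reader) throws IOException {
            this.reader = reader;
            this.head = reader.readLine();
        }

        @Override
        public int compareTo(Run other) {
            return head.compareTo(other.head);
        }
    }

    // Buffers at most maxInMemory records at a time, spills each sorted chunk to a
    // temporary file, then streams the merged result to sink in ascending order.
    static void sort(Iterator<String> partition, int maxInMemory, Consumer<String> sink)
            throws IOException {
        List<Path> runs = new ArrayList<>();
        List<String> buffer = new ArrayList<>();
        while (partition.hasNext()) {
            buffer.add(partition.next());
            if (buffer.size() >= maxInMemory) {
                runs.add(spill(buffer));
                buffer.clear();
            }
        }
        if (!buffer.isEmpty()) {
            runs.add(spill(buffer));
        }

        // K-way merge: the queue always yields the run with the smallest head record.
        PriorityQueue<Run> queue = new PriorityQueue<>();
        for (Path p : runs) {
            Run run = new Run(Files.newBufferedReader(p, StandardCharsets.UTF_8));
            if (run.head != null) {
                queue.add(run);
            }
        }
        while (!queue.isEmpty()) {
            Run run = queue.poll();
            sink.accept(run.head);
            run.head = run.reader.readLine();
            if (run.head != null) {
                queue.add(run);
            } else {
                run.reader.close();
            }
        }
        for (Path p : runs) {
            Files.deleteIfExists(p);
        }
    }

    // Sorts the buffer and writes it to a temporary file, one record per line.
    private static Path spill(List<String> buffer) throws IOException {
        Collections.sort(buffer);
        Path file = Files.createTempFile("partition-run-", ".txt");
        try (BufferedWriter w = Files.newBufferedWriter(file, StandardCharsets.UTF_8)) {
            for (String s : buffer) {
                w.write(s);
                w.newLine();
            }
        }
        return file;
    }

    public static void main(String[] args) throws IOException {
        List<String> input = Arrays.asList("pear", "apple", "fig", "kiwi", "date");
        sort(input.iterator(), 2, System.out::println);
    }
}
```

The sketch trades disk I/O for a bounded buffer: only one chunk, plus one head record per run, is held in memory at any point. A production version would also need cleanup on failure and a serializer for non-string records.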

Re: Sorting partitions in Java

2014-05-20 Thread Sandy Ryza
> > > partitions? If not, I don't mind developing and contributing a solution.
> > >
> > > --
> > > Madhu
> > > https://www.linkedin.com/in/msiddalingaiah
> > > --
> > > View this message in context:
> > > http://apache-spark-developers-list.1001551.n3.nabble.com/Sorting-partitions-in-Java-tp6715p6719.html
> > > Sent from the Apache Spark Developers List mailing list archive at Nabble.com.

Re: Sorting partitions in Java

2014-05-20 Thread Andrew Ash
> > ng arbitrarily large partitions? If not, I don't mind developing and contributing a solution.

Re: Sorting partitions in Java

2014-05-20 Thread Sandy Ryza
> Is there another solution for sorting arbitrarily large partitions? If not,
> I don't mind developing and contributing a solution.

Re: Sorting partitions in Java

2014-05-20 Thread Madhu

Re: Sorting partitions in Java

2014-05-20 Thread Sean Owen
> Thanks!
>
> --
> View this message in context:
> http://apache-spark-developers-list.1001551.n3.nabble.com/Sorting-partitions-in-Java-tp6715.html

Sorting partitions in Java

2014-05-20 Thread Madhu
Ideally, it would be nice to have an efficient, robust method in RDD to sort each partition. Does something like that exist? Thanks!