GitHub user ash211 opened a pull request:
https://github.com/apache/spark/pull/369
SPARK-1063 Add .sortBy(f) method on RDD
This never got merged from the apache/incubator-spark repo (which is now
deleted) but there had been several rounds of code review on this PR there.
I think this is ready for merging.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/ash211/spark sortby
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/369.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #369
----
commit ca4490da536578ef4650039b099db8dafc9d6b66
Author: Andrew Ash <[email protected]>
Date: 2014-01-24T08:26:55Z
Add .sortBy(f) method on RDD
commit 0f685fd17584061d4b18419234bedb79843a0813
Author: Andrew Ash <[email protected]>
Date: 2014-02-14T06:11:27Z
Merge remote-tracking branch 'origin/master' into sortby
Conflicts:
core/src/main/scala/org/apache/spark/rdd/RDD.scala
core/src/test/scala/org/apache/spark/rdd/RDDSuite.scala
commit 7db3e849c5a9e4a3189ea594e349835cef6d307e
Author: Andrew Ash <[email protected]>
Date: 2014-02-14T06:27:06Z
Support ascending and numPartitions params in sortBy()
commit 381eef23f59a44b0555de9bb63fc8e598595ef32
Author: Andrew Ash <[email protected]>
Date: 2014-02-14T06:32:19Z
Correct silly typo
commit 8c53298cfeebcba7e08ef8c586816e7513daf11b
Author: Andrew Ash <[email protected]>
Date: 2014-02-25T00:19:02Z
Actually use ascending and numPartitions parameters
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---