Repository: spark Updated Branches: refs/heads/master 30a86acde -> 2b233f5fc
Documentation: add description for repartitionAndSortWithinPartitions Author: Madhu Siddalingaiah <[email protected]> Closes #3390 from msiddalingaiah/master and squashes the following commits: cbccbfe [Madhu Siddalingaiah] Documentation: replace <b> with <code> (again) 332f7a2 [Madhu Siddalingaiah] Documentation: replace <b> with <code> cd2b05a [Madhu Siddalingaiah] Merge remote-tracking branch 'upstream/master' 0fc12d7 [Madhu Siddalingaiah] Documentation: add description for repartitionAndSortWithinPartitions Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/2b233f5f Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/2b233f5f Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/2b233f5f Branch: refs/heads/master Commit: 2b233f5fc4beb2c6ed4bc142e923e96f8bad3ec4 Parents: 30a86ac Author: Madhu Siddalingaiah <[email protected]> Authored: Mon Dec 1 08:45:34 2014 -0800 Committer: Josh Rosen <[email protected]> Committed: Mon Dec 1 08:45:34 2014 -0800 ---------------------------------------------------------------------- docs/programming-guide.md | 6 ++++++ 1 file changed, 6 insertions(+) ---------------------------------------------------------------------- http://git-wip-us.apache.org/repos/asf/spark/blob/2b233f5f/docs/programming-guide.md ---------------------------------------------------------------------- diff --git a/docs/programming-guide.md b/docs/programming-guide.md index 7a16ee8..5e0d5c1 100644 --- a/docs/programming-guide.md +++ b/docs/programming-guide.md @@ -934,6 +934,12 @@ for details. <td> Reshuffle the data in the RDD randomly to create either more or fewer partitions and balance it across them. This always shuffles all data over the network. </td> </tr> +<tr> + <td> <b>repartitionAndSortWithinPartitions</b>(<i>partitioner</i>) </td> + <td> Repartition the RDD according to the given partitioner and, within each resulting partition, + sort records by their keys. This is more efficient than calling <code>repartition</code> and then sorting within + each partition because it can push the sorting down into the shuffle machinery. </td> +</tr> </table> ### Actions --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
