Repository: spark
Updated Branches:
  refs/heads/master 30a86acde -> 2b233f5fc


Documentation: add description for repartitionAndSortWithinPartitions

Author: Madhu Siddalingaiah <[email protected]>

Closes #3390 from msiddalingaiah/master and squashes the following commits:

cbccbfe [Madhu Siddalingaiah] Documentation: replace <b> with <code> (again)
332f7a2 [Madhu Siddalingaiah] Documentation: replace <b> with <code>
cd2b05a [Madhu Siddalingaiah] Merge remote-tracking branch 'upstream/master'
0fc12d7 [Madhu Siddalingaiah] Documentation: add description for 
repartitionAndSortWithinPartitions


Project: http://git-wip-us.apache.org/repos/asf/spark/repo
Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/2b233f5f
Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/2b233f5f
Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/2b233f5f

Branch: refs/heads/master
Commit: 2b233f5fc4beb2c6ed4bc142e923e96f8bad3ec4
Parents: 30a86ac
Author: Madhu Siddalingaiah <[email protected]>
Authored: Mon Dec 1 08:45:34 2014 -0800
Committer: Josh Rosen <[email protected]>
Committed: Mon Dec 1 08:45:34 2014 -0800

----------------------------------------------------------------------
 docs/programming-guide.md | 6 ++++++
 1 file changed, 6 insertions(+)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/spark/blob/2b233f5f/docs/programming-guide.md
----------------------------------------------------------------------
diff --git a/docs/programming-guide.md b/docs/programming-guide.md
index 7a16ee8..5e0d5c1 100644
--- a/docs/programming-guide.md
+++ b/docs/programming-guide.md
@@ -934,6 +934,12 @@ for details.
   <td> Reshuffle the data in the RDD randomly to create either more or fewer 
partitions and balance it across them.
     This always shuffles all data over the network. </td>
 </tr>
+<tr>
+  <td> <b>repartitionAndSortWithinPartitions</b>(<i>partitioner</i>) </td>
+  <td> Repartition the RDD according to the given partitioner and, within each 
resulting partition,
+  sort records by their keys. This is more efficient than calling 
<code>repartition</code> and then sorting within 
+  each partition because it can push the sorting down into the shuffle 
machinery. </td>
+</tr>
 </table>
 
 ### Actions


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to