Repository: spark
Updated Branches:
  refs/heads/branch-1.5 bc70043c8 -> bff05aaa0


[SPARK-10469] [DOC] Try and document the three options

>From JIRA:
Add documentation for tungsten-sort.
>From the mailing list "I saw a new "spark.shuffle.manager=tungsten-sort" 
>implemented in
https://issues.apache.org/jira/browse/SPARK-7081, but it can't be found its
corresponding description in
http://people.apache.org/~pwendell/spark-releases/spark-1.5.0-rc3-docs/configuration.html(Currenlty
there are only 'sort' and 'hash' two options)."

Author: Holden Karau <hol...@pigscanfly.ca>

Closes #8638 from holdenk/SPARK-10469-document-tungsten-sort.

(cherry picked from commit a76bde9dae54c4641e21f3c1ceb4870e3dc91881)
Signed-off-by: Andrew Or <and...@databricks.com>


Project: http://git-wip-us.apache.org/repos/asf/spark/repo
Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/bff05aaa
Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/bff05aaa
Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/bff05aaa

Branch: refs/heads/branch-1.5
Commit: bff05aaa099cc02465f258e44d581bd92370c09b
Parents: bc70043
Author: Holden Karau <hol...@pigscanfly.ca>
Authored: Thu Sep 10 11:49:53 2015 -0700
Committer: Andrew Or <and...@databricks.com>
Committed: Thu Sep 10 11:50:02 2015 -0700

----------------------------------------------------------------------
 docs/configuration.md | 9 ++++++---
 1 file changed, 6 insertions(+), 3 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/spark/blob/bff05aaa/docs/configuration.md
----------------------------------------------------------------------
diff --git a/docs/configuration.md b/docs/configuration.md
index 353efdb..e1fa321 100644
--- a/docs/configuration.md
+++ b/docs/configuration.md
@@ -458,9 +458,12 @@ Apart from these, the following properties are also 
available, and may be useful
   <td><code>spark.shuffle.manager</code></td>
   <td>sort</td>
   <td>
-    Implementation to use for shuffling data. There are two implementations 
available:
-    <code>sort</code> and <code>hash</code>. Sort-based shuffle is more 
memory-efficient and is
-    the default option starting in 1.2.
+    Implementation to use for shuffling data. There are three implementations 
available:
+    <code>sort</code>, <code>hash</code> and the new (1.5+) 
<code>tungsten-sort</code>.
+    Sort-based shuffle is more memory-efficient and is the default option 
starting in 1.2.
+    Tungsten-sort is similar to the sort based shuffle, with a direct binary 
cache-friendly
+    implementation with a fall back to regular sort based shuffle if its 
requirements are not
+    met.
   </td>
 </tr>
 <tr>


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org

Reply via email to