Updated Branches: refs/heads/master 14bb465bb -> c46067f09
Update tuning.md Clarify when serializer is used based on recent user@ mailing list discussion. Project: http://git-wip-us.apache.org/repos/asf/incubator-spark/repo Commit: http://git-wip-us.apache.org/repos/asf/incubator-spark/commit/08afef37 Tree: http://git-wip-us.apache.org/repos/asf/incubator-spark/tree/08afef37 Diff: http://git-wip-us.apache.org/repos/asf/incubator-spark/diff/08afef37 Branch: refs/heads/master Commit: 08afef37a07c501b1ba14e3d6da445712852ca1e Parents: eb4296c Author: Andrew Ash <[email protected]> Authored: Mon Nov 25 17:08:52 2013 -0800 Committer: Andrew Ash <[email protected]> Committed: Mon Nov 25 17:08:52 2013 -0800 ---------------------------------------------------------------------- docs/tuning.md | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) ---------------------------------------------------------------------- http://git-wip-us.apache.org/repos/asf/incubator-spark/blob/08afef37/docs/tuning.md ---------------------------------------------------------------------- diff --git a/docs/tuning.md b/docs/tuning.md index f33fda3..a4be188 100644 --- a/docs/tuning.md +++ b/docs/tuning.md @@ -39,7 +39,8 @@ in your operations) and performance. It provides two serialization libraries: for best performance. You can switch to using Kryo by calling `System.setProperty("spark.serializer", "org.apache.spark.serializer.KryoSerializer")` -*before* creating your SparkContext. The only reason it is not the default is because of the custom +*before* creating your SparkContext. This setting configures the serializer used for not only shuffling data between worker +nodes but also when serializing RDDs to disk. The only reason Kryo is not the default is because of the custom registration requirement, but we recommend trying it in any network-intensive application. Finally, to register your classes with Kryo, create a public class that extends
