[GitHub] spark pull request: SPARK-4040. Update documentation to exemplify ...

srowen Thu, 30 Oct 2014 14:01:10 -0700

Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2964#discussion_r19636020
  
    --- Diff: docs/configuration.md ---
    @@ -21,16 +21,19 @@ application. These properties can be set directly on a
     [SparkConf](api/scala/index.html#org.apache.spark.SparkConf) passed to your
     `SparkContext`. `SparkConf` allows you to configure some of the common properties
     (e.g. master URL and application name), as well as arbitrary key-value pairs through the
    -`set()` method. For example, we could initialize an application as follows:
    +`set()` method. For example, we could initialize an application with one worker as follows:
     
     {% highlight scala %}
     val conf = new SparkConf()
    -             .setMaster("local")
    +             .setMaster("local[1]")
    --- End diff --
    
    I actually thought the default behavior of `local` was 2 threads, but from
    the code I don't think that's true. I thought @mateiz mentioned one time that
    it's better to run with some minimal parallelism by default to expose issues
    that might only appear when there are multiple executors.
    
    In any event, given that, and the thrust of this doc change, is it good to
    encourage people to use 1 worker? How about explicitly 2? Making it explicit
    is a small good thing anyway.
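
To make the suggestion concrete, here is a minimal sketch of the explicit two-thread variant. The object name and the app name `ExampleApp` are placeholders chosen for illustration, not part of this diff, and the snippet assumes `spark-core` is on the classpath.

```scala
import org.apache.spark.SparkConf

object ExplicitLocalMaster {
  def main(args: Array[String]): Unit = {
    // "local" runs Spark with a single worker thread; "local[2]" asks for two,
    // which gives some parallelism while keeping the thread count explicit.
    val conf = new SparkConf()
      .setMaster("local[2]")
      .setAppName("ExampleApp") // hypothetical app name, for illustration only

    // Read the setting back to see what a SparkContext would be given.
    println(conf.get("spark.master"))
  }
}
```

Spelling out the thread count means a reader does not have to know what a bare `local` defaults to.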



