Mortada Mehyar created SPARK-12760:
--------------------------------------

             Summary: inaccurate description for difference between local vs 
cluster mode 
                 Key: SPARK-12760
                 URL: https://issues.apache.org/jira/browse/SPARK-12760
             Project: Spark
          Issue Type: Question
          Components: Documentation
            Reporter: Mortada Mehyar
            Priority: Trivial


In the spark documentation there's an example for illustrating how `local` and 
`cluster` mode can differ 
http://spark.apache.org/docs/latest/programming-guide.html#example

" In local mode with a single JVM, the above code will sum the values within 
the RDD and store it in counter. This is because both the RDD and the variable 
counter are in the same memory space on the driver node." 

However the above doesn't seem to be true. Even in `local` mode it seems like 
the counter value should still be 0, because the variable will be summed up in 
the executor memory space, but the final value in the driver memory space is 
still 0. I tested this snippet and verified that in `local` mode the value is 
indeed still 0. 

Is the doc wrong or perhaps I'm missing something the doc is trying to say? 




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to