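Neither. Anything you chain after rdd.collect() does no processing on the workers. collect() brings the entire RDD back to the driver as an in-memory local collection (an Array), so the foreach() or map() that follows is the ordinary Scala collection method and runs sequentially on the driver. To process in parallel, call map() or foreach() on the RDD itself, before collect().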
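
A minimal sketch of the difference, assuming a SparkContext is already available. The object name CollectDemo, the method name doubled, and the sample data (1 to 10 in 2 partitions) are made up for illustration and are not from the original question.

```scala
import org.apache.spark.SparkContext

// Hypothetical helper, for illustration only.
object CollectDemo {
  // Returns the same doubled values computed two ways.
  def doubled(sc: SparkContext): (Seq[Int], Seq[Int]) = {
    // Sample data split into 2 partitions (an assumption for the sketch).
    val rdd = sc.parallelize(1 to 10, numSlices = 2)

    // collect() returns a plain Array[Int] on the driver, so this map is
    // Scala's Array.map and runs sequentially in the driver JVM.
    val onDriver: Array[Int] = rdd.collect().map(_ * 2)

    // Here map is RDD.map: it runs on the executors, one task per
    // partition, and only the finished results are collected.
    val onExecutors: Array[Int] = rdd.map(_ * 2).collect()

    (onDriver.toSeq, onExecutors.toSeq)
  }
}
```

Both versions return the same values; only the place where the work happens differs. The same applies to foreach: rdd.collect().foreach(println) prints on the driver, while rdd.foreach(println) runs on the executors and, on a cluster, writes to the executors' stdout rather than the driver's console.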

On Wed, Feb 24, 2016 at 10:58 PM, Anurag [via Apache Spark User List] <ml-node+s1001560n26320...@n3.nabble.com> wrote:
> Hi Everyone
>
> I am new to Scala and Spark.
>
> I want to know
>
> 1. does Rdd.collect().foreach() do processing in parallel?
> 2. does Rdd.collect().map() do processing in parallel?
>
> Thanks in advance.
> Regards
> Anurag