There is no tool to tweak a Spark cluster as such, but when writing the job you can follow the Tuning guide <http://spark.apache.org/docs/latest/tuning.html>.
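Your log shows the final stage running with only 2 tasks, which suggests too little parallelism for a 50 GB input. As a rough sketch along the lines of the Tuning guide (the paths, app name, and the parallelism value 384 are placeholders to adjust for your cluster), something like this spreads the shuffle across more tasks and switches to Kryo serialization:

    import org.apache.spark.{SparkConf, SparkContext}

    object WordCount {
      def main(args: Array[String]): Unit = {
        val conf = new SparkConf()
          .setAppName("WordCount")
          // Kryo is usually faster and more compact than Java serialization.
          .set("spark.serializer", "org.apache.spark.serializer.KryoSerializer")
          // Raise the default shuffle parallelism so the reduce side is not
          // squeezed into a handful of tasks (384 is a placeholder; a common
          // starting point is 2-3 tasks per CPU core in the cluster).
          .set("spark.default.parallelism", "384")

        val sc = new SparkContext(conf)

        // The second argument hints how many partitions to read the input
        // with (input path is a placeholder).
        val lines = sc.textFile("hdfs:///data/input-50gb.txt", 384)

        val counts = lines
          .flatMap(_.split("\\s+"))
          .map(word => (word, 1))
          // Explicit partition count for the shuffle stage.
          .reduceByKey(_ + _, 384)

        counts.saveAsTextFile("hdfs:///data/wordcount-output")
        sc.stop()
      }
    }

With 6 workers at 32 cores each, a few hundred shuffle partitions keeps all cores busy instead of funneling the reduce into 2 tasks.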
Thanks
Best Regards

On Mon, Oct 27, 2014 at 3:14 AM, Morbious <knowledgefromgro...@gmail.com> wrote:

> I wonder if there is any tool to tweak Spark (worker and master).
> I have 6 workers (192 GB RAM, 32 CPU cores each) with 2 masters and see
> only a small difference between Hadoop MapReduce and Spark.
> I've tested word count on a 50 GB file. During the tests Spark hung on 2
> nodes for a few minutes with the message:
>
> 14/10/26 21:38:52 INFO scheduler.DAGScheduler: Submitting 2 missing tasks
> from Stage 0 (MappedRDD[8] at saveAsTextFile at
> NativeMethodAccessorImpl.java:-2)
> 14/10/26 21:38:52 INFO scheduler.TaskSchedulerImpl: Adding task set 0.0
> with 2 tasks
> 14/10/26 21:38:52 INFO spark.MapOutputTrackerMasterActor: Asked to send map
> output locations for shuffle 0 to sp...@spark-s2.test.org:41437
> 14/10/26 21:38:52 INFO spark.MapOutputTrackerMaster: Size of output
> statuses for shuffle 0 is 5942 bytes
> 14/10/26 21:38:52 INFO spark.MapOutputTrackerMasterActor: Asked to send map
> output locations for shuffle 0 to sp...@spark-s4.test.org:34546
>
> Best regards,
>
> Morbious