Spark DIMSUM Memory requirement?

2015-12-01 Thread Parin Choganwala
Hi All,

I am trying to run RowMatrix.similarity(0.5) on 60K users (n) with 130k 
features (m) on spark 1.3.0.
Using 4 m3.2xlarge 30GB RAM and 8 cores but getting lots of ERROR 
YarnScheduler: Lost executor 1 on XXX.internal: remote Akka client disassociate

What could be the reason?
Is it shuffle memory that I should increase?

Thank You
Parin Choganwala



Stack Overflow Question

2015-11-13 Thread Parin Choganwala
EMR 4.1.0 + Spark 1.5.0 + YARN Resource Allocation

http://stackoverflow.com/q/33488869/1366507?sem=2