Which Spark version?
Are you using RDDs or Datasets?
What type are the features? If they are strings, how large are they?
Is it Spark standalone?
How do you train/configure the algorithm? How do you initially parse the data?
The standard driver and executor logs could be helpful.
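It may also be worth confirming that the heap options actually reach the JVM that runs the job, since in local mode the driver and the executor threads share one JVM. Below is a minimal sketch of such a check; the class name HeapCheck and its helper methods are hypothetical, and it only uses the standard java.lang.Runtime API.

```java
// Hypothetical helper, not part of Spark: reports the heap limits of the
// JVM it runs in, so the effect of -Xms/-Xmx can be verified.
public class HeapCheck {
    // Maximum heap the JVM will attempt to use, in MB (roughly follows -Xmx).
    static long maxHeapMb() {
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    // Heap currently reserved by the JVM, in MB (starts near -Xms).
    static long committedHeapMb() {
        return Runtime.getRuntime().totalMemory() / (1024L * 1024L);
    }

    // One-line summary suitable for logging at job start-up.
    static String describe() {
        return "max heap: " + maxHeapMb() + " MB, committed: "
                + committedHeapMb() + " MB";
    }

    public static void main(String[] args) {
        System.out.println(describe());
    }
}
```

Run it with the same options as the job, for example java -Xms1024m -Xmx10296m HeapCheck, or log the same two values from inside the Spark application. If the reported maximum is far below the intended value, the options are probably not being passed to the JVM that runs Spark.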
> On 12 Oct 2016, at 09:24, 陈哲
Hi
I'm using Spark ML to train a RandomForest model. There are over 200,000
lines in the training data file and about 100 features. I'm running
Spark in local mode with JAVA_OPTS like: -Xms1024m -Xmx10296m
-XX:+PrintGCDetails -XX:+PrintGCTimeStamps, but OOM errors keep occurring,
I