Hello,
I ran ALS algorithm on 30 c4.8xlarge machines(60GB RAM each) with
dataset(1.4GB) Netflix dataset (Users: 480189, Items: 17770, Ratings: 99M)
*Command* I run:
/usr/lib/spark/bin/spark-submit --deploy-mode cluster --master yarn --jars
/usr/lib/spark/examples/jars/scopt_2.11-3.3.0.jar netfl
The scale factor was only to scale up the number of ratings in the dataset
for performance testing purposes, to illustrate the scalability of Spark
ALS.
It is not something you would normally do on your training dataset.
On Fri, 23 Sep 2016 at 20:07, Roshani Nagmote
wrote:
> Hello,
>
> I was wor