Re: Spark MLlib ALS algorithm

2016-09-25 Thread Roshani Nagmote
Hello, I ran the ALS algorithm on 30 c4.8xlarge machines (60 GB RAM each) on the Netflix dataset (1.4 GB; users: 480189, items: 17770, ratings: 99M). The command I ran: /usr/lib/spark/bin/spark-submit --deploy-mode cluster --master yarn --jars /usr/lib/spark/examples/jars/scopt_2.11-3.3.0.jar netfl

Re: Spark MLlib ALS algorithm

2016-09-23 Thread Nick Pentreath
The scale factor was only to scale up the number of ratings in the dataset for performance testing purposes, to illustrate the scalability of Spark ALS. It is not something you would normally do on your training dataset.

On Fri, 23 Sep 2016 at 20:07, Roshani Nagmote wrote:
> Hello,
>
> I was wor
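
On the scale factor mentioned in the reply above: the thread does not say how the ratings were scaled up, so the following is only a rough sketch of one possible approach, not the method used in the benchmark. It assumes ratings are (user, item, rating) triples with non-negative integer user ids; the function name scale_ratings and the id-offset scheme are hypothetical. Each copy of the data gets its user ids shifted, so the number of ratings and the number of users both grow by the factor while the item set stays the same.

```python
def scale_ratings(ratings, factor):
    """Replicate (user, item, rating) triples `factor` times.

    Hypothetical helper: each copy shifts user ids by the size of the
    original user-id range, so copies never collide with each other.
    """
    if factor < 1:
        raise ValueError("factor must be >= 1")
    if not ratings:
        return []
    # Size of the original user-id range (ids assumed to start at 0).
    n_users = max(user for user, _, _ in ratings) + 1
    scaled = []
    for copy in range(factor):
        offset = copy * n_users
        for user, item, rating in ratings:
            scaled.append((user + offset, item, rating))
    return scaled


if __name__ == "__main__":
    sample = [(0, 10, 5.0), (1, 11, 3.0)]
    for row in scale_ratings(sample, 3):
        print(row)
```

Data inflated this way is useful only for measuring throughput and memory behaviour at larger sizes. The extra rows carry no new information, which is consistent with the point that you would not do this to a real training set.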