Re: Spark MLlib ALS algorithm

2016-09-25 Thread Roshani Nagmote

Hello, I ran the ALS algorithm on 30 c4.8xlarge machines (60 GB RAM each) with the Netflix dataset (1.4 GB; Users: 480189, Items: 17770, Ratings: 99M). *Command* I run: /usr/lib/spark/bin/spark-submit --deploy-mode cluster --master yarn --jars /usr/lib/spark/examples/jars/scopt_2.11-3.3.0.jar netfl

Re: Spark MLlib ALS algorithm

2016-09-23 Thread Nick Pentreath

The scale factor was only to scale up the number of ratings in the dataset for performance testing purposes, to illustrate the scalability of Spark ALS. It is not something you would normally do on your training dataset.

On Fri, 23 Sep 2016 at 20:07, Roshani Nagmote wrote:
> Hello,
>
> I was wor
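
For readers wondering what "scaling up the number of ratings" could look like in practice, here is a minimal plain-Python sketch. It is only an illustration under stated assumptions: the thread does not say how the benchmark actually generated its scaled data, so the replication scheme below (copying every rating and shifting user ids into a fresh block per copy) and the names `scale_ratings`, `scale_factor`, and `num_users` are hypothetical and are not part of Spark.

```python
from typing import List, Tuple

# (user_id, item_id, rating); user ids are assumed to run from 0 to num_users - 1
Rating = Tuple[int, int, float]


def scale_ratings(ratings: List[Rating], scale_factor: int, num_users: int) -> List[Rating]:
    """Replicate the ratings scale_factor times for load testing.

    Hypothetical scheme: each copy gets its own block of synthetic user ids,
    so the number of ratings and the number of users both grow by scale_factor
    while the item set stays the same.
    """
    if scale_factor < 1:
        raise ValueError("scale_factor must be >= 1")
    scaled: List[Rating] = []
    for copy in range(scale_factor):
        offset = copy * num_users  # shift user ids so copies do not collide
        for user, item, value in ratings:
            scaled.append((user + offset, item, value))
    return scaled


# Tiny made-up sample, not taken from the Netflix dataset
ratings = [(0, 10, 4.0), (1, 10, 3.0), (1, 11, 5.0)]
scaled = scale_ratings(ratings, scale_factor=3, num_users=2)
print(len(scaled))  # 9 ratings from 6 synthetic users
```

As the reply says, this kind of inflation is useful for measuring how training time grows with data size; it adds no new information, so it is not something to apply to a real training set.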