I don't think you need to employ a distributed system for working with this dataset. An SGD implementation on a single machine should easily handle the job.
Best, Sebastian 2017-07-12 9:26 GMT+02:00 Andrea Spina <andrea.sp...@radicalbit.io>: > Dear Ziyad, > > Yep, I had encountered same very long runtimes with ALS as well at the time > and I recorded improvements by increasing the number of blocks / decreasing > #TSs/TM like you've stated out. > > Cheers, > > Andrea > > > > > > > -- > View this message in context: http://apache-flink-user- > mailing-list-archive.2336050.n4.nabble.com/FlinkML-ALS-is- > taking-too-long-to-run-tp14154p14192.html > Sent from the Apache Flink User Mailing List archive. mailing list archive > at Nabble.com. >