Hi! We've done cool benchmark of popular ML libraries (including Spark ML) on Criteo 1TB dataset https://github.com/rambler-digital-solutions/criteo-1tb-benchmark
Spark ML was tested on a real production cluster and showed great results at scale. We'd like to see some feedback and tips for improvement. Have a look and spread the word! -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Benchmark-of-XGBoost-Vowpal-Wabbit-and-Spark-ML-on-Criteo-1TB-Dataset-tp28640.html Sent from the Apache Spark User List mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe e-mail: [email protected]
