Hi,

It depends on the problem you are working on. Much like Python and R, MLlib focuses on machine learning, while SparkR will focus on statistics, provided SparkR follows the R way of doing things.
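To make that split a bit more concrete, here is a minimal sketch in plain Python (not Spark code, and not the API of either MLlib or SparkR). It fits a one-variable least-squares line, which is the "parameters and prediction" side, and then adds a standard error and an approximate confidence interval for the slope, which is the "statistics" side. The function names `fit_line` and `slope_confidence_interval` are made up for illustration, and the interval uses a normal quantile as a large-sample approximation; for small samples a t quantile with n - 2 degrees of freedom would be more appropriate.

```python
from math import sqrt
from statistics import NormalDist, mean


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x.

    This is the 'machine learning' view: we only want the parameters
    so that we can predict new values.
    """
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


def slope_confidence_interval(xs, ys, level=0.95):
    """Approximate confidence interval for the slope.

    This is the 'statistics' view: how certain are we about the parameter?
    Assumption: a normal quantile is used instead of a t quantile, so the
    interval is only a large-sample approximation.
    """
    n = len(xs)
    intercept, slope = fit_line(xs, ys)
    x_bar = mean(xs)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    residuals = [y - (intercept + slope * x) for x, y in zip(xs, ys)]
    # Unbiased estimate of the residual variance (two fitted parameters).
    sigma2 = sum(r * r for r in residuals) / (n - 2)
    std_err = sqrt(sigma2 / sxx)
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return slope - z * std_err, slope + z * std_err


if __name__ == "__main__":
    xs = [1, 2, 3, 4, 5]
    ys = [2.1, 3.9, 6.2, 7.8, 10.1]
    intercept, slope = fit_line(xs, ys)
    print("prediction at x=6:", intercept + slope * 6)
    print("approx. 95% CI for slope:", slope_confidence_interval(xs, ys))
```

The first function is all you need if you only want to predict; the second is the kind of extra work you take on when you care about how reliable the estimated parameter is.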
For instance, if you want to use a GLM to analyse data:

1. If you are only interested in the parameters of the model, and in using the model to predict, then you should use MLlib.
2. If your focus is on the confidence of the model, for example the confidence intervals of the results and the significance levels of the parameters, you should choose SparkR. However, as there is no glm package for this purpose yet, you will need to code it yourself.

Hope this helps.

Cheers
Gen

On Thu, Aug 6, 2015 at 2:24 AM, praveen S <mylogi...@gmail.com> wrote:

> I was wondering when one should go for MLib or SparkR. What is the
> criteria or what should be considered before choosing either of the
> solutions for data analysis?
> or What is the advantages of Spark MLib over Spark R or advantages of
> SparkR over MLib?