Interesting. thanks for this information. On Fri, Jul 22, 2016 at 11:26 AM, Bryan Cutler <[email protected]> wrote:
> ML has a DataFrame based API, while MLlib is RDDs and will be deprecated > as of Spark 2.0. > > On Thu, Jul 21, 2016 at 10:41 PM, VG <[email protected]> wrote: > >> Why do we have these 2 packages ... ml and mlib? >> What is the difference in these >> >> >> >> On Fri, Jul 22, 2016 at 11:09 AM, Bryan Cutler <[email protected]> wrote: >> >>> Hi JG, >>> >>> If you didn't know this, Spark MLlib has 2 APIs, one of which uses >>> DataFrames. Take a look at this example >>> https://github.com/apache/spark/blob/master/examples/src/main/java/org/apache/spark/examples/ml/JavaLinearRegressionWithElasticNetExample.java >>> >>> This example uses a Dataset<Row>, which is type equivalent to a >>> DataFrame. >>> >>> >>> On Thu, Jul 21, 2016 at 8:41 PM, Jean Georges Perrin <[email protected]> >>> wrote: >>> >>>> Hi, >>>> >>>> I am looking for some really super basic examples of MLlib (like a >>>> linear regression over a list of values) in Java. I have found a few, but I >>>> only saw them using JavaRDD... and not DataFrame. >>>> >>>> I was kind of hoping to take my current DataFrame and send them in >>>> MLlib. Am I too optimistic? Do you know/have any example like that? >>>> >>>> Thanks! >>>> >>>> jg >>>> >>>> >>>> Jean Georges Perrin >>>> [email protected] / @jgperrin >>>> >>>> >>>> >>>> >>>> >>> >> >
