Re: Error spark-mahout when spark-submit mode cluster

2019-07-25 Thread Andrew Musselman
te: > > Hi everybody, I'm trying to build a basic recommender with Spark and Mahout on Scala. I use the following mahout repo to compile mahout with scala 2.11 and spark 2.1.2 mahout_fork <https://github.com/actionml/mahout/tree/sparse-speedup-13.0>

Re: Error spark-mahout when spark-submit mode cluster

2019-07-22 Thread Simanchal
unsubscribe On Wed, Aug 1, 2018 at 6:54 AM Jaume Galí wrote: > Hi everybody, I'm trying to build a basic recommender with Spark and Mahout on Scala. I use the following mahout repo to compile mahout with scala 2.11 and spark 2.1.2 mahout_fork <https://github.com/actionml/mah

Re: Error spark-mahout when spark-submit mode cluster

2018-08-08 Thread santosh kumar
user-unsubscribe@m.a.o On Wed, Aug 8, 2018 at 7:18 PM Eric Link wrote: > unsubscribe > On Wed, Aug 1, 2018 at 8:54 AM Jaume Galí wrote: > > Hi everybody, I'm trying to build a basic recommender with Spark and Mahout on Scala. I use the following mahout repo t

Re: Error spark-mahout when spark-submit mode cluster

2018-08-08 Thread Dmitriy Lyubimov
it is user-unsubscribe@m.a.o On Wed, Aug 8, 2018 at 6:47 AM, Eric Link wrote: > unsubscribe > On Wed, Aug 1, 2018 at 8:54 AM Jaume Galí wrote: > > Hi everybody, I'm trying to build a basic recommender with Spark and Mahout on Scala. I use the following ma

Re: Error spark-mahout when spark-submit mode cluster

2018-08-08 Thread Dmitriy Lyubimov
this is a Mahout-specific problem. Sorry if I did not say anything you did not already know. -d On Wed, Aug 1, 2018 at 6:54 AM, Jaume Galí wrote: > Hi everybody, I'm trying to build a basic recommender with Spark and Mahout on Scala. I use the following mahout repo to compile mahout with scal

Re: Error spark-mahout when spark-submit mode cluster

2018-08-08 Thread Amani Kongara
Unsubscribe On Wed, 8 Aug 2018 at 7:18 PM Eric Link wrote: > unsubscribe > On Wed, Aug 1, 2018 at 8:54 AM Jaume Galí wrote: > > Hi everybody, I'm trying to build a basic recommender with Spark and Mahout on Scala. I use the following mahout repo to com

Re: Error spark-mahout when spark-submit mode cluster

2018-08-08 Thread Eric Link
unsubscribe On Wed, Aug 1, 2018 at 8:54 AM Jaume Galí wrote: > Hi everybody, I'm trying to build a basic recommender with Spark and Mahout on Scala. I use the following mahout repo to compile mahout with scala 2.11 and spark 2.1.2 mahout_fork <https://github.com/actionml/mah

Error spark-mahout when spark-submit mode cluster

2018-08-01 Thread Jaume Galí
Hi everybody, I'm trying to build a basic recommender with Spark and Mahout on Scala. I use the following Mahout fork to compile Mahout with Scala 2.11 and Spark 2.1.2: mahout_fork <https://github.com/actionml/mahout/tree/sparse-speedup-13.0>. To execute my code I use spark-submit and it runs fin

Re: Mahout and Spark 2.2 compatibility

2017-12-04 Thread Dmitriy Lyubimov
e. I'll also post back here. > > tg > > On Mon, Dec 4, 2017 at 11:18 AM, Marc Cardus Garcia <marc.car...@eurecat.org> wrote: > > > Hello all, > > > First time I write into this mailing list, so if there is something wrong with my

Re: Mahout and Spark 2.2 compatibility

2017-12-04 Thread Trevor Grant
wrote: > Hello all, > First time I write into this mailing list, so if there is something wrong with my message please let me know. > I work for a company using Mahout and Spark. We have recently started a project using Spark 2.2 and we would like to use Mahout b

Mahout and Spark 2.2 compatibility

2017-12-04 Thread Marc Cardus Garcia
Hello all, This is the first time I have written to this mailing list, so if there is something wrong with my message please let me know. I work for a company using Mahout and Spark. We have recently started a project using Spark 2.2 and we would like to use Mahout, but if I am not wrong, according to his

Re: Setup questions for mahout with spark

2015-07-27 Thread Dmitriy Lyubimov
to set up a system with Mahout and Spark and would like some opinions regarding what versions of the components to use. The target system is a Red Hat Enterprise Linux box. What I would like to know is: * What version of Java are people using, jre-1.7.0-openjdk? * Should I use last final

Re: Mahout Vs Spark

2014-10-24 Thread Ted Dunning
, Mahesh Balija balijamahesh@gmail.com wrote: I am trying to differentiate between Mahout and Spark, here is the small list, Features Mahout Spark Clustering Y Y Classification Y Y Regression Y Y Dimensionality Reduction Y Y Java Y Y Scala N Y

Re: Mahout Vs Spark

2014-10-23 Thread Ted Dunning
...@gmail.com wrote: On Tue, Oct 21, 2014 at 3:04 PM, Mahesh Balija balijamahesh@gmail.com wrote: I am trying to differentiate between Mahout and Spark, here is the small list, Features Mahout Spark Clustering Y Y Classification Y Y Regression Y Y Dimensionality

Re: Mahout Vs Spark

2014-10-23 Thread thejas prasad
. This is applicable to all the libraries of python (including numpy). On Wed, Oct 22, 2014 at 3:54 AM, Ted Dunning ted.dunn...@gmail.com wrote: On Tue, Oct 21, 2014 at 3:04 PM, Mahesh Balija balijamahesh@gmail.com wrote: I am trying to differentiate between Mahout and Spark

Re: Mahout Vs Spark

2014-10-22 Thread Brian Dolan
. This is applicable to all the libraries of python (including numpy). On Wed, Oct 22, 2014 at 3:54 AM, Ted Dunning ted.dunn...@gmail.com wrote: On Tue, Oct 21, 2014 at 3:04 PM, Mahesh Balija balijamahesh@gmail.com wrote: I am trying to differentiate between Mahout and Spark, here

Re: Mahout Vs Spark

2014-10-22 Thread Martin, Nick
). On Wed, Oct 22, 2014 at 3:54 AM, Ted Dunning ted.dunn...@gmail.com wrote: On Tue, Oct 21, 2014 at 3:04 PM, Mahesh Balija balijamahesh@gmail.com wrote: I am trying to differentiate between Mahout and Spark, here is the small list, Features Mahout Spark Clustering Y Y

Re: Mahout Vs Spark

2014-10-22 Thread Mahesh Balija
(including numpy). On Wed, Oct 22, 2014 at 3:54 AM, Ted Dunning ted.dunn...@gmail.com wrote: On Tue, Oct 21, 2014 at 3:04 PM, Mahesh Balija balijamahesh@gmail.com wrote: I am trying to differentiate between Mahout and Spark, here is the small list, Features Mahout Spark

Re: Mahout Vs Spark

2014-10-22 Thread Dmitriy Lyubimov
For the record, this is all a false dilemma (at least w.r.t. Spark vs. the Mahout Spark bindings). The Spark bindings have never been conceived as one vs. another. The Mahout Scala bindings are an add-on on top of Spark that just happens to rely on some of the things in mahout-math. With Spark one gets some major

Re: Mahout Vs Spark

2014-10-22 Thread Mahesh Balija
, just wanted to take some inputs from the active contributors. Best! Mahesh Balija. On Wed, Oct 22, 2014 at 6:57 PM, Dmitriy Lyubimov dlie...@gmail.com wrote: For the record, this is all a false dilemma (at least w.r.t. Spark vs. the Mahout Spark bindings). The Spark bindings have never been

Mahout Vs Spark

2014-10-21 Thread Mahesh Balija
package and inbuilt support for Scala. I am trying to differentiate between Mahout and Spark, here is the small list,

Features                  Mahout  Spark
Clustering                Y       Y
Classification            Y       Y
Regression                Y       Y
Dimensionality Reduction  Y       Y
Java                      Y       Y
Scala                     N       Y
Python                    N       Y
Numpy                     N       Y
Hadoop                    Y       Y
Text Mining               Y       N

Re: Mahout Vs Spark

2014-10-21 Thread Ted Dunning
On Tue, Oct 21, 2014 at 3:04 PM, Mahesh Balija balijamahesh@gmail.com wrote: I am trying to differentiate between Mahout and Spark, here is the small list, Features Mahout Spark Clustering Y Y Classification Y Y Regression Y Y Dimensionality Reduction Y Y Java Y Y Scala N Y

Re: Mahout Vs Spark

2014-10-21 Thread Vibhanshu Prasad
at 3:04 PM, Mahesh Balija balijamahesh@gmail.com wrote: I am trying to differentiate between Mahout and Spark, here is the small list, Features Mahout Spark Clustering Y Y Classification Y Y Regression Y Y Dimensionality Reduction Y Y Java Y Y Scala N Y Python N Y Numpy N

Re: Mahout Vs Spark

2014-10-21 Thread Lee S
balijamahesh@gmail.com wrote: I am trying to differentiate between Mahout and Spark, here is the small list, Features Mahout Spark Clustering Y Y Classification Y Y Regression Y Y Dimensionality Reduction Y Y Java Y Y Scala N Y Python N Y Numpy N Y Hadoop Y Y Text

Re: Mahout Vs Spark

2014-10-21 Thread Nil Kulkarni
. This is applicable to all the libraries of python (including numpy). On Wed, Oct 22, 2014 at 3:54 AM, Ted Dunning ted.dunn...@gmail.com wrote: On Tue, Oct 21, 2014 at 3:04 PM, Mahesh Balija balijamahesh@gmail.com wrote: I am trying to differentiate between Mahout and Spark, here is the small

Re: Mahout on Spark: random forest

2014-08-11 Thread Suneel Marthi
There is no Random Forest implementation on Spark in Mahout yet. MLlib has a Random Forests implementation; why can't you use that instead? On Tue, Aug 12, 2014 at 2:19 AM, Sameer Tilak ssti...@live.com wrote: Hi All, We are currently using Weka. I looked at the site and read briefly about experimental

RE: Mahout on Spark: random forest

2014-08-11 Thread Sameer Tilak
From what I read on the mailing lists, the RF implementation is on the roadmap and only decision tree support is included at present. However, I may be completely wrong. Date: Tue, 12 Aug 2014 02:24:21 +0530 Subject: Re: Mahout on Spark: random forest From: suneel.mar...@gmail.com

Re: Mahout on Spark: random forest

2014-08-11 Thread Dmitriy Lyubimov
I am not sure they have the forests yet. They have what seems to be a decently performant decision tree tho. On Mon, Aug 11, 2014 at 1:54 PM, Suneel Marthi suneel.mar...@gmail.com wrote: there is no Random Forest impl on Spark in Mahout yet. Ml-lib has a Random Forests impl why can't u use

RE: Mahout on Spark: random forest

2014-08-11 Thread Sameer Tilak
Yes, I started working with that this morning and will see how well it works for my problem. Date: Mon, 11 Aug 2014 14:01:12 -0700 Subject: Re: Mahout on Spark: random forest From: dlie...@gmail.com To: user@mahout.apache.org I am not sure they have the forests yet. They have what

Re: Mahout on Spark: random forest

2014-08-11 Thread SriSatish Ambati
problem. Date: Mon, 11 Aug 2014 14:01:12 -0700 Subject: Re: Mahout on Spark: random forest From: dlie...@gmail.com To: user@mahout.apache.org I am not sure they have the forests yet. They have what seems to be a decently performant decision tree tho. On Mon, Aug 11, 2014 at 1

Re: Mahout on Spark: random forest

2014-08-11 Thread Ted Dunning
H2O has a very high-quality, high-performance implementation of random forests. On Mon, Aug 11, 2014 at 1:49 PM, Sameer Tilak ssti...@live.com wrote: Hi All, We are currently using Weka. I looked at the site and read briefly about the experimental nature of Mahout on Spark. I

Re: Mahout on Spark?

2014-02-19 Thread Dmitriy Lyubimov
Unfortunately, methinks the prospects of something like a Mahout/MLlib merge seem very unlikely due to vastly divergent approaches to the basics of linear algebra (and other things). Just like one cannot grow a single tree out of two trunks -- not easily, anyway. It is fairly easy to port (and

Re: Mahout on Spark?

2014-02-19 Thread Sean Owen
Agree that 'merging' is so infeasible as to not make sense. Mahout has been ML on M/R and that's its thing, which seems fine. IMHO this project has been hurt by an active unwillingness to define scope, and pretending it's helpful to have little bits of lots of ideas and technologies. I also

Re: Mahout on Spark?

2014-02-19 Thread Dmitriy Lyubimov
PS: I am moving along on a cost optimizer for Spark-backed DRMs on some multiplicative pipelines that is capable of figuring out different cost-based rewrites, and on an R-like DSL that mixes in-core and distributed matrix representations and blocks, but it is painfully slow; I am really only doing it like a couple

Re: Mahout on Spark?

2014-02-19 Thread Gokhan Capan
, and streaming summarization stuff in Mahout to Spark-Streaming. Best, Gokhan On Wed, Feb 19, 2014 at 10:45 AM, Dmitriy Lyubimov dlie...@gmail.com wrote: PS I am moving along cost optimizer for spark-backed DRMs on some multiplicative pipelines that is capable of figuring different cost-based rewrites

Re: Mahout on Spark?

2014-02-19 Thread Sean Owen
learning based on mini-batches, and streaming summarization stuff in Mahout to Spark-Streaming. Best, Gokhan On Wed, Feb 19, 2014 at 10:45 AM, Dmitriy Lyubimov dlie...@gmail.com wrote: PS I am moving along cost optimizer for spark-backed DRMs on some multiplicative pipelines that is capable

Re: Mahout on Spark?

2014-02-19 Thread Sebastian Schelter
into vector based on the dictionary generated by seq2sparse before), machine learning based on mini-batches, and streaming summarization stuff in Mahout to Spark-Streaming. Best, Gokhan On Wed, Feb 19, 2014 at 10:45 AM, Dmitriy Lyubimov dlie...@gmail.com wrote: PS I am moving along cost optimizer

Mahout on Spark?

2014-02-18 Thread Ying Liao
I just wonder what the future of Mahout is. We are seeing new stuff from 0xdata and Skytree, and Spark is also designed for in-memory iterative analysis. What about Mahout? Will Mahout run on top of Spark in the future? Thanks, Ying Liao

Re: Mahout on Spark?

2014-02-18 Thread Harshit Bapna
I am very eager to know the same from the community. Thanks for bringing it up. --Harshit On Tue, Feb 18, 2014 at 1:08 PM, Ying Liao yliao...@gmail.com wrote: Just wonder what is the future of Mahout. We are seeing new stuff from 0xdata and skytree. And spark is also design for in-memory

Re: Mahout on Spark?

2014-02-18 Thread Mohit Singh
In general, if you are interested in machine learning, I think there is already a machine-learning-specific initiative on Spark called MLbase (http://www.mlbase.org/) and GraphX (http://amplab.github.io/graphx/) for GraphLab-style ML. On Tue, Feb 18, 2014 at 1:14 PM, Harshit Bapna

Re: Mahout on Spark?

2014-02-18 Thread Nick Pentreath
analysis/summarizers and Ted's new t-digest (though with some monoid-ification this can be applied in Spark fairly trivially) It would be really cool to see if a Spark backend for Mahout could be developed (I know Dmitriy has looked at this in respect of DistributedMatrix stuff), or at least parts

Re: Mahout on Spark?

2014-02-18 Thread Ted Dunning
On Tue, Feb 18, 2014 at 1:58 PM, Nick Pentreath nick.pentre...@gmail.com wrote: My (admittedly heavily biased) view is Spark is a superior platform overall for ML. If the two communities can work together to leverage the strengths of Spark, and the large amount of good stuff in Mahout (as well

Re: Mahout on Spark?

2014-02-18 Thread Dmitriy Lyubimov
yes, this is a popular initiative. On Tue, Feb 18, 2014 at 1:08 PM, Ying Liao yliao...@gmail.com wrote: Just wonder what is the future of Mahout. We are seeing new stuff from 0xdata and skytree. And spark is also design for in-memory iterative analysis. What about mahout? Will mahout run on

Re: Mahout on Spark?

2014-02-18 Thread Nick Pentreath
I know the Spark/MLlib devs can occasionally be quite set in their ways of doing certain things, but we'd welcome as many Mahout devs as possible to work together. It may be too late, but perhaps a GSoC project to look at a port of some stuff like the co-occurrence recommender and streaming k-means?

Re: Mahout on Spark?

2014-02-18 Thread Sebastian Schelter
I'm also convinced that Spark is a superior platform for executing distributed ML algorithms. We've had a discussion about a change from Hadoop to another platform some time ago, but at that point in time it was not clear which of the upcoming dataflow processing systems (Spark, Hyracks,