Re: Cloudera announces Oryx

2013-11-12 Thread Isabel Drost-Fromm
Sebastian, thanks for providing your perspective. On Tuesday, November 12, 2013 07:18:43 PM Sebastian Schelter wrote: > The lead developer of Impala answered the question whether Impala accepts > patches with the statement that Impala is developed by Cloudera engineers > and others can only look

Re: Cloudera announces Oryx

2013-11-12 Thread Dmitriy Lyubimov
On Tue, Nov 12, 2013 at 10:18 AM, Sebastian Schelter wrote: > > > @Sean > > However, I also cannot understand why Cloudera and you need to start a > new open source project that in many ways mirrors what mahout offers. > Why not contribute the algorithm implementations (the computation layer) > t

Re: Cloudera announces Oryx

2013-11-12 Thread Isabel Drost-Fromm
On Tuesday, November 12, 2013 10:33:39 AM Amir Sedighi wrote: > Seems Oryx is a Cloudera version of Myrrix. Is there any improvement list? For general design - how about using dev@mahout. For specific needs - how about filing tickets in JIRA. Best way to get improvements not only talked about b

Re: Cloudera announces Oryx

2013-11-12 Thread Sean Owen
On Tue, Nov 12, 2013 at 6:18 PM, Sebastian Schelter wrote: > However, I also cannot understand why Cloudera and you need to start a > new open source project that in many ways mirrors what mahout offers. > Why not contribute the algorithm implementations (the computation layer) > to mahout and bui

Re: Cloudera announces Oryx

2013-11-12 Thread Amir Sedighi
Seems Oryx is a Cloudera version of Myrrix.  Is there any improvement list? Regards, Amir. On Tuesday, November 12, 2013 8:46 PM, Isabel Drost-Fromm wrote: On Tuesday, November 12, 2013 04:27:48 PM Sean Owen wrote: > I like the benchmark sentiment. The two projects actually have little > o

Re: Cloudera announces Oryx

2013-11-12 Thread Sebastian Schelter
@Ted I don't see Cloudera buying Sean out of Mahout. As, I recall it, Sean stepped down as PMC Chair after a discussion on the future of mahout, where he saw his future vision for the project not concur with that of the others. He reduced his engagement with mahout and built myrrix first on his ow

Re: Cloudera announces Oryx

2013-11-12 Thread Andrew Musselman
I'd like to congratulate Sean and Cloudera on shipping a system that does a few things well and then lets you put them into production easily. This feels like the direction Mahout ought to go as well, and the group's been going toward a simpler system recently. My reason for using Mahout is that

Re: Cloudera announces Oryx

2013-11-12 Thread Isabel Drost-Fromm
On Tuesday, November 12, 2013 04:27:48 PM Sean Owen wrote: > I like the benchmark sentiment. The two projects actually have little > overlap in functionality, which is the essence of the reason why it's > a different project. One starting point of discussion for this dev list I would see valuable

Re: Cloudera announces Oryx

2013-11-12 Thread Sean Owen
On Tue, Nov 12, 2013 at 4:02 PM, Manuel Blechschmidt wrote: > It would be nice if Cloudera could publish some benchmarks. Cloudera vs. > Mahout vs. SAP HANA PAL vs. SPSS to give somebody the chances to enhance > Mahout in a way that it can catch up. Does this need to be a "versus" thing? I and

Re: Cloudera announces Oryx

2013-11-12 Thread Manuel Blechschmidt
Hallo Ted, hello Sean, I appreciate both of your work. I am not a contributor of code at all and I am just spreading the word around Mahout and creating some documentation and demo projects. I can understand that it is difficult to integrate the interests of an employer and of an open source pr

Re: Cloudera announces Oryx

2013-11-12 Thread Sean Owen
On Tue, Nov 12, 2013 at 2:13 PM, Ted Dunning wrote: > Cloudera's primary influence is to get you to ask to go emeritus, i.e. stop > contributing. > > You have contributed in the past. That's great. And now you work for > Cloudera. I started building on a new code base and left the PMC from abou

Re: Cloudera announces Oryx

2013-11-12 Thread Ted Dunning
On Tue, Nov 12, 2013 at 1:46 PM, Sean Owen wrote: > I think I'm the biggest single contributor to Mahout over time (? was > at one point), and so by extension Cloudera is. And this new project > is all open source. Surely that's maximally "walking the walk" in > these regards? > Absolutely not.

Re: Cloudera announces Oryx

2013-11-12 Thread Sean Owen
I think I'm the biggest single contributor to Mahout over time (? was at one point), and so by extension Cloudera is. And this new project is all open source. Surely that's maximally "walking the walk" in these regards? Mahout has served well for a long time as measured in Hadoop-years -- like 4+

Cloudera announces Oryx

2013-11-12 Thread Ted Dunning
Sean writes: We release Oryx today -- get some. #cloudera > #oryx > The Oryx open source project provides simple, real-time large-scale > machine learning infrastructure. It implements a few classes of algorithm > commonly