This does sound interesting. I too have been playing with Spark, porting the Mahout random decision forest implementation I've been working on [1] to it.
-Andy

[1] https://issues.apache.org/jira/browse/MAHOUT-1153

On 13 March 2013 20:29, Nick Pentreath <[email protected]> wrote:
> You'll all be the first to know as soon as I do have anything useful! (Well
> to be fair probably Spark mailing list might be first :).
>
> N
>
> Sent from my iPhone
>
> On 13 Mar 2013, at 17:54, Ted Dunning <[email protected]> wrote:
>
>> Stick around!
>>
>> We would love to see the fruits of this.
>>
>>
>> On Wed, Mar 13, 2013 at 1:01 AM, Nick Pentreath
>> <[email protected]> wrote:
>>
>>> The main point of interest in this context is that I intend to build a
>>> minimal first-cut machine learning library for Spark. This is likely to
>>> involve porting / using parts of Mahout where it makes sense (or at the
>>> very least taking major cues from Mahout implementations, as well as other
>>> ML libraries). On the Java side it's also highly probable it would use
>>> mahout-math (although there are some other options).
>>>
>>> I hope to start getting into this properly over the next couple of months.
>>> So if anyone is interested on collaborating, I would very much welcome
>>> help, discussion, idea-bouncing, etc.
>>>

--
Dr Andy Twigg
Junior Research Fellow, St Johns College, Oxford
Room 351, Department of Computer Science
http://www.cs.ox.ac.uk/people/andy.twigg/
[email protected] | +447799647538
