That's a good idea and would be a nice starting point for my student. Forgive my ignorance, but where can I find the code? I can't see anything to download here:
http://lucene.apache.org/mahout/releases.html#Downloads (this seems empty right now) Miles 2008/5/23 Grant Ingersoll <[EMAIL PROTECTED]>: > > On May 23, 2008, at 10:00 AM, Miles Osborne wrote: > > I'm a new Mahout lurker and will be attending this >> >> As an aside, this summer I have two MSc projects implementing various >> machine learning stuff using Hadoop; the first one will probably be >> canopy >> clustering and >> > > Have you tried Mahout's canopy clustering? If it is close to what you > need, that would give you the time for the latter task. Of course, feedback > on our code would be great. > > > --time permitting-- discriminative training for log-linear >> models; the other project will look at simple smoothed relative frequency >> models (ie language modelling). >> >> Miles >> >> 2008/5/23 Grant Ingersoll <[EMAIL PROTECTED]>: >> >> Forwarded from someone on Hadoop: >>> http://upcoming.yahoo.com/event/506444/ >>> >>> Would be cool to have anyone in the UK from Mahout attend. >>> >>> -Grant >>> >>> >>> >> >> -- >> The University of Edinburgh is a charitable body, registered in Scotland, >> with registration number SC005336. >> > > > -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336.
