On Oct 2, 2010, at 3:34 AM, Sean Owen wrote: > I'm also aware of a number of papers which at least used the code to crank > out some results for other research: > http://scholar.google.com/scholar?hl=en&q=mahout+'machine+learning'
Very cool. Didn't think to look there. > > On Sat, Oct 2, 2010 at 4:12 AM, Lance Norskog <[email protected]> wrote: > >> One of the northern European govt. studios (I think Finland) published a >> general paper. They were doing text mining/research on subtitles. >> >> Subtitles offer a more natural chopped-up form of language than formal >> grammatical writing. That could be a fun dataset. I don't know of any legal >> way to collect them. >> >> -------------------------- Grant Ingersoll http://lucenerevolution.org Apache Lucene/Solr Conference, Boston Oct 7-8
