I'm also aware of a number of papers which at least used the code to crank out some results for other research: http://scholar.google.com/scholar?hl=en&q=mahout+'machine+learning'
On Sat, Oct 2, 2010 at 4:12 AM, Lance Norskog <[email protected]> wrote: > One of the northern European govt. studios (I think Finland) published a > general paper. They were doing text mining/research on subtitles. > > Subtitles offer a more natural chopped-up form of language than formal > grammatical writing. That could be a fun dataset. I don't know of any legal > way to collect them. > >
