On Oct 2, 2010, at 3:34 AM, Sean Owen wrote:

> I'm also aware of a number of papers which at least used the code to crank
> out some results for other research:
> http://scholar.google.com/scholar?hl=en&q=mahout+'machine+learning'

Very cool.  Didn't think to look there.  

> 
> On Sat, Oct 2, 2010 at 4:12 AM, Lance Norskog <[email protected]> wrote:
> 
>> One of the northern European govt. studios (I think Finland) published a
>> general paper. They were doing text mining/research on subtitles.
>> 
>> Subtitles offer a more natural chopped-up form of language than formal
>> grammatical writing. That could be a fun dataset. I don't know of any legal
>> way to collect them.
>> 
>> 

--------------------------
Grant Ingersoll
http://lucenerevolution.org Apache Lucene/Solr Conference, Boston Oct 7-8

Reply via email to