Mail archives are likely useful for a mail-based corpus. I agree with Andrzej about the rest of the docs, though.

On May 18, 2009, at 5:25 AM, André Warnier wrote:

Hi.
There has been an earlier suggestion here, later endorsed by someone else, to use the documentation of the Apache projects as a corpus. Being far from an expert, I am just naively wondering why the experts on this list seem to totally ignore it, without providing any argument. Is it somehow unsuitable, impractical, inappropriate, bad, unfeasible, useless, uninteresting or ... ?


--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using Solr/Lucene:
http://www.lucidimagination.com/search
