Mail archives are likely useful for a mail-based corpus. I agree with
Andrzej about the rest of the docs, though.
On May 18, 2009, at 5:25 AM, André Warnier wrote:
Hi.
There has been an earlier suggestion here, later endorsed by someone
else, to use the documentation of the Apache projects as a corpus.
Being far from an expert, I am just naively wondering why the
experts on this list seem to totally ignore it, without providing
any argument.
Is it somehow unsuitable, impractical, inappropriate, bad,
unfeasible, useless, uninteresting or ... ?
--------------------------
Grant Ingersoll
http://www.lucidimagination.com/
Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)
using Solr/Lucene:
http://www.lucidimagination.com/search