Otis Gospodnetic wrote:
OG: But Andrzej, you already wrote that indexing benchmark tool (which we never 
put anywhere in SVN, I'm afraid) that works on some freely available Reuters 
corpus, I believe.  Why couldn't that be adapted for testing Lucene, Egothor, 
and MG4J?

Hmm, yes, indeed I have ... It was so long ago I nearly forgot about it. :) I need to dust it off and see if it's of any use. It used the 20newsgroups corpus (~19,000 items). It could use the Reuters corpus, just the parser would have to be implemented.

--
Best regards,
Andrzej Bialecki     <><
___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to