Otis Gospodnetic wrote:
OG: But Andrzej, you already wrote that indexing benchmark tool (which we never put anywhere in SVN, I'm afraid) that works on some freely available Reuters corpus, I believe. Why couldn't that be adapted for testing Lucene, Egothor, and MG4J?
Hmm, yes, indeed I have ... It was so long ago I nearly forgot about it. :) I need to dust it off and see if it's of any use. It used the 20newsgroups corpus (~19,000 items). It could use the Reuters corpus, just the parser would have to be implemented.
-- Best regards, Andrzej Bialecki <>< ___. ___ ___ ___ _ _ __________________________________ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]