Robert,

We are using the TREC-3 data with Ad Hoc topics 151-200. The relevance judgments (qrels) list contains 97,319 entries, covering 68,559 unique document ids. The TIPSTER collection used in TREC-3 contains around 750,000 documents.
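For reference, here is a minimal sketch of how counts like these can be derived from a qrels file. It assumes the usual four-column layout (topic, iteration, document id, relevance) separated by whitespace. The class name `QrelsStats` and the file path argument are hypothetical, chosen only for illustration. With 50 topics, 97,319 entries works out to roughly 1,946 judged documents per topic.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

public class QrelsStats {
    int entries = 0;                                        // judged (topic, document) pairs
    final Set<String> uniqueDocs = new HashSet<>();         // distinct document ids
    final Map<String, Integer> perTopic = new TreeMap<>();  // judged documents per topic

    // Assumption: each line reads "topic iteration docno relevance".
    static QrelsStats summarize(List<String> lines) {
        QrelsStats stats = new QrelsStats();
        for (String line : lines) {
            String[] cols = line.trim().split("\\s+");
            if (cols.length < 4) {
                continue; // skip blank or malformed lines
            }
            stats.entries++;
            stats.uniqueDocs.add(cols[2]);
            stats.perTopic.merge(cols[0], 1, Integer::sum);
        }
        return stats;
    }

    public static void main(String[] args) throws IOException {
        // args[0] is a hypothetical path to the qrels file.
        List<String> lines = Files.readAllLines(Paths.get(args[0]), StandardCharsets.UTF_8);
        QrelsStats s = summarize(lines);
        System.out.println("entries=" + s.entries
                + " uniqueDocs=" + s.uniqueDocs.size()
                + " topics=" + s.perTopic.size());
    }
}
```

The `uniqueDocs` set is also the list of document ids that option (b) below would index, and `perTopic` gives the size of each per-topic subset in option (c).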
Should we:

(a) index the entire 750,000-document collection,
(b) index only the 68,559 unique documents listed in the qrels, or
(c) limit the index for each topic to the documents listed for that topic in the qrels (about 2,000 documents)?

Thanks,
Ivan