Hi, if you have documents that might exist in multiple collections, then you can use techniques from meta search. That is combining multiple search results from different collections. In this case, you can retrieve the top 100 or 1000 documents from each collection and merge them. You then rank documents by using some aggregation methods. It is known that using the sum of relevance scores produces good results.
If there are no shared documents between collections, you still can use the same approach but using different aggregation methods. One method is round robin. You start by selecting the first ranked document from each collection. Then, taking the second ranked document and so on. If that does not fit your needs, probably you should search for "federated or aggregated search techniques". These techniques are used by giant search engines to combine results from their search engine parts (images,video and web). You can find a lot of academic resources in these aspects. Regards Ameer -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-properly-correlate-relevance-in-a-search-across-multiple-collections-tp4157240p4157321.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org