On Donnerstag 18 Mai 2006 18:36, Ken Krugler wrote:

 > >Could someone describe how the results from multiple indices are merged
 > when using a MultiSearcher? My naive intuition is that the scores for
 > documents found in each index could be wildly different, so what
 > criteria is used to merge the scored docs?

 I believe they are blindly merged.

 Which means that the IDFs for terms between multiple indices must be
 > relatively equal, otherwise the results will be skewed.

That was the case in Lucene <= 1.4, in 1.9 it should be fixed, i.e. merging
is supposed to work correctly.

Sorry, you're right. In Lucene 1.9 the MultiSearcher has been improved. In Nutch, which uses its own implementation, this is still a problem as per:

http://issues.apache.org/jira/browse/NUTCH-92

-- Ken
--
Ken Krugler
Krugle, Inc.
+1 530-210-6378
"Find Code, Find Answers"

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to