On Feb 22, 2008, at 7:10 AM, Christian Kohlschütter (JIRA) wrote:
[ https://issues.apache.org/jira/browse/LUCENE-954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12571367
#action_12571367 ]
Christian Kohlschütter commented on LUCENE-954:
-----------------------------------------------
Grant,
sorry I was perhaps not too clear about it.
The distribution of scores of one Hits instance is currently not
comparable to another distribution of scores of another Hits object,
even if the underlying statistics are comparable/compatible/
identical. This is due to the case that the values are always
normalized to a maximum of 1.0.
As I said, my Federated Search system provides homogeneous
statistics (but not via MultiSearcher). In fact, it does not use
MultiSearcher for this, but a variant of the SRU/SRW/XCQL protocols
("SRX/FS"), where all communication is done via HTTP and XML. This
includes the exchange of Term/DF statistics. At the end, the system
makes several distributed Indexes appear as a single (read:
federated) index. In order to merge the results from each index,
Hits is used.
Gotcha. It wasn't clear that you were exchanging the searchers were
aware of global statistics. I think the use of "federated" is what
through me off. I have understood your description to be better
called "distributed search", whereas federated search is more along
the lines of disparate indexes that don't share statistics. I know,
"you say tomato,..."
In the distributed case, yes, it does make sense to be able to compare
the scores.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]