Re: setSimilarity on Query

Shailesh Kochhar Mon, 12 Nov 2007 20:51:35 -0800

Chris Hostetter wrote:

independent of the QueryParser aspects of your question, adding a setSimilarity method to the Query class would be a complete 180 from how it currently works.
Query classes have to have a getSimilarity method so that their Weight/Scorer have a way to access the similarity functions ... but every core type of query gets that similarity from the searcher being used when the query is executed.
if the Query class defined a "setSimilarity" then the similarity used by one query in a BooleanQuery might not be the same as another query in the same query structure ... queryNorms, idfs, tfs ... could all be completely nonsensical.

The getSimilarity() implementation in Query actually invokes Searcher.getSimilarity(), which by default returns the value of Similarity.getDefault().

IndexSearcher has a corresponding setSimilarity() method that overrides the returned value, which makes it convenient for what you're trying to accomplish.
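
A minimal sketch of that delegation chain, to make the lookup order concrete. The classes below are simplified stand-ins written for illustration, not the real org.apache.lucene.search types; only the method names getSimilarity, setSimilarity and getDefault come from the discussion above.

```java
// Simplified stand-ins for the classes discussed above; these are NOT the
// real Lucene types, only a model of who asks whom for the Similarity.
final class SimilarityDelegationSketch {

    /** Stand-in for Similarity: a name plus a process-wide default instance. */
    static class Similarity {
        private static final Similarity DEFAULT = new Similarity("default");
        final String name;

        Similarity(String name) { this.name = name; }

        static Similarity getDefault() { return DEFAULT; }
    }

    /** Stand-in for Searcher: owns the similarity used during a search. */
    static class Searcher {
        private Similarity similarity = Similarity.getDefault();

        void setSimilarity(Similarity similarity) { this.similarity = similarity; }

        Similarity getSimilarity() { return similarity; }
    }

    /** Stand-in for Query: keeps no similarity of its own, it asks the searcher. */
    static class Query {
        Similarity getSimilarity(Searcher searcher) { return searcher.getSimilarity(); }
    }

    public static void main(String[] args) {
        Searcher searcher = new Searcher();
        Query first = new Query();
        Query second = new Query();

        System.out.println(first.getSimilarity(searcher).name);  // prints "default"

        // Overriding on the searcher changes what every query sees.
        searcher.setSimilarity(new Similarity("custom"));
        System.out.println(first.getSimilarity(searcher).name);  // prints "custom"
        System.out.println(first.getSimilarity(searcher) == second.getSimilarity(searcher)); // prints "true"
    }
}
```

Because both queries resolve their similarity through the same searcher, they cannot disagree, which is the consistency property Chris describes for clauses of a BooleanQuery.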

There is, however, another point of discord -- which is the Weight associated with the Query (which is relevant if you want a different implementation of term weighting). Here the locus of control is inverted -- it is the Searcher which delegates to the Query in order to create the Weight. In order to change the scoring implementation one needs to implement a new Query class, a new Weight class, a new Similarity class and a new QueryParser.
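
To illustrate the inversion, here is a rough sketch with stand-in classes (again not the real Lucene types; the score method and the boost-only Weight are assumptions made up for the example). The searcher never picks the Weight itself, so the only way to change weighting is to subclass the query.

```java
// Stand-in model of "the Searcher delegates to the Query to create the Weight".
// Class shapes are hypothetical and far simpler than the real Lucene API.
final class WeightCreationSketch {

    /** Stand-in for Weight: here it only exposes a single number. */
    interface Weight {
        float value();
    }

    /** Stand-in for Searcher: it asks the query for a Weight instead of choosing one. */
    static class Searcher {
        float score(Query query) {
            return query.createWeight(this).value();
        }
    }

    /** Stand-in for a core query type with its built-in weighting. */
    static class Query {
        final float boost;

        Query(float boost) { this.boost = boost; }

        Weight createWeight(Searcher searcher) {
            return () -> boost;
        }
    }

    /** Changing the weighting means writing a new Query subclass. */
    static class SquaredBoostQuery extends Query {
        SquaredBoostQuery(float boost) { super(boost); }

        @Override
        Weight createWeight(Searcher searcher) {
            return () -> boost * boost;
        }
    }

    public static void main(String[] args) {
        Searcher searcher = new Searcher();
        System.out.println(searcher.score(new Query(3f)));             // prints "3.0"
        System.out.println(searcher.score(new SquaredBoostQuery(3f))); // prints "9.0"
    }
}
```

In the real code base the new Query class would also have to be produced by a custom QueryParser, which is why the change fans out across so many classes.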

A friendlier alternative I'd like to propose is a sort of Weight and Similarity factory which is provided either to the top level Query object that is returned from parsing -- or to the Searcher object that processes the query. The factory can then return Similarity and Weight implementations that are identical for all parts of the query and which are mutually consistent.

This would allow field specific Similarity and Weight implementations and would also be backwards compatible.
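
Roughly what I have in mind, as a sketch only. Every name here (ScoringFactory, PerFieldScoringFactory, similarityFor, weightFor) is hypothetical and nothing like it exists in Lucene today; Similarity and Weight are reduced to one method each so the example stays small.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch of the proposed factory; none of these types exist in Lucene.
final class ScoringFactorySketch {

    /** Stand-in for the term-frequency part of a Similarity. */
    interface Similarity {
        float tf(int freq);
    }

    /** Stand-in for a Weight: scores a term given its frequency. */
    interface Weight {
        float score(int freq);
    }

    /** The proposed extension point: a single source for both objects. */
    interface ScoringFactory {
        Similarity similarityFor(String field);

        Weight weightFor(String field);
    }

    /** Field-specific similarities with a fallback for unregistered fields. */
    static final class PerFieldScoringFactory implements ScoringFactory {
        private final Map<String, Similarity> byField = new HashMap<>();
        private final Similarity fallback;

        PerFieldScoringFactory(Similarity fallback) { this.fallback = fallback; }

        PerFieldScoringFactory register(String field, Similarity similarity) {
            byField.put(field, similarity);
            return this;
        }

        @Override
        public Similarity similarityFor(String field) {
            return byField.getOrDefault(field, fallback);
        }

        @Override
        public Weight weightFor(String field) {
            // The Weight is built from the same Similarity the factory hands
            // out for this field, so the two cannot drift apart.
            Similarity similarity = similarityFor(field);
            return freq -> similarity.tf(freq);
        }
    }

    public static void main(String[] args) {
        ScoringFactory factory =
                new PerFieldScoringFactory(freq -> (float) Math.sqrt(freq))
                        .register("title", freq -> freq > 0 ? 1.0f : 0.0f);

        System.out.println(factory.weightFor("body").score(9));  // prints "3.0"
        System.out.println(factory.weightFor("title").score(9)); // prints "1.0"
    }
}
```

A factory whose fallback behaves like the current default would leave existing scoring unchanged, which is the backwards-compatibility argument.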

A more logical extension point is probably along the lines of past discussion towards making all of the Similarity methods take in a field name (so you could have a "PerFieldSimilarityWrapper" type implementation) and/or changing Searchable.getSimilarity to take in a fieldname param.
i don't think anyone ever submitted a patch for either of those ideas though ... if you check the mailing list archives you'll see there were performance concerns about one of them (i think it was the first one, because some of those methods are in tight loops, which is unfortunate because it's the one that can be done in a backwards compatible way)
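
A rough sketch of what a "PerFieldSimilarityWrapper" type implementation could look like if every Similarity method took a field name. The FieldAwareSimilarity interface, its two methods and the square-root formulas are assumptions chosen for illustration, not an existing API or a submitted patch.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: a similarity whose methods all take the field name,
// plus a wrapper that dispatches per field. Not an existing Lucene API.
final class PerFieldSimilaritySketch {

    /** Assumed field-aware variant of the similarity functions. */
    interface FieldAwareSimilarity {
        float tf(String field, float freq);

        float lengthNorm(String field, int numTerms);
    }

    /** Illustrative default: square-root tf, inverse square-root length norm. */
    static class SqrtSimilarity implements FieldAwareSimilarity {
        @Override
        public float tf(String field, float freq) {
            return (float) Math.sqrt(freq);
        }

        @Override
        public float lengthNorm(String field, int numTerms) {
            return (float) (1.0 / Math.sqrt(numTerms));
        }
    }

    /** Variant that ignores field length, e.g. for short title fields. */
    static class NoLengthNormSimilarity extends SqrtSimilarity {
        @Override
        public float lengthNorm(String field, int numTerms) {
            return 1.0f;
        }
    }

    /** Picks a delegate by field name, falling back to a default. */
    static final class PerFieldSimilarityWrapper implements FieldAwareSimilarity {
        private final Map<String, FieldAwareSimilarity> byField = new HashMap<>();
        private final FieldAwareSimilarity fallback;

        PerFieldSimilarityWrapper(FieldAwareSimilarity fallback) { this.fallback = fallback; }

        PerFieldSimilarityWrapper register(String field, FieldAwareSimilarity similarity) {
            byField.put(field, similarity);
            return this;
        }

        // This lookup runs on every call, including calls made inside scoring loops.
        private FieldAwareSimilarity pick(String field) {
            return byField.getOrDefault(field, fallback);
        }

        @Override
        public float tf(String field, float freq) {
            return pick(field).tf(field, freq);
        }

        @Override
        public float lengthNorm(String field, int numTerms) {
            return pick(field).lengthNorm(field, numTerms);
        }
    }

    public static void main(String[] args) {
        FieldAwareSimilarity similarity =
                new PerFieldSimilarityWrapper(new SqrtSimilarity())
                        .register("title", new NoLengthNormSimilarity());

        System.out.println(similarity.lengthNorm("body", 4));  // prints "0.5"
        System.out.println(similarity.lengthNorm("title", 4)); // prints "1.0"
    }
}
```

The per-call map lookup in pick is the kind of extra work inside a tight loop that the performance concerns were about.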





