In the FAQ it reads

 score_d = sum_t(tf_q * idf_t / norm_q * tf_d * idf_t / norm_d_t * boost_t)
* coord_q_d


1. I think the new document boost is missing, isn't it?
With that it should be something like

 score_d = sum_t(tf_q * idf_t / norm_q * tf_d * idf_t / norm_d_t * boost_t)
* coord_q_d * boost_d
Is that correct?


2. If I like the score to be independent of the number of terms in the
document (regarding them as essentially constant), is it enough to leave out
the norm_d_t factor?
I have seen that a norm factor between 0 and 255 is read with
IndexReader.norms() in TermScorer.score(). Is that the one?

>From what I further understand (and from digging in Witten/Moffat/Bell) the
norm_q factor is not calculated, since it stays the same for one query.

Just make some checkmarks, please :-)


Clemens






--------------------------------------
http://www.cmarschner.net





--------------------------------------
http://www.cmarschner.net

Reply via email to