Have you looked at the MoreLikeThis class in the similarity package?
On 8/30/06, Winton Davies <[EMAIL PROTECTED]> wrote:
Hi All,

I'm scratching my head - can someone tell me which class implements an efficient multiple term TF.IDF Cosine similarity scoring mechanism?

There is clearly the single TermScorer - but I can't find the class that would do a bucketed TF.IDF cosine - i.e. fill an accumulator with the tf.idf^2 for each of the term posting lists, until accumulator is full, and then compute the final score.

I don't need a Boolean Query - at least this seems like overkill.

Cheers,
Winton
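
For reference, the accumulator scheme described in the question can be sketched roughly as below. This is a plain Python illustration, not Lucene code. The names `cosine_scores`, `postings`, `doc_norms`, and `max_accumulators` are hypothetical, the idf formula is just one common variant, and skipping new documents once the accumulator table is full is only one possible policy.

```python
import math
from collections import defaultdict


def cosine_scores(query_terms, postings, num_docs, doc_norms, max_accumulators=None):
    """Term-at-a-time tf.idf cosine scoring with a bounded accumulator table.

    postings:  term -> {doc_id: term frequency}   (hypothetical in-memory index)
    doc_norms: doc_id -> precomputed length of the document's tf.idf vector
    """
    accumulators = defaultdict(float)

    # Visit the rarest terms first, so that a limited number of accumulators
    # is spent on the most discriminating posting lists.
    terms = sorted(
        (t for t in set(query_terms) if t in postings),
        key=lambda t: len(postings[t]),
    )

    for term in terms:
        plist = postings[term]
        idf = math.log(num_docs / len(plist)) + 1.0  # assumed idf variant
        for doc_id, tf in plist.items():
            full = max_accumulators is not None and len(accumulators) >= max_accumulators
            if full and doc_id not in accumulators:
                continue  # table is full: only update documents already tracked
            # Query weight (idf) times document weight (tf * idf) = tf * idf^2.
            accumulators[doc_id] += tf * idf * idf

    # Divide by the document norm to get the cosine. The query norm is the
    # same for every document, so it is left out because it cannot change the ranking.
    scored = ((doc_id, score / doc_norms[doc_id]) for doc_id, score in accumulators.items())
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))
```

Calling `cosine_scores(["a", "b"], postings, num_docs, doc_norms, max_accumulators=1000)` returns `(doc_id, score)` pairs, best first. No boolean query machinery is involved, at the cost of keeping one float per candidate document in memory.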