Implement a custom similarity

Damerian Sun, 19 Feb 2012 10:47:16 -0800

Hello,

I am really new to Lucene, last week through this list i was reallysuccessfull into finding a solution to my problem.I have a new question now, i am trying to implement a new similarityclass that uses the Jaccard coefficient, i have been reading thejavadocs and a lot of other webpages on the matter, but my problem isthat i still cannot understand how to do it.So far i know that i have to subclass the DefaultSimilarity and (if i amnot wrong) i have to edit all the build in methods to return the corectscore. Since Jaccard coefficiency is the conjuction of thequery/document sets divided by the union of the two sets i think i onlyneed the coord(q,d) and all the rest measures in the default similaritycan return 1 to the score computation. My problem is that i cannotlocate how to obtain the number of terms that each document has.

Also do you think this approach is correct?

I would be gratefull if you could give me advice or point towards atutorial on the matter cause two days of searching were fruitless infinding an example code.

Thank you in advance.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Implement a custom similarity

Reply via email to