Hello, I am not sure that would work. I'd first index you document, and then use mlt with this document id and include set to true (added in latest ES release). Then you'll know how "far" your documents are from the queried document. Also, make sure to pick up most of the terms, by setting percent_terms_to_match=0, max_query_terms=high value and min_doc_freq=1. In order to know what terms from the queried document have matched in the response, you can use explain.
Alex On Thursday, May 29, 2014 10:42:47 AM UTC+2, Rgs wrote: > > hi, > > What i did now is, i have created a custom similarity & similarity > provider > class which extends DefaultSimilarity and AbstractSimilarityProvider > classes > respectively and overridden the idf() method to return 1. > > Now I'm getting some percentage values like 1, 0.987, 0.876 etc and > interpret it as 100%, 98%, 87% etc. > > Can you please confirm whether this approach can be taken for finding the > percentage of similarity? > > sorry for the late reply. > > Thanks > Rgs > > > > -- > View this message in context: > http://elasticsearch-users.115913.n3.nabble.com/Need-help-on-similarity-ranking-approach-tp4054847p4056680.html > > Sent from the ElasticSearch Users mailing list archive at Nabble.com. > -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/184a015f-fe68-4a24-999b-367d60d23798%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
