Hi, 

I'm new to Elasticsearch and I want to use it for similarity ranking.

Requirement. 

I need to index documents that have two free-text fields (field1 and field2).
Whenever a new document arrives and is indexed, I need to find out how
similar it is to the existing documents based on one field (say field1), and
record these similarities. If a similarity reaches some threshold of X%,
some action should be taken. These steps should run for every document
that gets indexed.

My approach 

1. Whenever a new document arrives, index it first.
2. Once it is indexed, match the document against the existing
documents using field1.
3. In the search results, read the score field as the similarity
percentage and record it.
4. Find the scores that reach X%, then perform the required action.
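
To make steps 2 to 4 concrete, here is a rough Python sketch. It only builds the request body and post-processes a search response, so it runs without a cluster. Several things in it are assumptions and not part of the post above: the helper names (build_similarity_query, similar_documents), the use of a more_like_this query (whose parameter names differ between Elasticsearch versions, e.g. older releases use like_text instead of like), and above all the conversion to a percentage. The _score returned by Elasticsearch is an unbounded relevance value, not a percentage, so the sketch divides each hit's score by the score the new document gets when it matches itself. That is only a heuristic.

```python
from typing import Any


def build_similarity_query(text: str, field: str = "field1", size: int = 10) -> dict[str, Any]:
    """Build a search body that looks for documents similar to `text`.

    Assumption: a more_like_this query on a single field. Parameter names
    follow recent Elasticsearch versions and may need adjusting.
    """
    return {
        "size": size,
        "query": {
            "more_like_this": {
                "fields": [field],
                "like": text,
                "min_term_freq": 1,  # consider terms that occur only once
                "min_doc_freq": 1,   # consider terms found in a single document
            }
        },
    }


def similar_documents(
    response: dict[str, Any], new_doc_id: str, threshold_pct: float
) -> list[tuple[str, float]]:
    """Return (doc_id, percentage) pairs at or above the threshold.

    Heuristic (an assumption, not an Elasticsearch feature): each score is
    expressed relative to the score of the new document matching itself.
    """
    hits = response["hits"]["hits"]
    self_score = next((h["_score"] for h in hits if h["_id"] == new_doc_id), None)
    if not self_score:
        return []  # the new document was not among the hits, nothing to compare with
    matches = []
    for hit in hits:
        if hit["_id"] == new_doc_id:
            continue  # skip the document we just indexed
        pct = min(100.0, 100.0 * hit["_score"] / self_score)
        if pct >= threshold_pct:
            matches.append((hit["_id"], round(pct, 1)))
    return matches


# A hand-written response in the shape a search call returns (made-up data).
sample_response = {
    "hits": {
        "hits": [
            {"_id": "new-doc", "_score": 8.0},
            {"_id": "doc-1", "_score": 6.0},
            {"_id": "doc-2", "_score": 2.0},
        ]
    }
}

query_body = build_similarity_query("some free text from field1")
matches = similar_documents(sample_response, "new-doc", threshold_pct=70.0)
```

With the made-up response above and a threshold of 70%, only doc-1 would be kept. The query body would be sent to the search endpoint of the index holding the documents, and the action from step 4 would then run for each returned pair.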


Could you please tell me whether this approach is fine, or whether there is a
better way to do similarity ranking in such cases?

Thanks



