Hi, I'm new to elasticsearch and i wanted to do similarity ranking using it.
Requirement. need to index documents having two fields (field1 and field2) which are of free text. whenever a new document comes and indexed, needs find out how similar is with the existing documents based on a filed (say field1) and these similarities should be captured. And if the similarity reaches some X%, some action should be done. these steps should be done for all documents which are getting indexed. My approach 1. whenever a new document comes, it should be indexed first. 2. once its indexed, will start matching the document against existing documents using the field1. 3. on the search results, will check the score field for the similarity percentage and will be captured. 4. find scores which is x%, then do the required action. Could you please tell whether the approach taken is fine? or have any better way to perform the similarity ranking in such cases? Thanks -- View this message in context: http://elasticsearch-users.115913.n3.nabble.com/Need-help-on-similarity-ranking-approach-tp4054847.html Sent from the ElasticSearch Users mailing list archive at Nabble.com. -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/1398520399939-4054847.post%40n3.nabble.com. For more options, visit https://groups.google.com/d/optout.
