I'm not sure you can use the score to determine % similarity. You certainly can for each new incoming document, run a more like this query against your index (and specify a bunch of parameters like percent_terms_to_match) to perhaps achieve something closer to what you want?
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/query-dsl-mlt-query.html#query-dsl-mlt-query -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/97e2f5bf-1c95-4775-a894-74650cccde12%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
