Hi all,

I have some documents that look like this:
{
    "name": "",
    "image_url": "",
    "simHash": ""
}

There are many duplicate documents with the same `simHash` value. Is there a
way to remove the duplicates other than removing them at indexing time?

I have seen the solutions here:
http://stackoverflow.com/questions/24080846/removing-duplicates-from-search-results

However, I would like to know whether it is possible to write code within
Elasticsearch itself to do this job.
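
For reference, here is a minimal sketch of one query-time approach: a `terms` aggregation on `simHash` with a nested `top_hits` aggregation that keeps a single document per bucket. It assumes `simHash` is indexed as an exact (not analyzed) value. The names `build_dedup_query`, `extract_unique_docs`, `by_simhash` and `one_doc` are made up for illustration. It only builds the request body and reads a response of the usual aggregation shape; it is not a tested solution against a real cluster.

```python
import json


def build_dedup_query(field="simHash", max_groups=10):
    """Build a search body that returns one document per distinct `field` value.

    `max_groups` caps the number of buckets, so this is not exhaustive
    when there are more distinct values than that.
    """
    return {
        "size": 0,  # skip the normal hit list; results come from the aggregation
        "aggs": {
            "by_simhash": {  # hypothetical aggregation name
                "terms": {"field": field, "size": max_groups},
                "aggs": {
                    # keep only one representative document per simHash bucket
                    "one_doc": {"top_hits": {"size": 1}}
                },
            }
        },
    }


def extract_unique_docs(response):
    """Collect the single representative `_source` from each bucket."""
    buckets = response["aggregations"]["by_simhash"]["buckets"]
    return [b["one_doc"]["hits"]["hits"][0]["_source"] for b in buckets]


if __name__ == "__main__":
    # Print the request body so it can be sent to the _search endpoint.
    print(json.dumps(build_dedup_query(), indent=2))
```

This only hides duplicates in the results; the duplicate documents stay in the index.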

Many thanks!
