Hi all,
I have some documents that look like:
{
"name": "",
"image_url": "",
"simHash": ""
}
Many of these documents are duplicates that share the same `simHash` value.
Is there a way to remove the duplicates after the fact, rather than
filtering them out at indexing time?
I have seen the solutions here:
http://stackoverflow.com/questions/24080846/removing-duplicates-from-search-results
However, I would like to know whether I can write code that runs inside
Elasticsearch itself to do this job.
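To make the goal concrete, here is a rough sketch of the query-time approach as I understand it: a `terms` aggregation on `simHash` with a `top_hits` sub-aggregation that keeps one document per hash. The helper names, the aggregation names (`by_hash`, `one_doc`) and the bucket limit are my own, it assumes `simHash` is indexed as an exact-value (not analyzed) field, and the parsing step runs against a hand-made sample response rather than a real cluster.

```python
import json


def build_dedup_query(field="simHash", max_buckets=1000):
    """Build a search body with one bucket per distinct hash value."""
    return {
        "size": 0,  # skip the normal hit list; only the aggregation is needed
        "aggs": {
            "by_hash": {
                # max_buckets is an arbitrary cap chosen for this sketch
                "terms": {"field": field, "size": max_buckets},
                # keep a single representative document per bucket
                "aggs": {"one_doc": {"top_hits": {"size": 1}}},
            }
        },
    }


def unique_docs(response):
    """Return the representative document from each hash bucket."""
    buckets = response["aggregations"]["by_hash"]["buckets"]
    return [b["one_doc"]["hits"]["hits"][0]["_source"] for b in buckets]


# Hand-made fragment shaped like an aggregation response (values are made up)
sample_response = {
    "aggregations": {
        "by_hash": {
            "buckets": [
                {
                    "key": "abc",
                    "doc_count": 3,
                    "one_doc": {"hits": {"hits": [
                        {"_source": {"name": "a", "image_url": "", "simHash": "abc"}}
                    ]}},
                },
                {
                    "key": "def",
                    "doc_count": 1,
                    "one_doc": {"hits": {"hits": [
                        {"_source": {"name": "b", "image_url": "", "simHash": "def"}}
                    ]}},
                },
            ]
        }
    }
}

print(json.dumps(build_dedup_query(), indent=2))
print(unique_docs(sample_response))
```

This only hides the duplicates in the search results; the duplicate documents themselves stay in the index, which is why I am asking whether something can be done inside Elasticsearch.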
Many thanks!