Re: Removing duplicate documents from search results

pravesh Thu, 23 Jun 2011 03:58:28 -0700

Would you care to even index the duplicate documents? Finding duplicacy in
content fields would be not so easy as in some untokenized/keyword field.
May be you could do this filtering at indexing time before sending the
document to SOLR. Then the question comes, which one document should go(from
a group of duplicates)?? The latest one?


--
View this message in context: 
http://lucene.472066.n3.nabble.com/Removing-duplicate-documents-from-search-results-tp3099214p3099432.html
Sent from the Solr - User mailing list archive at Nabble.com.

Re: Removing duplicate documents from search results

Reply via email to