Have you tried using result grouping for your query? There are some very good 
examples in the wiki:

https://wiki.apache.org/solr/FieldCollapsing
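
As a rough sketch of how grouping could be applied to the Name field from your example: group on the field, then keep only the groups that contain more than one document. The parameters group, group.field, group.limit and rows are standard Solr request parameters; the helper names, the "id" field and the sample response below are only assumptions for illustration, and the response layout follows the default grouped format. Note that group.field expects a single-valued indexed field, and rows limits the number of groups returned.

```python
from urllib.parse import urlencode


def grouping_params(field, rows=100, group_limit=10):
    """Build Solr request parameters that group all documents by `field`."""
    return {
        "q": "*:*",                 # match every document
        "group": "true",            # turn on result grouping
        "group.field": field,       # one group per distinct field value
        "group.limit": group_limit, # max documents returned per group
        "rows": rows,               # max number of groups returned
        "wt": "json",
    }


def duplicate_groups(response, field):
    """Return {value: [docs]} for every group holding more than one document."""
    groups = response["grouped"][field]["groups"]
    return {
        g["groupValue"]: g["doclist"]["docs"]
        for g in groups
        if g["doclist"]["numFound"] > 1
    }


# Hypothetical grouped response for the five example documents.
sample_response = {
    "grouped": {
        "Name": {
            "matches": 5,
            "groups": [
                {"groupValue": "Peter", "doclist": {"numFound": 2, "docs": [
                    {"id": "Doc1", "Name": "Peter"},
                    {"id": "Doc3", "Name": "Peter"}]}},
                {"groupValue": "Jack", "doclist": {"numFound": 2, "docs": [
                    {"id": "Doc2", "Name": "Jack"},
                    {"id": "Doc5", "Name": "Jack"}]}},
                {"groupValue": "Paul", "doclist": {"numFound": 1, "docs": [
                    {"id": "Doc4", "Name": "Paul"}]}},
            ],
        }
    }
}

if __name__ == "__main__":
    # Query string to append to a select handler URL of your own core,
    # e.g. http://localhost:8983/solr/mycore/select (hypothetical).
    print(urlencode(grouping_params("Name")))
    for value, docs in duplicate_groups(sample_response, "Name").items():
        print(value, [d["id"] for d in docs])
```

The filtering happens on the client here because grouping alone returns every group, including those with a single document. If only the duplicated values are needed and not the documents, faceting on the field with facet.mincount=2 may be a lighter alternative.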


Gonzalo

-----Original Message-----
From: Peter Kirk [mailto:p...@alpha-solutions.dk] 
Sent: Tuesday, December 02, 2014 9:58 AM
To: solr-user@lucene.apache.org
Subject: Find duplicates

Hi

Is it possible to formulate a Solr query that finds all documents that have 
the same value in a particular field?
Note that I don't know what the value is; I just want to find all documents 
with duplicate values.

For example, I have 5 documents:

Doc1: field Name = Peter
Doc2: field Name = Jack
Doc3: field Name = Peter
Doc4: field Name = Paul
Doc5: field Name = Jack


If I executed the query, it would find documents Doc1 and Doc3 (both have 
Name = Peter), and Doc2 and Doc5 (both have Name = Jack).



Thanks,
Peter
