all fields are the same will be the duplicated documents. i wanted to remove those, first.
On Sun, Oct 22, 2023 at 8:27 AM ufuk yılmaz <uyil...@vivaldi.net.invalid> wrote: > When do you consider two documents are duplicates? When 1 field has the > same value, when multiple fields have the same value, or all fields etc? > > Sent from Mail for Windows > > From: Vince McMahon > Sent: Sunday, October 22, 2023 3:22 PM > To: users@solr.apache.org > Subject: what is SOLR syntax to remove duplicated documents > > I have a SOLR 8.X. I suspect one of the core has duplicates and wants to > remove the duplicated documents. Signature, as in the SOLR guide, is not > implemented. https://solr.apache.org/guide/6_6/de-duplication.html > > in sql, a query without the use of a hash column will be liked: > ;WITH CTE AS > ( > SELECT cols, > RN = ROW_NUMBER() OVER( PARTITION BY cols > ORDER BY updated DESC) > FROM [table] > ) > DELETE FROM CTE > WHERE RN > 1 > > what would be the syntax for SOLR query? > >