Solr already support de-duplication when adding new documents. You can refer to the doc at https://cwiki.apache.org/confluence/display/solr/De-Duplication
On Tue, Sep 20, 2016 at 12:18 PM Vjeran Marcinko < vjeran.marci...@email.t-com.hr> wrote: > Hello, > > I'm pretty much Lucene newb, so wondering for some short guidelines on > how to implement some duplicate document filtering based on some field > which defines uniqueness, and first document stays, other duplicates are > filtered out? > > I know some 3rd party contrib lib existed before which was for that, but > it has been abandoned/deprecated for these newer versions of Lucene. > > Regards, > Vjeran > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > >