Solr already support de-duplication when adding new documents. You can
refer to the doc at
https://cwiki.apache.org/confluence/display/solr/De-Duplication

On Tue, Sep 20, 2016 at 12:18 PM Vjeran Marcinko <
[email protected]> wrote:

> Hello,
>
> I'm pretty much Lucene newb, so wondering for some short guidelines on
> how to implement some duplicate document filtering based on some field
> which defines uniqueness, and first document stays, other duplicates are
> filtered out?
>
> I know some 3rd party contrib lib existed before which was for that, but
> it has been abandoned/deprecated for these newer versions of Lucene.
>
> Regards,
> Vjeran
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>
>

Reply via email to