Thanx, but I'm not looking at de-deplication while adding documents, but
de-duplication while querying.
There is DuplicateFilter in contrib lib, but filters are not used
anymore in newer Lucene versions, so no luck there... :(
I assume I would maybe ned to implement my own Collector, but it s
Solr already support de-duplication when adding new documents. You can
refer to the doc at
https://cwiki.apache.org/confluence/display/solr/De-Duplication
On Tue, Sep 20, 2016 at 12:18 PM Vjeran Marcinko <
vjeran.marci...@email.t-com.hr> wrote:
> Hello,
>
> I'm pretty much Lucene newb, so wonderi
Hello,
I'm pretty much Lucene newb, so wondering for some short guidelines on
how to implement some duplicate document filtering based on some field
which defines uniqueness, and first document stays, other duplicates are
filtered out?
I know some 3rd party contrib lib existed before which w