I'm trying to write a LeafCollector that filters out duplicates for a
specific field. However, looking at the JavaDoc for `collect` it says not
to call `IndexSearch.doc` or `IndexReader.document`. How am I supposed to
determine the value of a field and then exclude it?
Lucene used to have a DuplicateFilter to do this, but we removed it
recently ... see https://issues.apache.org/jira/browse/LUCENE-6633 for
some discussion as to why.
Mike McCandless
http://blog.mikemccandless.com
On Thu, Dec 1, 2016 at 2:39 PM, Matt Hicks wrote:
> I'm trying to write a LeafCol