LeafCollector

2016-12-01 Thread Matt Hicks
I'm trying to write a LeafCollector that filters out duplicates for a specific field. However, looking at the JavaDoc for `collect` it says not to call `IndexSearch.doc` or `IndexReader.document`. How am I supposed to determine the value of a field and then exclude it?

Re: LeafCollector

2016-12-01 Thread Michael McCandless
Lucene used to have a DuplicateFilter to do this, but we removed it recently ... see https://issues.apache.org/jira/browse/LUCENE-6633 for some discussion as to why. Mike McCandless http://blog.mikemccandless.com On Thu, Dec 1, 2016 at 2:39 PM, Matt Hicks wrote: > I'm trying to write a LeafCol