[
https://issues.apache.org/jira/browse/LUCENE-7686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael McCandless updated LUCENE-7686:
---------------------------------------
Attachment: LUCENE-7686.patch
Another iteration ... it was trickier than I first realized because I have to
handle the case where another (later) segment has the same suggestion with a
better score, and replace the previous one in the priority queue.
So I broke out the dedup handling to a separate collector,
{{DeduplicatingTopSuggestDocsCollector}}.
And I added a new randomized test case in addition to the dedicated specific
test case.
I think it's ready.
> NRT suggester should have option to filter out duplicates
> ---------------------------------------------------------
>
> Key: LUCENE-7686
> URL: https://issues.apache.org/jira/browse/LUCENE-7686
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Michael McCandless
> Assignee: Michael McCandless
> Fix For: master (7.0), 6.5
>
> Attachments: LUCENE-7686.patch, LUCENE-7686.patch
>
>
> Some of the other suggesters have this ability, and it's quite simple to add
> it to the NRT suggester as long as the thing we are filtering on is the
> suggest key itself, not e.g. another stored field from the document.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]