[ 
https://issues.apache.org/jira/browse/LUCENE-6880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Areek Zillur updated LUCENE-6880:
---------------------------------
    Attachment: LUCENE-6880.patch

Initial patch:
 - adds a document-oriented collector, TopSuggestionDocumentsCollector, with tests
 - adds a TopSuggestsCollector to collect docs if docId deduplication is 
acceptable


> Add document oriented collector for NRTSuggester
> ------------------------------------------------
>
>                 Key: LUCENE-6880
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6880
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Areek Zillur
>            Assignee: Areek Zillur
>             Fix For: Trunk, 5.4
>
>         Attachments: LUCENE-6880.patch
>
>
> Currently NRTSuggester collects completions iteratively as they are accepted 
> by the TopNSearcher, so a document can be collected more than once. When a 
> completion is indexed with multiple context values, it produces 
> {{num_context}} paths in the underlying FST for the same docId and is 
> collected {{num_context}} times when a query matches all of its contexts. 
> Ideally, a document-oriented collector would collect the top N documents 
> instead of the top N completions by deduplicating docIds while collecting 
> the completions. This could be used to collect N unique documents that match 
> a completion query. 
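
The docId deduplication described above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the code in the attached patch: UniqueDocCollector, Hit, collect() and topDocs() are hypothetical names, and the sketch assumes completions arrive best-first (as accepted paths would from a top-N search), so the first completion seen for a docId is kept and later ones for the same docId are skipped.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical sketch: collects at most n unique documents from a stream of completions. */
final class UniqueDocCollector {

    /** Hypothetical stand-in for a collected completion (not a Lucene class). */
    static final class Hit {
        final int docId;
        final String key;
        final float score;

        Hit(int docId, String key, float score) {
            this.docId = docId;
            this.key = key;
            this.score = score;
        }
    }

    private final int n;
    private final Set<Integer> seenDocIds = new HashSet<>();
    private final List<Hit> hits = new ArrayList<>();

    UniqueDocCollector(int n) {
        this.n = n;
    }

    /**
     * Offers one completion. Assumes callers offer completions in descending
     * score order, so the first completion per docId is the one to keep.
     *
     * @return true while fewer than n unique documents have been collected
     */
    boolean collect(int docId, String key, float score) {
        // Set.add returns false if the docId was already collected.
        if (hits.size() < n && seenDocIds.add(docId)) {
            hits.add(new Hit(docId, key, score));
        }
        return hits.size() < n;
    }

    /** Returns a copy of the collected hits, one per docId, in arrival order. */
    List<Hit> topDocs() {
        return new ArrayList<>(hits);
    }
}
```

With this shape, a completion indexed under several contexts reaches collect() once per matching context but occupies only one of the n slots, and the boolean return value lets the caller stop searching once n unique documents are found.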



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
