[ https://issues.apache.org/jira/browse/LUCENE-6880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Areek Zillur updated LUCENE-6880: --------------------------------- Attachment: LUCENE-6880.patch Initial patch: - adds a document-oriented collector: TopSuggestionDocumentsCollector w/ tests - adds TopSuggestsCollector collector to collect docs if docId deduplication is acceptable > Add document oriented collector for NRTSuggester > ------------------------------------------------ > > Key: LUCENE-6880 > URL: https://issues.apache.org/jira/browse/LUCENE-6880 > Project: Lucene - Core > Issue Type: Improvement > Reporter: Areek Zillur > Assignee: Areek Zillur > Fix For: Trunk, 5.4 > > Attachments: LUCENE-6880.patch > > > Currently NRTSuggester collects completions iteratively as they are accepted > by the TopNSearcher, implying that a document can be collected more than > once. In case of indexing a completion with multiple context values, the > completion leads to {{num_context}} paths in the underlying FST for the same > docId and gets collected {{num_context}} times, when a query matches all its > contexts. > Ideally, a document-oriented collector will collect top N documents instead > of top N completions by handling the docId deduplication while collecting the > completions. This could be used to collect n unique documents that matched a > completion query. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org