David Smiley created LUCENE-7578:
------------------------------------
Summary: UnifiedHighlighter: Convert PhraseHelper to use SpanCollector API
Key: LUCENE-7578
URL: https://issues.apache.org/jira/browse/LUCENE-7578
Project: Lucene - Core
Issue Type: Improvement
Components: modules/highlighter
Reporter: David Smiley
The UnifiedHighlighter's PhraseHelper currently collects position spans per
SpanQuery (it knows which terms belong to which SpanQuery) and then filters
each PostingsEnum against those spans. This is similar to how the original
Highlighter's WSTE (WeightedSpanTermExtractor) works. The main problem with
this approach is that it can be inaccurate for some nested span queries; see
LUCENE-2287, LUCENE-5455 (which has the clearest example), and LUCENE-6796.
Non-nested SpanQueries (e.g. those converted from a PhraseQuery or
MultiPhraseQuery) are _not_ a problem.
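
To make the inaccuracy concrete, here is a toy sketch in plain Java. It does
not use any Lucene classes, and it is not the example from the linked issues.
The class SpanFilterSketch and its methods are hypothetical names invented for
illustration. It assumes a nested query shaped like near(phrase(a, b), c) with
slop 1, and compares filtering term positions by the matched span window
against recording only the leaf positions that took part in the match, which
is roughly the idea behind a collector-style approach.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

// Toy model only: hypothetical names, no Lucene API involved.
public class SpanFilterSketch {

    // Finds in-order matches of near(phrase(first, second), third) where at most
    // `slop` tokens sit between the phrase and `third`.
    // Each match is {position of first, position of second, position of third}.
    public static List<int[]> findMatches(String[] tokens, String first, String second,
                                          String third, int slop) {
        List<int[]> matches = new ArrayList<>();
        for (int i = 0; i + 1 < tokens.length; i++) {
            if (!tokens[i].equals(first) || !tokens[i + 1].equals(second)) {
                continue;
            }
            int innerEnd = i + 2; // first position after the inner phrase
            for (int j = innerEnd; j < tokens.length && j - innerEnd <= slop; j++) {
                if (tokens[j].equals(third)) {
                    matches.add(new int[] {i, i + 1, j});
                }
            }
        }
        return matches;
    }

    // Window-based filtering: every query term inside a matched span is highlighted,
    // whether or not that occurrence contributed to the match.
    public static List<Integer> highlightByWindow(String[] tokens, Set<String> queryTerms,
                                                  List<int[]> matches) {
        TreeSet<Integer> hits = new TreeSet<>();
        for (int[] m : matches) {
            for (int pos = m[0]; pos <= m[2]; pos++) {
                if (queryTerms.contains(tokens[pos])) {
                    hits.add(pos);
                }
            }
        }
        return new ArrayList<>(hits);
    }

    // Collector-style: only the leaf positions recorded for each match are highlighted.
    public static List<Integer> highlightByLeaves(List<int[]> matches) {
        TreeSet<Integer> hits = new TreeSet<>();
        for (int[] m : matches) {
            for (int pos : m) {
                hits.add(pos);
            }
        }
        return new ArrayList<>(hits);
    }

    public static void main(String[] args) {
        String[] doc = {"a", "b", "a", "c"};
        List<int[]> matches = findMatches(doc, "a", "b", "c", 1);
        // The second "a" (position 2) falls inside the span window but is not part of the match.
        System.out.println(highlightByWindow(doc, Set.of("a", "b", "c"), matches)); // [0, 1, 2, 3]
        System.out.println(highlightByLeaves(matches));                             // [0, 1, 3]
    }
}
```

In this toy document the window-based filter also highlights the stray "a" at
position 2, while the leaf-based version does not. Whether the real
PhraseHelper misbehaves in exactly this way is an assumption; the linked
issues describe the actual cases.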