[
https://issues.apache.org/jira/browse/LUCENE-7551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15655156#comment-15655156
]
David Smiley commented on LUCENE-7551:
--------------------------------------
Might you try another highlighter? I've spent a lot of time on the
UnifiedHighlighter recently and it certainly isn't susceptible to this problem.
Furthermore, you can use offsets in postings instead of full term vectors to
significantly reduce your index size while highlighting faster than FVH. The
UH was released in Lucene 6.3.0.
> FastVectorHighlighter ignores position in PhraseQuery
> -----------------------------------------------------
>
> Key: LUCENE-7551
> URL: https://issues.apache.org/jira/browse/LUCENE-7551
> Project: Lucene - Core
> Issue Type: Bug
> Components: core/search
> Reporter: Ferenczi Jim
> Priority: Minor
>
> The PhraseQuery contains a list of terms and the positions for each term. The
> FVH ignores the term position and assumes that a phrase query is always
> dense. As a result phrase query with gaps are not highlighted at all. This is
> problematic for text fields that use a FilteringTokenFilter. This token
> filter removes tokens but preserves the position increment of each removal.
> Bottom line is that using this token filter breaks the highlighting of phrase
> query that contains filtered tokens.
>
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]