Robert Muir created LUCENE-4798:
-----------------------------------
Summary: PostingsHighlighter's formatter sometimes doesn't
highlight matched terms
Key: LUCENE-4798
URL: https://issues.apache.org/jira/browse/LUCENE-4798
Project: Lucene - Core
Issue Type: Bug
Components: modules/highlighter
Reporter: Robert Muir
This can happen when the query terms match several times within the same
sentence.
For example, if you query on "testing highlighter" and the document contains
"Testing highlighters is sometimes harder than testing other things."
The issue is that the formatter receives all 3 matches, but in this order:
Testing (first occurrence)
testing (second occurrence)
highlighters
The formatter expects the matches to be in sorted order by offset (not by term,
then offset). This is how the javadocs say they should be.
But there is currently a bug, a stupid side effect of how the ranking is done.
Because of this, in this example "highlighters" isn't marked up in bold.
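
The effect of the ordering can be reproduced with a small standalone sketch. This is not the PostingsHighlighter code: the Match class, the format method and the <b> markup are hypothetical stand-ins, and the sketch assumes a formatter that walks the sentence left to right and skips any match that starts before its current position. Under that assumption, matches delivered by term and then offset lose "highlighters", while the same matches sorted by offset are all marked up.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

class OffsetOrderSketch {
    /** Hypothetical match holder for illustration; not a Lucene class. */
    static final class Match {
        final String term;
        final int start; // inclusive character offset
        final int end;   // exclusive character offset

        Match(String term, int start, int end) {
            this.term = term;
            this.start = start;
            this.end = end;
        }
    }

    static final String TEXT =
        "Testing highlighters is sometimes harder than testing other things.";

    /** The three matches in the order from the report: by term, then by offset. */
    static List<Match> termThenOffsetOrder() {
        int first = 0;                              // "Testing"
        int second = TEXT.indexOf("testing");       // lowercase occurrence
        int third = TEXT.indexOf("highlighters");
        List<Match> matches = new ArrayList<>();
        matches.add(new Match("testing", first, first + "Testing".length()));
        matches.add(new Match("testing", second, second + "testing".length()));
        matches.add(new Match("highlighter", third, third + "highlighters".length()));
        return matches;
    }

    /** Returns a copy of the matches sorted by start offset only. */
    static List<Match> sortedByOffset(List<Match> matches) {
        List<Match> copy = new ArrayList<>(matches);
        copy.sort(Comparator.comparingInt((Match m) -> m.start));
        return copy;
    }

    /** Simplified left-to-right formatter (an assumption, not Lucene's formatter). */
    static String format(String text, List<Match> matches) {
        StringBuilder sb = new StringBuilder();
        int pos = 0;
        for (Match m : matches) {
            if (m.start < pos) {
                continue; // already past this offset, so the match is dropped
            }
            sb.append(text, pos, m.start);
            sb.append("<b>").append(text, m.start, m.end).append("</b>");
            pos = m.end;
        }
        sb.append(text.substring(pos));
        return sb.toString();
    }

    public static void main(String[] args) {
        List<Match> buggy = termThenOffsetOrder();
        System.out.println(format(TEXT, buggy));                 // "highlighters" stays plain
        System.out.println(format(TEXT, sortedByOffset(buggy))); // all three matches in bold
    }
}
```

Sorting by start offset before formatting is only one way to satisfy the documented contract; the sketch does not show where the sort belongs in the highlighter itself.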