[
https://issues.apache.org/jira/browse/SOLR-553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12596793#action_12596793
]
Bojan Smid commented on SOLR-553:
---------------------------------
I made a fix, patch is uploaded. LUCENE-794 is now incorporated into default
Solr highlighter.
Old way of highlighting is still retained and will be used in case requests to
Solr Highlighter remain the same as they were (same request parameters). New
functionality is invoked by adding another request parameter to URL,
hl.usePhraseHighlighter=true.
So, for URL:
http://localhost:8983/solr/select?q=features:%22ax%20bx%20cx%22&hl=on&hl.fl=features&hl.fragsize=20&hl.snippets=10
results will be the same as they were, but in case you want to use this fix
(and have correct phrase highlighting), the URL would look like this:
http://localhost:8983/solr/select?q=features:%22ax%20bx%20cx%22&hl=on&hl.fl=features&hl.fragsize=20&hl.snippets=10&hl.usePhraseHighlighter=true
This patch needs latest lucene-highlighter-*.jar and lucene-memory-*.jar from
trunk (since LUCENE-794 fix is committed there).
> Highlighter does not match phrase queries correctly
> ---------------------------------------------------
>
> Key: SOLR-553
> URL: https://issues.apache.org/jira/browse/SOLR-553
> Project: Solr
> Issue Type: New Feature
> Components: highlighter
> Affects Versions: 1.2
> Environment: all
> Reporter: Brian Whitman
> Attachments: highlighttest.xml
>
>
> http://www.nabble.com/highlighting-pt2%3A-returning-tokens-out-of-order-from-PhraseQuery-to16156718.html
> Say we search for the band "I Love You But I've Chosen Darkness"
> .../selectrows=100&q=%22I%20Love%20You%20But%20I\'ve%20Chosen%20Darkness%22&fq=type:html&hl=true&hl.fl=content&hl.fragsize=500&hl.snippets=5&hl.simple.pre=%3Cspan%3E&hl.simple.post=%3C/span%3E
> The highlight returns a snippet that does have the name altogether:
> Lights (Live) : <span>I</span> <span>Love</span> <span>You</span> But
> <span>I've</span> <span>Chosen</span> <span>Darkness</span> :
> But also returns unrelated snips from the same page:
> Black Francis Shop "<span>I</span> Think <span>I</span> <span>Love</span>
> <span>You</span>"
> A correct highlighter should not return snippets that do not match the phrase
> exactly.
> LUCENE-794 (not yet committed, but seems to be ready) fixes up the problem
> from the Lucene end. Solr should get it too.
> Related: SOLR-575
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.