Highlighting of multi-valued fields returns snippets which span multiple
different values
-----------------------------------------------------------------------------------------
Key: SOLR-556
URL: https://issues.apache.org/jira/browse/SOLR-556
Project: Solr
Issue Type: Bug
Components: highlighter
Affects Versions: 1.3
Environment: Tomcat 5.5
Reporter: Lars Kotthoff
Priority: Minor
When highlighting multi-valued fields, the highlighter sometimes returns
snippets which span multiple values, e.g. with values "foo" and "bar" and
search term "ba" the highlighter will create the snippet "foo<em>ba</em>r".
Furthermore it sometimes returns smaller snippets than it should, e.g. with
value "foobar" and search term "oo" it will create the snippet "<em>oo</em>"
regardless of hl.fragsize.
I have been unable to determine the real cause for this, or indeed what
actually goes on at all. To reproduce the problem, I've used the following
steps:
* create an index with multi-valued fields, one document should have at least 3
values for these fields (in my case strings of length between 5 and 15 Japanese
characters -- as far as I can tell plain old ASCII should produce the same
effect though)
* search for part of a value in such a field with highlighting enabled, the
additional parameters I use are hl.fragsize=70, hl.requireFieldMatch=true,
hl.mergeContiguous=true (changing the parameters does not seem to have any
effect on the result though)
* highlighted snippets should show effects described above
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.