[
https://issues.apache.org/jira/browse/SOLR-556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12603782#action_12603782
]
Lars Kotthoff commented on SOLR-556:
------------------------------------
In the setup I've been testing it with (one large single-valued "text" field
and several multi-valued fields) it didn't seem to have any serious performance
implications -- i.e. the randomness of my test queries was enough to mask any
loss of performance ;)
The length of the multi-valued fields is in the order of 10-20 characters on
average though and there're not many multiple different values.
I personally think that returning correct data is more important than
performance in this case, but that may just be because my particular setup
doesn't suffer any significant loss of performance. I didn't see any other way
to correct the behaviour of the current trunk code, but if anybody else has a
better idea how to do it, please let us know!
> Highlighting of multi-valued fields returns snippets which span multiple
> different values
> -----------------------------------------------------------------------------------------
>
> Key: SOLR-556
> URL: https://issues.apache.org/jira/browse/SOLR-556
> Project: Solr
> Issue Type: Bug
> Components: highlighter
> Affects Versions: 1.3
> Environment: Tomcat 5.5
> Reporter: Lars Kotthoff
> Assignee: Mike Klaas
> Priority: Minor
> Fix For: 1.3
>
> Attachments: SOLR-556-highlight-multivalued.patch,
> solr-highlight-multivalued-example.xml
>
>
> When highlighting multi-valued fields, the highlighter sometimes returns
> snippets which span multiple values, e.g. with values "foo" and "bar" and
> search term "ba" the highlighter will create the snippet "foo<em>ba</em>r".
> Furthermore it sometimes returns smaller snippets than it should, e.g. with
> value "foobar" and search term "oo" it will create the snippet "<em>oo</em>"
> regardless of hl.fragsize.
> I have been unable to determine the real cause for this, or indeed what
> actually goes on at all. To reproduce the problem, I've used the following
> steps:
> * create an index with multi-valued fields, one document should have at least
> 3 values for these fields (in my case strings of length between 5 and 15
> Japanese characters -- as far as I can tell plain old ASCII should produce
> the same effect though)
> * search for part of a value in such a field with highlighting enabled, the
> additional parameters I use are hl.fragsize=70, hl.requireFieldMatch=true,
> hl.mergeContiguous=true (changing the parameters does not seem to have any
> effect on the result though)
> * highlighted snippets should show effects described above
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.