[ http://issues.apache.org/jira/browse/LUCENE-627?page=comments#action_12438553 ] Yonik Seeley commented on LUCENE-627: -------------------------------------
I agree with Kerang about the expected behavior (of this specific case at least). The test case of mine quoted above was not what I was shooting for, but was an acceptable unintended side-effect of fixing the other cases. So I'm fine with this case being changed to query = new QueryParser("text",new WhitespaceAnalyzer()).parse("hi speed"); highlighter = new Highlighter(new QueryScorer(query)); result = highlighter.getBestFragments(getTS2(), s, 3, "..."); assertEquals("<B>Hi</B>-<B>Speed</B>10 foo",result); > highlighter problems with overlapping tokens > -------------------------------------------- > > Key: LUCENE-627 > URL: http://issues.apache.org/jira/browse/LUCENE-627 > Project: Lucene - Java > Issue Type: Bug > Components: Other > Affects Versions: 2.0.1 > Reporter: Yonik Seeley > Fix For: 2.0.1 > > Attachments: highlight_overlap.diff, Highlighter.java.diff > > > The lucene highlighter has problems when tokens that overlap are generated. > For example, if analysis of iPod generates the tokens "i", "pod", "ipod" > (with pod and ipod in the same position), > then the highlighter will output this as iipod, regardless of if any of those > tokens are highlighted. > Discovered via http://issues.apache.org/jira/browse/SOLR-24 -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]