[ https://issues.apache.org/jira/browse/LUCENE-794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12607437#action_12607437 ]
Mark Miller commented on LUCENE-794:
------------------------------------

FYI: This has been applied to the trunk as well.

The SpanScorer needs a TokenStream because it shoves the stream into a MemoryIndex and transforms the query into a Span approximation to find hit positions. This approach was used for compatibility with the current API.

The QueryScorer simply scores each Token that appears in both the query and the TokenStream - so it just needs to extract the terms from the query and find overlap with the TokenStream passed to the getFragments method. This is not position sensitive.

The SpanScorer works the same way, but it also fills the MemoryIndex and gets matching Spans so that Terms in the wrong position score a 0 during Highlighter Term scoring (again getBestFragments or whatever).

The approach was mainly dictated by the old API. Fitting into the current API seemed the most practical/efficient way to get a position sensitive Highlighter in the short term.

On Mon, Jun 23, 2008 at 7:51 PM, Tavi Nathanson (JIRA) <[EMAIL PROTECTED]>

> Extend contrib Highlighter to properly support PhraseQuery, SpanQuery, ConstantScoreRangeQuery
> -----------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-794
>                 URL: https://issues.apache.org/jira/browse/LUCENE-794
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>             Fix For: 2.3.2
>
>         Attachments: MultiPhraseQueryExtraction.patch, SpanHighlighter-01-26-2008.patch, SpanHighlighter-01-28-2008.patch, SpanHighlighter-02-10-2008.patch, SpanHighlighter-RemovSysOut.patch, spanhighlighter.patch, spanhighlighter10.patch, spanhighlighter11.patch, spanhighlighter12.patch, spanhighlighter2.patch, spanhighlighter3.patch, spanhighlighter5.patch, spanhighlighter6.patch, spanhighlighter7.patch, spanhighlighter8.patch, spanhighlighter9.patch, spanhighlighter_24_January_2008.patch, spanhighlighter_patch_4.zip
>
>
> This patch adds a new Scorer class (SpanQueryScorer) to the Highlighter package that scores just like QueryScorer, but scores a 0 for Terms that did not cause the Query hit. This gives 'actual' hit highlighting for the range of SpanQuerys, PhraseQuery, and ConstantScoreRangeQuery. New Query types are easy to add. There is also a new Fragmenter that attempts to fragment without breaking up Spans.
> See http://issues.apache.org/jira/browse/LUCENE-403 for some background.
> There is a dependency on MemoryIndex.
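
To make the difference between the two scorers described in the comment concrete, here is a rough toy sketch in plain Java. It does not use the Lucene API at all: the class HighlightScoringSketch and both method names are made up for illustration, tokens are plain strings, and a naive exact-phrase scan stands in for the MemoryIndex/Spans lookup. It only shows why a term in the wrong position gets a 0 under position-sensitive scoring.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; not part of Lucene or the patch.
public class HighlightScoringSketch {

    // Term-overlap scoring (the QueryScorer idea): a token scores 1 if it
    // is one of the query terms, no matter where it occurs in the text.
    public static float[] termOverlapScores(List<String> tokens, List<String> queryTerms) {
        Set<String> terms = new HashSet<>(queryTerms);
        float[] scores = new float[tokens.size()];
        for (int i = 0; i < tokens.size(); i++) {
            scores[i] = terms.contains(tokens.get(i)) ? 1.0f : 0.0f;
        }
        return scores;
    }

    // Position-sensitive scoring (the SpanScorer idea): first find the
    // positions where the whole phrase matches, then score only tokens
    // inside a matching span. Everything else stays at 0.
    // Assumption: an exact in-order phrase scan replaces the real span lookup.
    public static float[] positionSensitiveScores(List<String> tokens, List<String> phrase) {
        float[] scores = new float[tokens.size()];
        int n = phrase.size();
        if (n == 0) {
            return scores;
        }
        for (int start = 0; start + n <= tokens.size(); start++) {
            if (tokens.subList(start, start + n).equals(phrase)) {
                for (int i = start; i < start + n; i++) {
                    scores[i] = 1.0f;
                }
            }
        }
        return scores;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("quick", "fox", "saw", "the", "quick", "brown", "fox");
        List<String> phrase = Arrays.asList("brown", "fox");
        System.out.println(Arrays.toString(termOverlapScores(tokens, phrase)));
        System.out.println(Arrays.toString(positionSensitiveScores(tokens, phrase)));
    }
}
```

With the phrase "brown fox", the stray "fox" at position 1 is scored by the overlap method but gets 0 from the position-sensitive one, since it is not part of a phrase match. Only the tokens at positions 5 and 6 are scored in both cases.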