Re: Highlighter that works with phrase and span queries

Mark Miller Wed, 27 Jun 2007 05:00:15 -0700


markharw00d wrote:

I was thinking along the lines of wrapping some core classes such as IndexReader to somehow observe the query matching process and deduce from that what to highlight (avoiding the need for MemoryIndex) but I'm not sure that is viable. It would be nice to get some more match info out of the main query logic as it runs to aid highlighting rather than reverse engineering the basis of a match after the event.

I have been thinking about a way to pursue this, and it does not seem clear that there is a nice solution. Even if you could wrap Querys or other classes to observe matched tokens (non-trivial since a Query is only concerned if it matches a doc, not which tokens it matches at which positions), you would still have the major problem of which matches do you keep information for. It does not seem practical to save all of the information to highlight *any* doc after a search and it also seems unlikely that you would know which docs you wanted to highlight before the search. The only compromise that I can see is maybe just storing info to highlight the first n docs, but even here, while the scoring is occurring you do not yet know the return order. Also, there is probably little value in knowing which Tokens were matches for highlighting unless you have stored offsets as well.

Unless someone has any suggestions on how to accomplish this, I think time would be better spent improving the existing Highlighter framework.

Perhaps Ronnie's Highlighter should be added as an alternate Highlighter that is less feature rich but much faster on large docs. It looks to me like there is unlikely to be a faster Highlighting method for simple non-position aware highlighting.
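
To make "simple non-position aware highlighting" concrete, here is a minimal sketch in plain Java. It is only an illustration: the class name SimpleTermHighlighter and its highlight method are hypothetical, it is not Ronnie's Highlighter, and it does not use any Lucene API. It assumes the query has already been reduced to a set of lower-case terms and that words can be found with a simple regex rather than an Analyzer.

```java
import java.util.Locale;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of Lucene and not Ronnie's Highlighter.
final class SimpleTermHighlighter {
    // Assumption: a "token" is a run of word characters (no Analyzer involved).
    private static final Pattern WORD = Pattern.compile("\\w+");

    // Wraps every word whose lower-cased form is in queryTerms in <b> tags.
    // Positions and phrase adjacency are never consulted.
    static String highlight(String text, Set<String> queryTerms) {
        StringBuilder out = new StringBuilder();
        Matcher m = WORD.matcher(text);
        int last = 0;
        while (m.find()) {
            // Copy the text between the previous word and this one unchanged.
            out.append(text, last, m.start());
            String word = m.group();
            if (queryTerms.contains(word.toLowerCase(Locale.ROOT))) {
                out.append("<b>").append(word).append("</b>");
            } else {
                out.append(word);
            }
            last = m.end();
        }
        // Copy whatever follows the final word.
        out.append(text, last, text.length());
        return out.toString();
    }
}
```

With the terms "quick" and "fox", the text "The quick brown fox" comes back with both words marked even though they are not adjacent, which is exactly what a phrase or span query would not want. A single pass like this is cheap on large docs, but it cannot tell which occurrences actually contributed to the match.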


- Mark

