On Apr 15, 2008, at 4:07 PM, robert engels wrote:

we use this for name searching, so that you might have

"engels r*"

the problem being that the phrase "engels r" is quite unique, but the term r* is not unique at all.

Does anything improve the handling of this in the current Lucene?


You could tokenize differently and index pairs together. Think searching for "engels_r*".
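
To make the pair idea concrete, here is a toy sketch in plain Python rather than Lucene code. The helper names (pair_terms, build_pair_index, prefix_search) and the sample names are made up for illustration; a real setup would do the pairing inside the analyzer at index time.

```python
from bisect import bisect_left


def pair_terms(tokens):
    """Join each adjacent token pair into a single term, e.g. 'engels_robert'."""
    return [f"{a}_{b}" for a, b in zip(tokens, tokens[1:])]


def build_pair_index(docs):
    """Map each pair term to the set of doc ids that contain it."""
    index = {}
    for doc_id, text in docs.items():
        for term in pair_terms(text.lower().split()):
            index.setdefault(term, set()).add(doc_id)
    return index


def prefix_search(index, prefix):
    """Collect docs for every pair term starting with prefix.

    The term dictionary is sorted, so only the contiguous run of terms
    sharing the prefix is visited, not every term that begins with 'r'.
    """
    terms = sorted(index)
    start = bisect_left(terms, prefix)
    hits = set()
    for term in terms[start:]:
        if not term.startswith(prefix):
            break
        hits |= index[term]
    return sorted(hits)


# Hypothetical sample data.
docs = {
    1: "engels robert",
    2: "engels richard",
    3: "smith robert",
    4: "engels karl",
}
index = build_pair_index(docs)
result = prefix_search(index, "engels_r")  # stands in for the query engels_r*
```

The point is that the prefix "engels_r" narrows the term range far more than "r" alone does, at the cost of a larger term dictionary.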

If that's not an option, make sure that those two are being joined via AND -- I think that will trigger the skipTo optimization when the posting lists for r* are scanned.
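
Roughly what I mean by the skipTo optimization, as a toy sketch in plain Python. This is not Lucene's scorer code: the Postings class and intersect function are made up for illustration, and the many r* terms are modeled as one merged doc-id list to keep it short.

```python
from bisect import bisect_left


class Postings:
    """A sorted doc-id list with a cursor and a skip_to operation."""

    def __init__(self, doc_ids):
        self.doc_ids = sorted(doc_ids)
        self.pos = 0

    def current(self):
        """Doc id under the cursor, or None when the list is exhausted."""
        return self.doc_ids[self.pos] if self.pos < len(self.doc_ids) else None

    def skip_to(self, target):
        """Advance the cursor to the first doc id >= target."""
        self.pos = bisect_left(self.doc_ids, target, self.pos)
        return self.current()


def intersect(rare, common):
    """AND two posting lists, letting each side skip ahead to the other's doc."""
    hits = []
    doc = rare.current()
    while doc is not None:
        other = common.skip_to(doc)
        if other is None:
            break
        if other == doc:
            hits.append(doc)
            doc = rare.skip_to(doc + 1)
        else:
            doc = rare.skip_to(other)
    return hits


# Hypothetical data: a rare term and a very common one.
engels = Postings([5, 902, 4000])
r_star = Postings(range(0, 5000, 5))  # 1000 docs matching some r* term
matches = intersect(engels, r_star)
```

With AND, the long r* list is only probed at the few doc ids where "engels" occurs, instead of being read from start to finish.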

HTH,

Marvin Humphrey
Rectangular Research
http://www.rectangular.com/

