Please correct me if I am wrong, but I think the prefix query first
enumerates the matching terms and collects them in an array.
So if you have a numeric field, searching for 1* could match a huge
number of terms and cause an OOM.
That makes me think, though: in most cases the number of unique names
that start with r should be relatively small, not enough to cause an
OOM, so I need to investigate further what is happening.
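To make the concern concrete, here is a rough sketch of prefix expansion over a sorted term list. It is a toy model in Python, not Lucene code: expand_prefix, TooManyTerms and the max_terms cap are hypothetical names and values chosen for illustration, and the sample terms are made up.

```python
from bisect import bisect_left


class TooManyTerms(Exception):
    """Hypothetical error raised when a prefix expands past the cap."""


def expand_prefix(sorted_terms, prefix, max_terms=1024):
    """Return every term starting with `prefix` from a sorted term list."""
    matches = []
    # Matching terms are contiguous in sorted order, starting here.
    i = bisect_left(sorted_terms, prefix)
    while i < len(sorted_terms) and sorted_terms[i].startswith(prefix):
        matches.append(sorted_terms[i])
        if len(matches) > max_terms:
            raise TooManyTerms(f"{prefix}* matches more than {max_terms} terms")
        i += 1
    return matches


# Made-up data: a handful of names versus 100,000 numeric terms.
names = sorted(["engels", "rachel", "rob", "robert", "smith"])
numbers = sorted(str(n) for n in range(100000, 200000))

print(expand_prefix(names, "r"))  # ['rachel', 'rob', 'robert']
try:
    expand_prefix(numbers, "1")
except TooManyTerms as err:
    print("gave up:", err)
```

In this model r* on a name field collects only a few terms, while 1* on the numeric field hits the cap almost immediately, which is the kind of blow-up I was worried about.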
On Apr 15, 2008, at 9:05 PM, Marvin Humphrey wrote:
On Apr 15, 2008, at 4:07 PM, robert engels wrote:
we use this for name searching, so that you might have
"engels r*"
the problem being that the phrase "engels r" is quite unique, but
the term r* is not unique at all.
Has anything improved the handling of this in current Lucene?
You could tokenize differently and index pairs together. Think of
searching for "engels_r*".
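A minimal sketch of the pair-indexing idea, assuming that "pairs" means adjacent words joined with an underscore. This is a toy in-memory index in Python, not Lucene analyzer code; pair_tokens, build_pair_index, prefix_search and the sample names are all hypothetical.

```python
def pair_tokens(text):
    """Join each pair of adjacent words into one token, e.g. 'engels_robert'."""
    words = text.lower().split()
    return [f"{a}_{b}" for a, b in zip(words, words[1:])]


def build_pair_index(docs):
    """Map each pair token to the set of document ids that contain it."""
    index = {}
    for doc_id, text in docs.items():
        for token in pair_tokens(text):
            index.setdefault(token, set()).add(doc_id)
    return index


def prefix_search(index, prefix):
    """Union the documents of every pair token that starts with `prefix`."""
    hits = set()
    for token, doc_ids in index.items():
        if token.startswith(prefix):
            hits |= doc_ids
    return hits


# Made-up documents for illustration.
docs = {
    1: "Engels Robert",
    2: "Engels Friedrich",
    3: "Smith Rachel",
    4: "Jones Roger",
}
index = build_pair_index(docs)
print(prefix_search(index, "engels_r"))  # {1}
```

The prefix now only expands over terms that begin with "engels_r", so the many other names starting with r never enter the expansion. The cost is a larger term dictionary, since every adjacent pair becomes its own term.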
If that's not an option, make sure that those two are being joined
via AND -- I think that will trigger the skipTo optimization when
the posting lists for r* are scanned.
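To illustrate why the AND matters, here is a rough model of a skipTo-style intersection. It is a Python sketch, not Lucene's scorer code: PostingList, skip_to and intersect are hypothetical names, and the postings for r* are simplified to one merged sorted list of document ids.

```python
from bisect import bisect_left


class PostingList:
    """Sorted document ids with a forward-only cursor."""

    def __init__(self, doc_ids):
        self.doc_ids = sorted(doc_ids)
        self.pos = 0
        self.skips = 0  # how many times skip_to was called

    def skip_to(self, target):
        """Advance to the first doc id >= target; return it, or None at the end."""
        self.pos = bisect_left(self.doc_ids, target, self.pos)
        self.skips += 1
        if self.pos < len(self.doc_ids):
            return self.doc_ids[self.pos]
        return None


def intersect(rare, common):
    """AND two posting lists, driven by the rare one."""
    hits = []
    for doc in rare.doc_ids:
        found = common.skip_to(doc)
        if found is None:
            break  # the common list is exhausted
        if found == doc:
            hits.append(doc)
    return hits


# Made-up postings: a rare term and a very common one.
rare = PostingList([5, 900, 40000])
common = PostingList(range(0, 100000, 3))
print(intersect(rare, common))  # [900]
print(common.skips)             # 3
```

The common list is probed once per document in the rare list instead of being scanned from start to finish, so the work tracks the selective clause rather than the unselective one.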
HTH,
Marvin Humphrey
Rectangular Research
http://www.rectangular.com/
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]