Yonik Seeley wrote:
+(+text:jame +text:sutherland) +searchSet:testSet
+(+text:james~0.75 +text:sutherland~0.75) +searchSet:testSet

I can tell from the first that this is a stemmed field... "james" is
transformed to "jame"
"James" being the plural of "Jame" according to the stemmer. I guess my mind hadn't run in that direction. :)

I guess I wasn't expecting the fuzzy query logic to bypass the stemming. Would it be correct that if I were to add "james" to the protwords.txt file that this *specific* problem would go away? Obviously there are a significant quantity of proper names where this would have an impact, so a more generic solution is preferable.

So, you could
- index the field twice using copyField, and then do fuzzy queries on
the non-stemmed version. [plus two other good suggestions]
As I look at the field types in the example schema would you recommend something like text_lu without the EnglishPorterFilterFactory, or are there other issues I'm overlooking.

Walter Lewis
(aka Walt Lewi apparently)

Reply via email to