Re: Best fuzzy match on multiple terms

2019-06-13 Thread Namgyu Kim
Dear Matthias, First you need to know about Lucene's ranking concept. Lucene's default ranking is BM25, and it depends on the state of your index (https://en.wikipedia.org/wiki/Okapi_BM25). There can be many reasons. One thing I can guess is that your index has a lot of 'rozi' terms, so it is getting
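To illustrate why BM25 scores depend on the index contents, here is a minimal sketch of the IDF part of BM25 in the form ln(1 + (N - n + 0.5) / (n + 0.5)). The function name and the document counts are made up for illustration, and the sketch leaves out the term-frequency and length-normalization parts of the full score.

```python
import math

def bm25_idf(doc_count: int, doc_freq: int) -> float:
    """IDF part of BM25: ln(1 + (N - n + 0.5) / (n + 0.5)).

    doc_count (N) is the number of documents in the index,
    doc_freq (n) is the number of documents containing the term.
    """
    return math.log(1 + (doc_count - doc_freq + 0.5) / (doc_freq + 0.5))

# Hypothetical index of 10,000 entries (numbers are assumptions).
rare_term_idf = bm25_idf(10_000, 3)        # term found in 3 entries
common_term_idf = bm25_idf(10_000, 4_000)  # term found in 4,000 entries

# A term that occurs in many entries gets a smaller weight.
print(rare_term_idf > common_term_idf)
```

So the same query term can contribute very differently to the score depending on how often it occurs across the indexed list.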

Re: Best fuzzy match on multiple terms

2019-06-13 Thread baris . kazar
I would suggest trying (indexing and searching) without the 's and seeing whether you can find it first. Thanks On 6/13/19 11:25 AM, Matthias Müller wrote: I am currently matching botanic names (with possible mis-spellings) against an indexed referenced list with Lucene. After quick progress in

Best fuzzy match on multiple terms

2019-06-13 Thread Matthias Müller
I am currently matching botanic names (with possible mis-spellings) against an indexed referenced list with Lucene. After quick progress in the beginning, I am struggling with the proper query design to achieve the ranking result I want. Here is an example: Search term: Acer campestre 'Rozi'

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
I see. I am using an older version, 6.6, and we should switch to your 8.1 version or at least 7.x. Tomoko, I think I understood: you meant MAIN NASHUA for the string :) Again, I really appreciate all the answers. Another question: how do we disable or enable stemming while indexing? :) Best

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread Tomoko Uchida
Sorry, I made a mistake when copy-pasting. Let me just correct my previous mail. > 1. Indexed this text: "NASHUA NASHUA HILLSBOROUGH NEW HAMPSHIRE UNITED STATES". 1. Indexed this text: "MAIN DUNSTABLE NASHUA HILLSBOROUGH NEW HAMPSHIRE UNITED STATES" As far as I can tell, this query

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
However, the index does not have MAINS but MAIN for the expected entry. Best regards On 6/13/19 10:33 AM, baris.ka...@oracle.com wrote: Does it treat it as a plural word? :) :) :) That makes sense. Best regards On 6/13/19 10:31 AM, baris.ka...@oracle.com wrote: Erick, Cool, could

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
Does it treat it as a plural word? :) :) :) That makes sense. Best regards On 6/13/19 10:31 AM, baris.ka...@oracle.com wrote: Erick, Cool, could You give a simple example with my example please? Best regards On 6/13/19 10:12 AM, Erick Erickson wrote: Shot in the dark: stemming.

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
Erick, Cool, could You give a simple example with my example please? Best regards On 6/13/19 10:12 AM, Erick Erickson wrote: Shot in the dark: stemming. Whenever I see a problem with something ending in “s” (or “er” or “ing” or….) my first suspect is that stemming is turned on. In that

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread Erick Erickson
Shot in the dark: stemming. Whenever I see a problem with something ending in “s” (or “er” or “ing” or….) my first suspect is that stemming is turned on. In that case the token in the index that’s actually searched on is somewhat different from what you expect. The test is easy: just ensure your
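As a rough illustration of the stemming point, the sketch below uses a toy suffix stripper to show how the token stored in an index can differ from the word in the source text. The `toy_stem` helper and the sample text are hypothetical; this is not the algorithm used by any Lucene analyzer.

```python
def toy_stem(token: str) -> str:
    """Toy stemmer (an assumption for illustration, not Lucene's):
    lowercase the token and strip one trailing 's', except after 'ss'."""
    token = token.lower()
    if len(token) > 3 and token.endswith("s") and not token.endswith("ss"):
        return token[:-1]
    return token

# Hypothetical source text; a stemming analyzer would store the
# stemmed tokens, not the surface forms.
text = "MAINS DUNSTABLE NASHUA"
indexed_tokens = [toy_stem(t) for t in text.split()]

print(indexed_tokens)
print("mains" in indexed_tokens)  # the surface form is not what got indexed
```

If stemming is applied at index time but the query term is matched as typed, the two sides compare different strings, which is why a trailing "s" is a common first suspect.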

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
Tomoko, that is strange indeed. Something is wrong when I use mains, but maink, mainl, mainr, mainq, and maint all work OK; any consonant at the end except s works in this case. Case #3 had +contentDFLT:mains~2 but not +contentDFLT:"mains~2". I am using a fuzzy query with ~ from Query.builder and