Re: FuzzyQuery- why is it ignored?

2019-06-25 Thread baris . kazar
accept my apologies for not confirming the sent address. i will try 8.1 tomorrow on my env. baris - Original Message - From: tomoko.uchida.1...@gmail.com To: java-user@lucene.apache.org Sent: Saturday, June 22, 2019 10:35:26 PM GMT -05:00 US/Canada Eastern Subject: Re: FuzzyQuery- why is it

Re: FuzzyQuery- why is it ignored?

2019-06-22 Thread Baris Kazar
my env. baris - Original Message - From: tomoko.uchida.1...@gmail.com To: java-user@lucene.apache.org Sent: Saturday, June 22, 2019 10:35:26 PM GMT -05:00 US/Canada Eastern Subject: Re: FuzzyQuery- why is it ignored? Please send messages to java-user mail list only. It is not recommended

Re: FuzzyQuery- why is it ignored?

2019-06-22 Thread Tomoko Uchida
gt; baris > > - Original Message - > From: tomoko.uchida.1...@gmail.com > To: baris.ka...@oracle.com > Sent: Saturday, June 22, 2019 9:14:22 PM GMT -05:00 US/Canada Eastern > Subject: Re: FuzzyQuery- why is it ignored? > > > If You could index these entries and stil

Re: FuzzyQuery- why is it ignored?

2019-06-22 Thread Baris Kazar
s a bug in Lucene 6.6. Best regards - Original Message - From: baris.ka...@oracle.com To: java-user@lucene.apache.org, tomoko.uchida.1...@gmail.com, erickerick...@gmail.com, a...@linux.com, baris.ka...@oracle.com, luc...@mikemccandless.com Sent: Thursday, June 13, 2019 10:49:05 AM GMT -05:00 US/C

Re: FuzzyQuery- why is it ignored?

2019-06-19 Thread baris . kazar
thanks for confirming. StandardAnalyzer does not do stemming. So, MAINS case has some other reason. Best regards - Original Message - From: tomoko.uchida.1...@gmail.com To: java-user@lucene.apache.org Sent: Sunday, June 16, 2019 4:39:29 AM GMT -05:00 US/Canada Eastern Subject: Re: FuzzyQuery

Re: FuzzyQuery- why is it ignored?

2019-06-16 Thread Baris Kazar
16, 2019 4:39:29 AM GMT -05:00 US/Canada Eastern Subject: Re: FuzzyQuery- why is it ignored? Hi, you said you are using standard analyzer. If so, you are not using any stemmer at all (please see the analyzer's Javadocs). 2019年6月16日(日) 11:43 Baris Kazar : > > Hello,- > Erick ex

Re: FuzzyQuery- why is it ignored?

2019-06-16 Thread Tomoko Uchida
ck...@gmail.com, a...@linux.com, baris.ka...@oracle.com, > luc...@mikemccandless.com > Sent: Thursday, June 13, 2019 10:48:47 AM GMT -05:00 US/Canada Eastern > Subject: Re: FuzzyQuery- why is it ignored? > > i see, i am using an older version 6.6 and we should switch to Your 8.1 &g

Re: FuzzyQuery- why is it ignored?

2019-06-15 Thread Baris Kazar
To: java-user@lucene.apache.org, tomoko.uchida.1...@gmail.com, erickerick...@gmail.com, a...@linux.com, baris.ka...@oracle.com, luc...@mikemccandless.com Sent: Thursday, June 13, 2019 10:48:47 AM GMT -05:00 US/Canada Eastern Subject: Re: FuzzyQuery- why is it ignored? i see, i am using an older

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
i see, i am using an older version 6.6 and we should switch to Your 8.1 version of at least 7.X. Tomoko i think i understood You meant MAIN NASHUA for the string :) Again i really appreciate all answers. How do we disable or enable stemming while indexing? :) another question. Best regar

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread Tomoko Uchida
Sorry, I made a mistake when copypasting. Let me just correct my previous mail. > 1. Indexed this text: "NASHUA NASHUA HILLSBOROUGH NEW HAMPSHIRE UNITED > STATES". 1. Indexed this text: "MAIN DUNSTABLE NASHUA HILLSBOROUGH NEW HAMPSHIRE UNITED STATES" As far as I can say, this query correct

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
However, the index does not have MAINS but MAIN for the expected entry. Best regards On 6/13/19 10:33 AM, baris.ka...@oracle.com wrote: does it consider it as like plural word? :) :) :) That makes sense. Best regards On 6/13/19 10:31 AM, baris.ka...@oracle.com wrote: Erick, Cool, could Y

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
does it consider it as like plural word? :) :) :) That makes sense. Best regards On 6/13/19 10:31 AM, baris.ka...@oracle.com wrote: Erick, Cool, could You give a simple example with my example please? Best regards On 6/13/19 10:12 AM, Erick Erickson wrote: Shot in the dark: stemming. When

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
Erick, Cool, could You give a simple example with my example please? Best regards On 6/13/19 10:12 AM, Erick Erickson wrote: Shot in the dark: stemming. Whenever I see a problem with something ending in “s” (or “er” or “ing” or….) my first suspect is that stemming is turned on. In that cas

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread Erick Erickson
Shot in the dark: stemming. Whenever I see a problem with something ending in “s” (or “er” or “ing” or….) my first suspect is that stemming is turned on. In that case the token in the index that’s actually searched on is somewhat different than you expect. The test is easy, just insure your fie

Re: FuzzyQuery- why is it ignored?

2019-06-13 Thread baris . kazar
Tomoko,-  That is strange indeed. Something is wrong when i use mains but maink, mainl, mainr,mainq, maint all work ok any consonant at the end except s works in this case. Case #3 had +contentDFLT:mains~2 but not +contentDFLT:"mains~2". i am using fuzzy query with ~ from Query.builder and t

Re: FuzzyQuery- why is it ignored?

2019-06-12 Thread Tomoko Uchida
Hi, > Ok, i think only this very specific only "mains" has an issue. It looks strange to me. I did some test locally. 1. Indexed this text: "NASHUA NASHUA HILLSBOROUGH NEW HAMPSHIRE UNITED STATES". 2a. This query string (just copied from your Case #3) worked correctly for me as far as I can see

Re: FuzzyQuery- why is it ignored?

2019-06-12 Thread baris . kazar
Ok, i think only this very specific only "mains" has an issue. all i knew about Lucene was fine :) Great... i have one more question: which one is advised to use: FuzzyQuery or the Query.parser with search string~ appended? The second one will go through analyzer and make search string lower

Re: FuzzyQuery- why is it ignored?

2019-06-12 Thread baris . kazar
Hi again,- this is really interesting and i hope i am missing something. Index small cases all entries so case sensitivity is not an issue i think. Case #1: org.apache.lucene.queryparser.classic.QueryParser parser = new org.apache.lucene.queryparser.classic.QueryParser(field, phraseAnalyzer)

Re: FuzzyQuery

2019-06-12 Thread baris . kazar
Tomoko,-  Thank You for Your suggestions. i am trying to understand it and i thought i did :) but it does not work with FuzzyQuery when i used with a *single* large TextField like street=...value... city=...value... region=...value... country=...value... (with or without quotes for the value

Re: FuzzyQuery

2019-06-11 Thread Tomoko Uchida
I'd suggest to correctly understand the way a software works before suspecting its bug :-) I guess you may miss two points: 1. the standard analyzer (standard tokenizer) breaks words by double quote (U+0022) so quotes are not indexed or searched at all if you are using standard analyzer. (That is

Re: FuzzyQuery

2019-06-11 Thread baris . kazar
i can say that quotes is not the issue with index as it still results in same results with quotes or without quotes. i am starting to feel that this might be a bug maybe?? Best regards On 6/10/19 2:46 PM, baris.ka...@oracle.com wrote: Somehow " is causing an issue as this should return stree

Re: FuzzyQuery

2019-06-10 Thread Atri Sharma
I would suggest using a QueryParser for your fuzzy query before adding it to the Boolean query. This should weed out any case issues. On Mon, 10 Jun 2019 at 8:06 PM, wrote: > BooleanQuery.Builder booleanQuery = new BooleanQuery.Builder(); > > //First set > > booleanQuery.add(new FuzzyQue

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
Somehow " is causing an issue as this should return street with MAIN: [contentDFLT:street="MAINS"~2, +contentDFLT:"city nashua", +contentDFLT:"region new-hampshire", +contentDFLT:"country united states"] Best regards On 6/10/19 2:24 PM, baris.ka...@oracle.com wrote: [+contentDFLT:"city nashu

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
[+contentDFLT:"city nashua", +contentDFLT:"region new-hampshire", +contentDFLT:"country united states", contentDFLT:street contentDFLT:mains] QueeryParser chops it into two pieces from parser.parser("street=\"MAINS\""); Index has a TextField named contentDFLT the following data : street="MAIN

Re: FuzzyQuery

2019-06-10 Thread Tomoko Uchida
Or, " (double quotation) in your query string may affect query parsing. When I parse this string by classic query parser (lucene 8.1), street="MAINS~" parsed (raw) query is text:street text:mains (I set the default search field to "text", so text: is appeared here.) Query parsing is a complex

Re: FuzzyQuery

2019-06-10 Thread Tomoko Uchida
Hi, I noticed one small thing in your previous mail. > when i use q1 = parser.parse("street=\"MAIN\""); i get same results which is good. To specify a search field, ":" (colon) should be used instead of "=". See the query parser documentation: http://lucene.apache.org/core/8_1_0/queryparser/org/

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
booleanQuery.add(Utils.createPhraseQuery(phraseAnalyzer, field, "city=\"NASHUA\""), BooleanClause.Occur.MUST);     booleanQuery.add(Utils.createPhraseQuery(phraseAnalyzer, field, "region=\"NEW HAMPSHIRE\""), BooleanClause.Occur.MUST);     booleanQuery.add(Utils.createPhraseQuery(phraseAna

Re: FuzzyQuery

2019-06-10 Thread Tomoko Uchida
Hi, just for the basic verification, can you find the document without fuzzy query? I mean, does this query work for you? Query query = parser.parse("MAIN"); Tomoko 2019年6月11日(火) 0:22 : > > why cant the second set not work at all? > > it is indexed as Textfield like street="..." city="..." etc.

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
why cant the second set not work at all? it is indexed as Textfield like street="..." city="..." etc. Best regards On 6/10/19 11:23 AM, baris.ka...@oracle.com wrote: i dont know how to use Fuzzyquery with queryparser but probably You are suggesting QueryParser parser = new QueryParser(fiel

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
i dont know how to use Fuzzyquery with queryparser but probably You are suggesting QueryParser parser = new QueryParser(field, analyzer) ; Query query = parser.parse("MAINS~2"); booleanQuery.add(query, BooleanClause.Occur.SHOULD); am i right? Best regards On 6/10/19 10:47 AM, Atri Sharma wro

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
i am using standardanalyzer. Best regards On 6/9/19 11:22 AM, Tomoko Uchida wrote: Hi, What analyzer do you use for the text field? Is the term "Main" correctly indexed? 2019年6月8日(土) 9:13 Baris Kazar : Hi,- i cant get FuzzyQuery working for searching with a query like Mains~2 to find the

Re: FuzzyQuery

2019-06-10 Thread baris . kazar
How do i check how it is indexed? lowecase or uppercase? only way is now to by testing. i am using standardanalyzer. Best regards On 6/9/19 11:57 AM, Atri Sharma wrote: On Sun, Jun 9, 2019 at 8:53 PM Tomoko Uchida wrote: Hi, What analyzer do you use for the text field? Is the term "Main"

Re: FuzzyQuery

2019-06-09 Thread Atri Sharma
On Sun, Jun 9, 2019 at 8:53 PM Tomoko Uchida wrote: > > Hi, > > What analyzer do you use for the text field? Is the term "Main" > correctly indexed? Agreed. Also, it would be good if you could post your actual code. What analyzer are you using? If you are using StandardAnalyzer, then all of your

Re: FuzzyQuery

2019-06-09 Thread Tomoko Uchida
Hi, What analyzer do you use for the text field? Is the term "Main" correctly indexed? 2019年6月8日(土) 9:13 Baris Kazar : > > Hi,- > i cant get FuzzyQuery working for searching with a query like Mains~2 to > find the word Main in a TextField. > Any suggestions please? > > -

Re: FuzzyQuery on entire set of terms

2016-10-21 Thread Michael McCandless
You mean the total number of edits between those strings must be <= 2? If so, you must index the entire "Lucene Apache Group" as a single token, and likewise do a FuzzyQuery with the entire "Luceni Apachi Group", etc. If instead you do tokenize and use BooleanQuery to combine them, then that allo

Re: FuzzyQuery with short words

2013-09-12 Thread Michael McCandless
oc update. Again, apologies to all if I missed this > point in the documentation. > > Best, > > Tim > > -Original Message- > From: Michael McCandless [mailto:luc...@mikemccandless.com] > Sent: Thursday, September 12, 2013 7:19 AM > To: Lucene Users > Subje

RE: FuzzyQuery with short words

2013-09-12 Thread Allison, Timothy B.
] Sent: Thursday, September 12, 2013 7:19 AM To: Lucene Users Subject: Re: FuzzyQuery with short words I think so? The default prefixLength is 0 ... Just make sure you're not hitting the maxExpansions limit (default is 50). Ie, if your index has many other terms within edit distance 2 o

Re: FuzzyQuery with short words

2013-09-12 Thread Michael McCandless
I think so? The default prefixLength is 0 ... Just make sure you're not hitting the maxExpansions limit (default is 50). Ie, if your index has many other terms within edit distance 2 of "ab" or edit distance 1 of "x" then then may cause your example "abcd" to rank below the top 50, and be pruned

Re: FuzzyQuery in lucene 4.0

2013-01-09 Thread Jack Krupansky
FWIW, new FuzzyQuery(term, 2 ,0) is the same as new FuzzyQuery(term), given the current values of defaultMaxEdits (2) and defaultPrefixLength (0). -- Jack Krupansky -Original Message- From: Ian Lea Sent: Wednesday, January 09, 2013 9:44 AM To: java-user@lucene.apache.org Subject: Re

Re: FuzzyQuery in lucene 4.0

2013-01-09 Thread Ian Lea
See the javadocs for FuzzyQuery to see what the parameters are. I can't tell you what the comment means. Possible values to try maybe? -- Ian. On Wed, Jan 9, 2013 at 2:34 PM, algebra wrote: > is true Ian, o code is good. > > The only thing that I dont understand is a line: > > Query query =

Re: FuzzyQuery in lucene 4.0

2013-01-09 Thread algebra
is true Ian, o code is good. The only thing that I dont understand is a line: Query query = new FuzzyQuery(term, 2 ,0); //0-2 Whats means 0 to 2? -- View this message in context: http://lucene.472066.n3.nabble.com/FuzzyQuery-in-lucene-4-0-tp4031871p4031879.html Sent from the Lucene - Java Us

Re: FuzzyQuery in lucene 4.0

2013-01-09 Thread Ian Lea
What adjustments did you make? One of them might be to blame. But at a glance the code looks fine to me. In what way is it not working? Care to provide any input/output/details of what does/doesn't work? -- Ian. On Wed, Jan 9, 2013 at 2:03 PM, algebra wrote: > I was using lucene 3.6 and my

Re: FuzzyQuery - rounding bug?

2007-12-17 Thread Erick Erickson
Please do not highack the thread. When starting a new topic, do NOT use "reply to", start an entirely new e-mail. Otherwise your topic often gets ignored by people who are uninterested in the original thread. Best Erick On Dec 17, 2007 5:57 AM, anjana m <[EMAIL PROTECTED]> wrote: > how to i use

RE: FuzzyQuery + QueryParser - I'm puzzled

2007-12-17 Thread Steven A Rowe
Hi anjana m, You're going to have lots of trouble getting a response, for two reasons: 1. You are replying to an existing thread and changing the subject. Don't do that. When you have a question, start a new thread by creating a new email instead of replying. 2. You are not telling the list

Re: FuzzyQuery - prefixLength - use with QueryParser?

2007-12-17 Thread Erik Hatcher
On Dec 17, 2007, at 3:31 AM, Helmut Jarausch wrote: FuzzyQuery (in the 2.2.0 API) may take 3 arguments, term , minimumSimilarity and prefixLength Is there any syntax to specify the 3rd argument in a query term for QueryParser? (I haven't found any the current docs) No, there isn't. But you

Re: FuzzyQuery + QueryParser - I'm puzzled

2007-12-17 Thread anjana m
hey i amnot bale to comple packages are not found.. i download..the luncene package.. help me.. .lucene.search.Hits; import org.apache.lucene.search.Query; import org.apache.lucene.document.Field; import org.apache.lucene.search.Searcher; import org.apache.lucene.index.IndexWriter; import org.apach

Re: FuzzyQuery + QueryParser - I'm puzzled

2007-12-17 Thread Doron Cohen
See in Lucene FAQ: "Are Wildcard, Prefix, and Fuzzy queries case sensitive?" On Dec 17, 2007 11:27 AM, Helmut Jarausch <[EMAIL PROTECTED]> wrote: > Hi, > > please help I am totally puzzled. > > The same query, once with a direct call to FuzzyQuery > succeeds while the same query with QueryParse

Re: FuzzyQuery - rounding bug?

2007-12-17 Thread anjana m
how to i use lucene search to serach files of the local system On Dec 17, 2007 2:11 PM, Helmut Jarausch <[EMAIL PROTECTED]> wrote: > Hi, > > according to the LiA book the FuzzyQuery distance is computed as > > 1- distance / min(textlen,targetlen) > > Given > def addDoc(text, writer): >doc = D