Erick, Very useful answers -- I'll be reading up more with the links you've provided.
Thanks. Jeff -----Original Message----- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Saturday, November 11, 2006 5:51 PM To: java-user@lucene.apache.org Subject: Re: Partial Word Matches See below..... On 11/11/06, Storey, Jeff <[EMAIL PROTECTED]> wrote: > > I stand corrected -- I am NOT getting partial matches, I was extracting > partial matches from the text programmatically and thought that's what > was being returned. > > On another topic, regarding Boolean queries and wildcard queries, I have > two questions: > > It seems like when I enter the query "ball AND basket" it returns > different results than "ball and basket." Is there a way to make the > Boolean operators case insensitive? No. I've found this kind of odd too. Here's a snippet... Boolean operators allow terms to be combined through logic operators. Lucene supports AND, "+", OR, NOT and "-" as Boolean operators(Note: Boolean operators must be ALL CAPS). from http://sewm.pku.edu.cn/src/clucene/doc/queryparsersyntax.html As for wildcard searches, one of the things I attempt to do is pick out > the words that caused a particular file to be returned. However, when I > search for the term "yellow~" I might get something like "bellow." Is > there a way to list what Lucene found in the document that made it > relevant? Well, first ~ isn't a wildcard, it's a "fuzzy search" (aka similar terms). So getting "bellow" for "yellow~" is expected. Although, somewhat confusingly, "lemon orange"~10 is a proximity search. * and ? are the wildcard characters. Searcher.explain() is probably your friend, although I haven't used it much, I've certainly seen it mentioned enough...... If you haven't, get a copy of Luke (google lucene luke). It's a program that allows you to explore your indexes, explain queries, explore the effects of different analyzers, etc. Really, really, really get a copy <G>.... Thanks for all the help. > > Jeff > > -----Original Message----- > From: Paul Borgermans [mailto:[EMAIL PROTECTED] > Sent: Saturday, November 11, 2006 3:06 PM > To: java-user@lucene.apache.org > Subject: Re: Partial Word Matches > > Indeed, the only way this can happen as far as I know Lucene is by using > a > stemmer during indexing, the standard analyzer won't result in such > behaviour. > > hth > > Paul > > On 11/11/06, Erick Erickson <[EMAIL PROTECTED]> wrote: > > > > That's not the default behavior, so I'm perplexed. Normally, you have > to > > go > > to considerable effort to get partial matches.... > > > > What analyzers are you using at both index and query time? Perhaps as > > short > > a code snippet as you could make showing this behavior would be a good > > thing > > to post. I flat guarantee folks will look at it. But please make it > short > > <G>. > > > > Best > > ERick > > > > On 11/11/06, Storey, Jeff <[EMAIL PROTECTED]> wrote: > > > > > > Hi. I'm using Lucene to do some searching (using the Searcher object > and > > > passing it a ParsedQuery). I search for a word such as "long" and it > is > > > returning partial matches, such as "belong" and "along." Is there a > way > > > to turn off this behavior and only match whole words? > > > > > > > > > > > > Thank you, > > > > > > Jeff > > > > > > > > > > > > > > > > > > > > > > -- > http://walhalla.wordpress.com > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]