On Fri, Jan 8, 2010 at 15:01, Jamie <ja...@stimulussoft.com> wrote: > Hi There > > We are trying to search for the exact word "Lowe's" across a large set of > indexed data. Our results include everything with "low" in it. Thus, we are > receiving a much larger data set that we expected. The data is indexing > using the analyzer: > TokenStream result = new StandardTokenizer(reader); > result = new StandardFilter(result); > result = new LowerCaseFilter(result); > result = new StopFilter(result, stopTable); > result = new PorterStemFilter(result); PorterStemFilter is changing Lowe to low. Change your tokenizer so that Lowe's is tokenized as a single token, and that should avoid it.
Will --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org