On Fri, Jan 8, 2010 at 15:01, Jamie <ja...@stimulussoft.com> wrote:
> Hi There
>
> We are trying to search for the exact word "Lowe's" across a large set of
> indexed data. Our results include everything with "low" in it. Thus, we are
> receiving a much larger data set that we expected. The data is indexing
> using the analyzer:
>           TokenStream result = new StandardTokenizer(reader);
>           result = new StandardFilter(result);
>           result = new LowerCaseFilter(result);
>           result = new StopFilter(result, stopTable);
>           result = new PorterStemFilter(result);
PorterStemFilter is changing Lowe to low.  Change your tokenizer so
that Lowe's is tokenized as a single token, and that should avoid it.

Will

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to