My goal is to be able to get meaningful results from search queries that
include some words that are on the default stop words list, especially
"not".  I am using the StandardAnalyzer and I have tried passing in null and
an empty set for the set of stop words to use in the constructor hoping that
no words would be stripped but I am getting strange results.

If I enter a query of just the word "not" I get no matches.  If I run a
query with just the word "included" I get lots of matches.  If I run the
query "not included" (without surrounding quotation marks) I get lots of
matches and the highlighter indicates that "not" is one of the matching
fragments.  But if I run the query ""not included"" (with surrounding
quotation marks) I get no matches even though there are many occurrences in
the content of that exact phrase which were matched when I entered the same
query without the quotation marks.

What's going on here?  Why can't I search for the word "not" by itself or in
a quote?  Similar behaviour happens for other words like "the" but I am
explicitly telling the analyzer not to remove any words (or so I believe). 
How can I achieve a StandardAnalyzer where every word in the query is
significant?

Thanks,

-sbs

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Strange-StopFilter-and-stop-words-behaviour-tp3199367p3199367.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to