Moving to lucene-user list. One of my Lucene articles includes a more comprehensive stop word list for English:
http://www.onjava.com/pub/a/onjava/2003/01/15/lucene.html?page=2#references Otis --- [EMAIL PROTECTED] wrote: > Dear all, > > for my taste the stopwords included in Lucene (e.g. > StopAnalyzer.ENGLISH_STOP_WORDS, wich is usually used > with the SnowballAnalyzer - and I guess also with the > StandardAnalyzer) is not strict enough: > > For example in a sentence with "we need ..." I would > consider "we" and "need" as stopwords but they are not > stripped by SnowballAnalyzer or StandardAnalyzer. > > Now: > Is there an in-built solution to use more restrictive > stripping or do I better create my own analyzer in that > case with a more restrictive stopword list ? > > If so - are you aware of more rigid lists ? (a URI > would be great !) > > Thanks, > > Holger > > ___________________________________________________ > The ALL NEW CS2000 from CompuServe > Better! Faster! More Powerful! > 250 FREE hours! Sign-on Now! > http://www.compuserve.com/trycsrv/cs2000/webmail/ > > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > ka --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
