On Wed, Apr 13, 2011 at 9:51 AM, Clemens Wyss <[email protected]> wrote:
> What I really want to do is ignore German stop words such as "der", "die",
> "das", "ein", ...

GermanAnalyzer takes a stemExclusionSet; if you put those terms into this set,
the stemmer will not touch them. This should be in 3.1, I think:

    public GermanAnalyzer(Version matchVersion, Set<?> stopwords, Set<?> stemExclusionSet)

simon

>
>> -----Original Message-----
>> From: Robert Muir [mailto:[email protected]]
>> Sent: Tuesday, April 12, 2011 17:03
>> To: [email protected]
>> Subject: Re: German*Filter, Analyzer "cutting" off letters from (french)
>> words...
>>
>> On Tue, Apr 12, 2011 at 8:46 AM, Clemens Wyss <[email protected]>
>> wrote:
>> > Why so? Where have the e's gone?
>> >
>>
>> The e is being stemmed because it's a German suffix... all of the German
>> stemming algorithms remove final -e, as do all the French stemming algorithms.
>>
>> So I don't understand your problem.

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
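
To make the difference between the two sets in that constructor concrete, here is a small self-contained sketch. It is a toy model only, not Lucene code: the class StemExclusionSketch, the methods toyStem and analyze, and the sample words are all made up for illustration, and the "stemmer" does nothing but strip a single final -e (the suffix discussed in this thread). It assumes the analyzer lowercases terms and drops stop words before stemming, which may not match every detail of the real analysis chain.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

/** Toy model only: NOT Lucene code. All names here are hypothetical. */
public class StemExclusionSketch {

    /** Stand-in stemmer: strips one trailing 'e' from terms longer than one character. */
    public static String toyStem(String term) {
        return term.length() > 1 && term.endsWith("e")
                ? term.substring(0, term.length() - 1)
                : term;
    }

    /** Lowercases, drops stop words, and stems everything not in the exclusion set. */
    public static List<String> analyze(String text, Set<String> stopwords,
                                       Set<String> stemExclusions) {
        List<String> out = new ArrayList<>();
        for (String raw : text.split("\\s+")) {
            String term = raw.toLowerCase(Locale.ROOT);
            if (term.isEmpty() || stopwords.contains(term)) {
                continue; // stop words never reach the output
            }
            // Terms in the exclusion set bypass the stemmer and are kept as-is.
            out.add(stemExclusions.contains(term) ? term : toyStem(term));
        }
        return out;
    }

    public static void main(String[] args) {
        Set<String> stopwords = new HashSet<>(Arrays.asList("der", "die", "das", "ein"));
        Set<String> keep = new HashSet<>(Arrays.asList("carte"));
        // "die" and "ein" are dropped, "carte" is protected, "liste" loses its final e.
        System.out.println(analyze("die carte ein liste", stopwords, keep)); // prints [carte, list]
    }
}
```

In this model, a word in stopwords disappears from the index entirely, while a word in stemExclusions is still indexed but keeps its final -e. With the real GermanAnalyzer the two sets would be passed as the second and third constructor arguments shown above.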
