It seems you are asking whether to remove accents before or after stemming. Here is a discussion of a similar question (for Spanish):
http://www.nabble.com/Snowball-and-accents-filter...--tf3653720.html#a10207399

Hope this helps,
Doron

Christian Aschoff <[EMAIL PROTECTED]> wrote on 22/11/2007 21:27:20:

> Hello,
>
> as a prologue: I have no problems and everything works the way I want :-)
>
> I am more interested in a tip on whether I am using the right approach or
> pattern. I want to strip accents before data goes into my index, so I wrote
> the code below. I did not find an example of where to place
> a filter (for indexing) with Google, so this is my guess at how to do
> it.
>
> My question is: Is this the correct pattern for using a filter,
> or where should it be placed?
>
> Thank you in advance for any comments,
> Christian

...
> private static final SnowballAnalyzer analyzer = new
> SnowballAnalyzer("German", GermanAnalyzer.GERMAN_STOP_WORDS);
...
> public TokenStream tokenStream(String fieldName, Reader reader) {
> return new UTF8AccentFilter(analyzer.tokenStream(fieldName,
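
To illustrate why the order of accent removal and stemming can matter, here is a minimal sketch that does not use Lucene at all. It assumes accents are stripped via Unicode decomposition with java.text.Normalizer, and it uses a hypothetical one-rule toy stemmer (not Snowball) whose only rule is keyed on an accented suffix. The class and method names are made up for this example; UTF8AccentFilter from the quoted code is the poster's own class and is not reproduced here.

```java
import java.text.Normalizer;

public class AccentOrderSketch {

    // Decompose to NFD, then drop combining marks, so "\u00e4" becomes "a".
    // Letters without a decomposition (for example "\u00df") are left unchanged.
    static String stripAccents(String token) {
        String decomposed = Normalizer.normalize(token, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    // Hypothetical toy stemmer, NOT Snowball: a single rule that removes
    // the accented suffix "aci\u00f3n" when the token ends with it.
    static String toyStem(String token) {
        String suffix = "aci\u00f3n";
        return token.endsWith(suffix)
                ? token.substring(0, token.length() - suffix.length())
                : token;
    }

    public static void main(String[] args) {
        String word = "comparaci\u00f3n";
        // Stem first, then strip accents: the accented rule still matches.
        System.out.println(stripAccents(toyStem(word)));
        // Strip accents first, then stem: the rule no longer matches.
        System.out.println(toyStem(stripAccents(word)));
    }
}
```

With this toy rule the first call prints "compar" and the second prints "comparacion". A stemmer whose rules mention accented characters may therefore see different input depending on which side of it the accent filter sits. Whether that matters for a given real stemmer is something to check against that stemmer's own rules.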