My file.encoding is set to Cp1252. Maybe this is the reason. However, its a good point replacing all the Umlaute Ä, ... with A, ... before indexing, such that people with non-Umlaut keyboards can search for them. I might do that.
Greetings, Philipp Daniel Naber <[EMAIL PROTECTED]> 28.02.2005 17:04 An [EMAIL PROTECTED] Kopie Thema Re: special character with lucene On Monday 28 February 2005 16:36, [EMAIL PROTECTED] wrote: > In a simple test I noticed that StandardAnalyzer removes special > characters like ä, ö, ... It doesn't do that on my system (configured for UTF-8). Are you sure the umlauts are okay when you feed them into Lucene? Regards Daniel -- Daniel Naber, IntraFind Software AG, Tel. 089-8906 9700