I think you'll want to write your own Analyzer + Tokenizer, detect tokens with umlauts, and then emit two tokens at the same position (think of them as synonyms), one being the original one with the umlaut, and the other one with the umlaut transformed according to the rules (e.g. ü -> ue). Hm, I wonder if GermanAnalyzer already does this... maybe, have a look.
Otis ----- Original Message ---- From: Supriya Kumar Shyamal <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Tuesday, June 20, 2006 8:09:18 AM Subject: How to search for europian word with and without special characters Hi All, I have a question regarding the indexing and searching for german characters. For eg. when I search for the word "müller" also I want to search for the word "mueller". How to achieve this in lucene. Thanks, supriya -- Mit freundlichen Grüßen / Regards Supriya Kumar Shyamal Software Developer tel +49 (30) 443 50 99 -22 fax +49 (30) 443 50 99 -99 email [EMAIL PROTECTED] ___________________________ artnology GmbH Milastr. 4 10437 Berlin ___________________________ http://www.artnology.com __________________________________________________________________________ News / Aktuelle Projekte: * artnology gewinnt Ausschreibung des Bundesministeriums des Innern: Softwarelösung für die Verwaltung der Sammlung zeitgenössischer Kunstwerke zur kulturellen Repräsentation des Bundes. Projektreferenzen: * Globaler eShop und Corporate-Site für Springer: www.springeronline.com * E-Detailing-Portal für Novartis: www.interaktiv.novartis.de * Service-Center-Plattform für Biogen: www.ms-life.de * eCRM-System für Grünenthal: www.gruenenthal.com ___________________________________________________________________________ --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]