Something flexible and elegant would also be a simple fst.
Here is one built for lucene:
 http://sourceforge.net/projects/normalizer/

-----Original Message-----
From: stephane vaucher [mailto:[EMAIL PROTECTED]] 
Sent: Thursday, December 12, 2002 12:23 PM
To: Lucene Users List
Subject: Re: Accentuated characters


Thanks for the reference. I basically work with french, english, or 
bilingual texts. I'll take a quick look at the lib, but it might be an 
overkill.

Cheers,
Stephane

Alex Murzaku wrote:

>IBM's ICU4J has a normalizer which should do what you need. It's a big 
>library, but if you deal with multilingual text often, it might make 
>your life easier.
>
>
>
>-----------------------------------------------------------------------
>-
>
>--
>To unsubscribe, e-mail:
<mailto:[EMAIL PROTECTED]>
>For additional commands, e-mail: 
><mailto:[EMAIL PROTECTED]>
>



--
To unsubscribe, e-mail:
<mailto:[EMAIL PROTECTED]>
For additional commands, e-mail:
<mailto:[EMAIL PROTECTED]>



--
To unsubscribe, e-mail:   <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Reply via email to