A simple FST (finite-state transducer) would also be a flexible and elegant solution. Here is one built for Lucene: http://sourceforge.net/projects/normalizer/
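
For plain French/English text, a much smaller approach than a full normalization library can be sketched with the JDK alone. The code below is only an illustration under my own assumptions: it is not the code from the SourceForge project above, nor ICU4J's API. It uses java.text.Normalizer from the standard library to decompose each accented letter into a base letter plus combining marks, then drops the marks. The class name AccentFolder and the method fold are hypothetical names chosen for this example.

```java
import java.text.Normalizer;

// Hypothetical helper: folds accented letters to their unaccented base letters.
public class AccentFolder {

    // Decompose to NFD (base letter followed by combining marks),
    // then remove every character in the Unicode "Mark" category.
    public static String fold(String input) {
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // Unicode escapes keep the source ASCII-only; the text is "déjà vu à l'école".
        String sample = "d\u00e9j\u00e0 vu \u00e0 l'\u00e9cole";
        System.out.println(fold(sample));
    }
}
```

The same folding would have to be applied both at index time and at query time so that the two sides match. Ligatures such as "œ" have no canonical decomposition and pass through unchanged, so they would need a separate mapping if that matters for your texts.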

-----Original Message-----
From: stephane vaucher [mailto:[EMAIL PROTECTED]]
Sent: Thursday, December 12, 2002 12:23 PM
To: Lucene Users List
Subject: Re: Accentuated characters

Thanks for the reference. I basically work with French, English, or bilingual texts. I'll take a quick look at the lib, but it might be overkill.

Cheers,
Stephane

Alex Murzaku wrote:

>IBM's ICU4J has a normalizer which should do what you need. It's a big
>library, but if you deal with multilingual text often, it might make
>your life easier.

--
To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>
