Hello everyone,
I wish to implement a TokenFilter that will remove accentuated
characters so for example '�' will become 'e'. As I would rather not
reinvent the wheel, I've tried to find something on the web and on the
mailing lists. I saw a mention of a contrib that could do this (see
http://www.mail-archive.com/lucene-user%40jakarta.apache.org/msg02146.html),
but I don't see anything applicable.
Has anyone done this yet, if so I would much appreciate some pointers
(or code), otherwise, I'll be happy to contribute whatever I produce
(but it might be very simple since I'll only need to deal with french).
Cheers,
Stephane
--
To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>
- Re: Accentuated characters stephane vaucher
- Re: Accentuated characters Joshua O'Madadhain
- Re: Accentuated characters stephane vaucher
- RE: Accentuated characters Alex Murzaku
- Re: Accentuated characters stephane vaucher
- RE: Accentuated characters Alex Murzaku
- RE: Accentuated characters Eric Isakson
- Re: Accentuated characters stephane vaucher
- RE: Accentuated characters Eric Isakson
- Re: Accentuated characters stephane vaucher
- RE: Accentuated characters Eric Isakson
