Can you say more about this?

Is the source code available?

How do you decide which diacritics to add?
For example, both Mueller and Muller get the umlaut added
on the "u".

Do you know of code that removes diacritics in a reasonable
way, e.g. for systems that can only handle ASCII?
Ideally your approach to adding diacritics would be fully reversible
when processing the unaccented words,
but that is perhaps too idealistic.
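
For concreteness, here is a minimal sketch of the kind of ASCII-oriented
stripping I have in mind. It assumes Python's standard unicodedata module;
the name strip_diacritics is only illustrative, and this is not meant to
describe how nlp.petamem.com works internally.

```python
import unicodedata

def strip_diacritics(text):
    """Remove combining marks after canonical decomposition (NFD)."""
    # NFD splits a precomposed letter such as U+00FC into "u" + U+0308.
    decomposed = unicodedata.normalize("NFD", text)
    # Keep every character that is not a combining mark.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

# Both spellings collapse to the same string, so the stripped form
# alone cannot tell us which one to restore.
print(strip_diacritics("Müller"))
print(strip_diacritics("Muller"))
```

Letters that have no canonical decomposition (for example the German
sharp s or the Scandinavian o-slash) pass through unchanged, so a real
ASCII fallback would likely need an extra substitution table. And since
two different inputs can yield the same output, reversing the step needs
a dictionary or some other context.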
 
Hopefully helpfully yours,
Steve
-- 
Steven Tolkin          [EMAIL PROTECTED]      617-563-0516 
Fidelity Investments   82 Devonshire St. V4D     Boston MA 02109
There is nothing so practical as a good theory.  Comments are by me, 
not Fidelity Investments, its subsidiaries or affiliates.


> -----Original Message-----
> From: Richard Jelinek [mailto:[EMAIL PROTECTED]
> Sent: Wednesday, April 02, 2003 9:29 AM
> To: [EMAIL PROTECTED]
> Subject: NLP portal nlp.petamem.com
> 
> 
> Hi,
> 
> thought you might be interested in http://nlp.petamem.com
> 
> The aim of this site is to provide a nice portal with various NLP
> services. The backend is mostly pure Perl, the frontend is
> mod_perl2. It's still in development, doesn't contain many features
> yet and has some flaws - but we're working on it.
> 
> -- 
> best regards,
> 
>      Dipl.-Inf. Richard Jelinek
> 
>      - PetaMem s.r.o. - Ocelarska 1 - Prague - www.petamem.com -
>                      -= 2026049 Mind Units =-
> 
