Hi,

On Wed, Jan 7, 2009 at 11:42 AM, Michael Lackhoff
<lackh...@fh-muenster.de> wrote:
> diakritics + base char to the combined character. So I still have two
> characters for e.g. the
> German umlauts. This might be correct UTF-8 but is not useable to
> present in (X)HTML.
> Is there any other option short of  doing it by hand with lots of s///
> for at least the most common
> combinations?

You can use NFC() from Unicode::Normalize to do this (after using
MARC::Charset to do the conversion to UTF-8).

Regards,

Galen
-- 
Galen Charlton
VP, Research & Development, LibLime
galen.charl...@liblime.com
p: 1-888-564-2457 x709
skype: gmcharlt

Reply via email to