Hi,

On Wed, Jan 7, 2009 at 11:42 AM, Michael Lackhoff <lackh...@fh-muenster.de> wrote:
> diacritics + base char to the combined character. So I still have two
> characters for e.g. the German umlauts. This might be correct UTF-8 but
> is not useable to present in (X)HTML.
> Is there any other option short of doing it by hand with lots of s///
> for at least the most common combinations?

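You can use NFC() from Unicode::Normalize to do this. It composes each
base character and its combining diacritics into the single precomposed
character wherever one exists. Call it after using MARC::Charset to do
the conversion to UTF-8.

Regards,

Galen
-- 
Galen Charlton
VP, Research & Development, LibLime
galen.charl...@liblime.com
p: 1-888-564-2457 x709
skype: gmcharlt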
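
P.S. A minimal sketch of the NFC() step, in case it helps. The sample string is a hand-built stand-in for what you would get back from MARC::Charset (a "u" followed by U+0308 COMBINING DIAERESIS), and the variable names are only illustrative. The marc8_to_utf8() call appears only in a comment so the snippet runs with core modules alone. NFC() works on Perl character strings, so if you are holding raw UTF-8 bytes, decode them first.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFC);

# With real records the decomposed string would come from MARC::Charset, e.g.:
#   use MARC::Charset qw(marc8_to_utf8);
#   my $decomposed = marc8_to_utf8($marc8_field);   # $marc8_field is hypothetical
# Stand-in for that result: base letter "u" plus U+0308 COMBINING DIAERESIS.
my $decomposed = "M\x{0075}\x{0308}nster";

# Compose base character + combining mark into one precomposed character.
my $composed = NFC($decomposed);

printf "before NFC: %d characters\n", length $decomposed;   # 8
printf "after NFC:  %d characters\n", length $composed;     # 7
```

After the call, the "u" and the combining diaeresis have become the single character U+00FC, which is the form you want for (X)HTML output.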