Hi Markus,

Markus Scherer <[EMAIL PROTECTED]> wrote: 
> ICU 2.8 has the ability to handle m:n character conversion mappings driven
> by simple lines in 
> Unicode conversion tables (text files).

That's a nice coiincidence, to have this feature. I was wondering
if this would enable transcoding from legacy Tamil charsets (in visual
glyph order, like Thai) to Unicode. 

I've looked at the example data files for the m:n mappings but
it's still opaque to me, what hat to go in the headers. Is there a
point to start reading from to gain further insights?

I'm especially wondering, whether the converter by default will
take the longest matching entry in an m:n table or whether
to sequence of entries is significant. (Something must be done
to e.g. disambiguate keLa from kau).

Regards,
Peter Jacobi


-- 
+++ GMX - die erste Adresse für Mail, Message, More +++
Neu: Preissenkung für MMS und FreeMMS! http://www.gmx.net



Reply via email to