Rui,

> and byte-length map should be added to support multibyte encoding. 

Do you mean that in order for expat to recognize an encoding of an input 
document, one has to supply the table that tells expat the length of the 
multibyte code for each leading byte in this encoding?

Thinking of this, it should not be too hard to write a small program to 
generate such a table for any given Unicode definition file of an 
encoding, as at

http://www.unicode.org/Public/MAPPINGS/
Still, it's bad that an encoding can't be supported (on input) using 
iconv alone... but this looks like a reasonable way of obtaining the 
extra data file necessary.

Tom

Reply via email to