Though making expat to switch to iconv seems to be the best long term 
solution, we can't rely or even wait for it. Expat doesn't change much 
and its developers answer mails with a huge delay (if ever). This is not 
criticism (I see expat is not their fulltime), just a matter of fact.

As iconv can't be used as universal mean for all sablotron/expat 
encoding transformations, we need another way to handle the encoding 
issues. We aren't able to change expat, but what we can is to learn 
Sablot to convert charset the same way as expat. XML::Encoding charset 
maps were created by expat developers (Clark Cooper) for expat and 
XML::Parser. If Sablotron addopts this way of conversion, we get a 
robust and general mean for both input and output transformations, 
supporting about 15 basic charsets. Other charset maps could be added 
easily. On top of it, iconv would provide a lot of additional encodings 
for output (as it does now). Compared to the current chaotic state, it 
doesn't sound bad to me.

I'm looking forward to hear your suggestions or comments.
Petr

Pavel Storek wrote:

> Hi,
> 
> IANAP,  but  it seems, that we will have to seriously consider doing the
> right  thing and stay with iconv for all of the encodings except the few
> supported  internally  by  expat/sablotron.  This  would  obviously mean
> working  with  expat  people  to  include  iconv  into  expat  for input
> conversions  and  suck only utf  from  expat to sablotron and at the end use
> iconv  again  for  output  translation. This is the only robust and long
> term  solution IMHO.  The  only  drawback  is the systems, where iconv is not
> available  or  somehow  broken.  These  would be limited to the internal
> character sets, right?
> 
> Now I am ducking to avoid the stones from people who know, what they are
> discussing.
> 
-- 
Petr Cimprich
Ginger Alliance Ltd.
www.gingerall.com

Reply via email to