On Tue, 30 Jan 2001, Tom Kaiser wrote:
Sorry for the late reply.
> Would anyone greatly miss any encoding which does NOT appear in the list
> below? (This is the list of encodings covered by XML::Encoding).
> Big 5, ISO-8859-2 to ISO-8859-9, x-euc-jp, x-euc-kr, x-sjis
> (Shift_JIS), windows-1250
> (plus the built-in ISO-8859-1, US_ASCII, UTF-8 and UTF-16)
While working on the mailing list archives I have seen a lot of encodings
go by and there definately needs to be away to add new ones. windows-####
for example exists in about 6 different flavors and I run into them all
the time.
> Any other comments will be welcome too, especially if you have
> experience with using either of these two alternatives.
> One other option is the ICU lib by IBM. In the present situation though,
> the ICU is not much different from iconv.
I would prefer a solution based on either iconv or ICU lib by IBM. It would
be nice if we could use this in a stream, so we do not have to have the
entire XML file in memory.
Apache 2.0 and APR use iconv. I know they have also discussed using ICU, I
will have to dig through and find out their reasoning and figure out what
would be the best for Sab. We should also ask William Rowe, since he was
the one that imported the libs for Apache.
Also in regards to Expat. Maybe there should be some development on the
new expat that would allow for an option of iconv to be used instead of
the tables. I am sure this is going to be an issue for Greg Stein as well
sooner or later.
Cheers,
--
Sander van Zoest [[EMAIL PROTECTED]]
Covalent Technologies, Inc. http://www.covalent.net/
(415) 536-5218 http://www.vanzoest.com/sander/