Now to update ... everything.
A. Kevin Baynes wrote:
Did you try using UTF-8?
XML is designed to support Unicode in order to provide foreign language characters; I believe that all XML parsers are required to support UTF-8 at a minimum, many support UTF-16 as well. The Latin-1 codepage (ISO-8859-1) is a smaller set than UTF-8, so UTF-8 is more 'generic' in that it is a superset of Latin-1.
I use Crimson and Xerces on numerous platforms (Windows, *NIX, AS/400, OS/390) using UTF-8 for input and output without any problems.
~akb
| -----Original Message----- | From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] | Behalf Of Joseph R. McVerry | Sent: Tuesday, September 09, 2003 11:09 AM | To: [EMAIL PROTECTED] | Subject: [Juglist] XML and Java and Codepages | | | Lately I have had several users report that their XML parsers are | getting exceptions related to codepage mismatch. One example, below, | the Crimson parser doesn't like the codepage as defined in the dtd. The | code page in the document says "ISO-8859-1"; Crimson wants Cp1252. | | "Warning at (file null, line 1, char -1): org.apache.crimson.parser/P-061 | ISO-8859-1 Cp1252 | Exception Message: Fatal Error at (file null, line 2, char -1): | org.apache.crimson.parser/P-055 envelopeRules.dtd | | All this user did was move the document from one machine to another. As | with all unsightly bugs, the parser runs fine on the first machine. | | BTW, this isn't related just to Crimson. I've seen similar reports from | other Java/XML parsers. | | Is this because the codepage is not loaded on his apache machine? | | Should I specify a more generic code page? Presently it's ISO-8859-1, | which I thought was as generic as codepages get. | | Any help and tips would be appreciated. | | Joe McVerry | | | -- | ----------- | Joseph R. McVerry, President | American Coders Ltd. | POBox 97462 | Raleigh, NC 27624 USA | 919.846.2014 (voice/fax) | http://www.americancoders.com | Home Of OBOE - an EDI and EDI/XML Translator | and xBaseJ - xBase Database Engine For Java | | | | | _______________________________________________ | Juglist mailing list | [EMAIL PROTECTED] | http://trijug.org/mailman/listinfo/juglist_trijug.org
_______________________________________________ Juglist mailing list [EMAIL PROTECTED] http://trijug.org/mailman/listinfo/juglist_trijug.org
-- ----------- Joseph R. McVerry, President American Coders Ltd. POBox 97462 Raleigh, NC 27624 USA 919.846.2014 (voice/fax) http://www.americancoders.com Home Of OBOE - an EDI and EDI/XML Translator and xBaseJ - xBase Database Engine For Java
_______________________________________________ Juglist mailing list [EMAIL PROTECTED] http://trijug.org/mailman/listinfo/juglist_trijug.org
