The document encoding is "UTF-8" <?xml version="1.0" encoding="UTF-8"?>
Shekhar ----- Original Message ----- From: Andy Clark <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Saturday, March 29, 2003 4:05 AM Subject: Re: UTF-8 Encoding > Shekhar Karani wrote: > > I am using the xerces 2.2.1 to parse XML documents. One of the XML > > documents has a hex character B6. This character is being treated as an > > invalid UTF-8 character by the parser. The parser gives the error > > "Invalid byte 1 of UTF-8 byte stream". However, the editor XML SPY > > version 5, accepts this character. > > What is the encoding of the document? If it is really > ISO Latin 1 ("ISO-8859-1") and you do NOT specify this > in the XML Declaration at the top of the document (e.g. > "<?xml version='1.0' encoding='...'?>"), then your > document is in error. > > If XML Spy accepts it w/o the encoding declaration, > then it is not following the XML specification which > dictates that the encoding of the document is assumed > to be UTF-8 in the absence of the XML declaration. > > -- > Andy Clark * [EMAIL PROTECTED] > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]