In this case, I think ASCII and UTF-8 are equivalent. If you're certain that there are no other characters in the document than what appear in your original message, this would appear to be a bug - there are no multi-byte sequences in the document you inlined. However, it would be a bug in a fundamental part of the code, so I'm still skeptical that your input is what it appears to be. I'd suggest looking at the input document with a hex editor and/or parsing it under a debugger to see what sequence of characters causes the exception.
> -----Original Message----- > From: Peter Guyatt [mailto:[EMAIL PROTECTED] > Sent: Tuesday, May 11, 2004 9:25 AM > To: [EMAIL PROTECTED] > Subject: RE: UTF-8 Encoding problem > > Hi There, > > It is saved a straight ASCII, do I have to encode the > characters as UTF-8 > before xerces will parse the document ? > > Thanks > > Pete > > -----Original Message----- > From: Jesse Pelton [mailto:[EMAIL PROTECTED] > Sent: 11 May 2004 14:08 > To: [EMAIL PROTECTED] > Subject: RE: UTF-8 Encoding problem > > > The text looks like straight ASCII in the message, which leads me to > wonder if it's been transmitted faithfully. Try sending it as an > attachment. > > > -----Original Message----- > > From: Peter Guyatt [mailto:[EMAIL PROTECTED] > > Sent: Tuesday, May 11, 2004 8:28 AM > > To: [EMAIL PROTECTED] > > Subject: UTF-8 Encoding problem > > > > Hi All, > > > > I was wondering if someone would be able to answer a > > question I have. > > > > I parse the a document as UTF-8 using xerces c 2.2.0 and get > > an error in my > > custom handler stating the following error. > > > > Fatal Error line 5, col 15, Message:An Exception occurred! > > Type:TranscodingException, Message:An invalid multi-byte source text > > sequence was encountered > > > > The actual entry in the XML file is as follows: > > > > <?xml version="1.0" encoding="UTF-8"?> > > <Maintenance> > > <DMPair> > > <Instance>1</Instance> > > <Name>simeecauaeiouaeiou</Name> <!-- Exception here --> > > <ServerPort>9001</ServerPort> > > <IpAddressNode0>172.16.3.28</IpAddressNode0> > > <IpAddressNode1>172.16.3.29</IpAddressNode1> > > <Enabled>False</Enabled> > > <FailureRoutingType>3</FailureRoutingType> > > <FailureRoutingData>2</FailureRoutingData> > > </DMPair> > > </Maintenance> > > > > I am pretty sure that the characters in the Name tag are fine > > since I have > > look at the UTF-8 spec and their character values appear to > > in the valid set > > of unicode characters, also this document parses fine using > > Xerces-J-2.6.2. > > > > Any insight into this will be greatly appreciated. > > > > Thanks in advance > > > > Pete > > > > > > > > > > > > > > > --------------------------------------------------------------------- > > To unsubscribe, e-mail: [EMAIL PROTECTED] > > For additional commands, e-mail: [EMAIL PROTECTED] > > > > > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > > > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]