In this case, I think ASCII and UTF-8 are equivalent.  If you're certain
that there are no other characters in the document than what appear in
your original message, this would appear to be a bug - there are no
multi-byte sequences in the document you inlined.  However, it would be
a bug in a fundamental part of the code, so I'm still skeptical that
your input is what it appears to be.  I'd suggest looking at the input
document with a hex editor and/or parsing it under a debugger to see
what sequence of characters causes the exception.

> -----Original Message-----
> From: Peter Guyatt [mailto:[EMAIL PROTECTED] 
> Sent: Tuesday, May 11, 2004 9:25 AM
> To: [EMAIL PROTECTED]
> Subject: RE: UTF-8 Encoding problem
> 
> Hi There,
> 
>       It is saved a straight ASCII, do I have to encode the 
> characters as UTF-8
> before xerces will parse the document ?
> 
> Thanks
> 
> Pete
> 
> -----Original Message-----
> From: Jesse Pelton [mailto:[EMAIL PROTECTED]
> Sent: 11 May 2004 14:08
> To: [EMAIL PROTECTED]
> Subject: RE: UTF-8 Encoding problem
> 
> 
> The text looks like straight ASCII in the message, which leads me to
> wonder if it's been transmitted faithfully.  Try sending it as an
> attachment.
> 
> > -----Original Message-----
> > From: Peter Guyatt [mailto:[EMAIL PROTECTED]
> > Sent: Tuesday, May 11, 2004 8:28 AM
> > To: [EMAIL PROTECTED]
> > Subject: UTF-8 Encoding problem
> >
> > Hi All,
> >
> >     I was wondering if someone would be able to answer a
> > question I have.
> >
> > I parse the a document as UTF-8 using xerces c 2.2.0 and get
> > an error in my
> > custom handler stating the following error.
> >
> > Fatal Error line 5, col 15, Message:An Exception occurred!
> > Type:TranscodingException, Message:An invalid multi-byte source text
> > sequence was encountered
> >
> > The actual entry in the XML file is as follows:
> >
> > <?xml version="1.0" encoding="UTF-8"?>
> > <Maintenance>
> >     <DMPair>
> >             <Instance>1</Instance>
> >             <Name>simeecauaeiouaeiou</Name> <!-- Exception here -->
> >             <ServerPort>9001</ServerPort>
> >             <IpAddressNode0>172.16.3.28</IpAddressNode0>
> >             <IpAddressNode1>172.16.3.29</IpAddressNode1>
> >             <Enabled>False</Enabled>
> >             <FailureRoutingType>3</FailureRoutingType>
> >             <FailureRoutingData>2</FailureRoutingData>
> >     </DMPair>
> > </Maintenance>
> >
> > I am pretty sure that the characters in the Name tag are fine
> > since I have
> > look at the UTF-8 spec and their character values appear to
> > in the valid set
> > of unicode characters, also this document parses fine using
> > Xerces-J-2.6.2.
> >
> > Any insight into this will be greatly appreciated.
> >
> > Thanks in advance
> >
> > Pete
> >
> >
> >
> >
> >
> >
> > 
> ---------------------------------------------------------------------
> > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > For additional commands, e-mail: [EMAIL PROTECTED]
> >
> >
> >
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
> 
> 

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to