Re: Xerces Unicode

2005-03-30 Thread Johannes Gutleber
*** NOTE *** Dear Sir/Madam, Your e-mail address has not been registered previously with this incoming mailbox. Due to excessive amounts of incoming unsolicitated e-mails, Your e-mail may be filtered out. If You think that happens, or if You wish to make your e-mail address white-listed for this

Re: Xerces Unicode

2005-03-30 Thread Stephane Negri
Jesse, I used a hexdump utility to examine the contents of your file. It's UTF-16 with a byte-order mark (BOM). The BOM enables a parser to determine the encoding and byte order without reference to the declared encoding. (With a 16-bit encoding, you have to know whether the high-order byte come

RE: Xerces Unicode

2005-03-29 Thread Jesse Pelton
r, you might want to. > -Original Message- > From: Stephane Negri [mailto:[EMAIL PROTECTED] > Sent: Tuesday, March 29, 2005 8:54 AM > To: xerces-c-dev@xml.apache.org > Subject: Re: Xerces Unicode > > > >What do you mean when you say the file is written

Re: Xerces Unicode

2005-03-29 Thread Stephane Negri
>What do you mean when you say the file is written in Unicode? UTF-8 is >> one of the three standard Unicode encodings (the other two being UTF-16 >> and UTF-32). Does the encoding specified in the document match the >> actual encoding? It might be helpful for you to send a sample document sorry

RE: Xerces Unicode

2005-03-29 Thread Jesse Pelton
What do you mean when you say the file is written in Unicode? UTF-8 is one of the three standard Unicode encodings (the other two being UTF-16 and UTF-32). Does the encoding specified in the document match the actual encoding? It might be helpful for you to send a sample document (as an attachme