Re: Xerces Unicode

2005-03-30 Thread Stephane Negri
Jesse, I used a hexdump utility to examine the contents of your file. It's UTF-16 with a byte-order mark (BOM). The BOM enables a parser to determine the encoding and byte order without reference to the declared encoding. (With a 16-bit encoding, you have to know whether the high-order byte come

Re: Xerces Unicode

2005-03-29 Thread Stephane Negri
>What do you mean when you say the file is written in Unicode? UTF-8 is >> one of the three standard Unicode encodings (the other two being UTF-16 >> and UTF-32). Does the encoding specified in the document match the >> actual encoding? It might be helpful for you to send a sample document sorry

Xerces Unicode

2005-03-29 Thread Stephane Negri
Hello All, I m modyfing an application which already uses xerces to parse XML files, but now I would like to be able to handle unicode files since I m porting the application to japanese. So, I think I ve managed to change everything correclty, my output file is correctly written in UNICODE. Wh