What do you mean when you say the file is written in Unicode? UTF-8 is one of the three standard Unicode encodings (the other two being UTF-16 and UTF-32). Does the encoding specified in the document match the actual encoding? It might be helpful for you to send a sample document (as an attachment, not in the body of the message).
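One quick way to answer the second question is to look at the first few bytes of the file and see which byte order mark (BOM), if any, it starts with. Below is a minimal sketch of that check; `sniff_bom` and `sniff_file` are hypothetical helper names of my own, not part of Xerces, and the sketch only recognizes the standard BOM signatures. It assumes a C++11 compiler. Editors on Windows that offer "Unicode" as a save option often mean UTF-16 little-endian, but that is an assumption you would need to confirm for your file.

```cpp
#include <cstddef>
#include <fstream>
#include <string>

// Hypothetical helper: name the encoding implied by a leading BOM, if any.
std::string sniff_bom(const std::string& bytes) {
    auto starts_with = [&](const char* sig, std::size_t n) {
        return bytes.size() >= n && bytes.compare(0, n, sig, n) == 0;
    };
    // Check UTF-32 before UTF-16: FF FE 00 00 also begins with FF FE.
    if (starts_with("\xFF\xFE\x00\x00", 4)) return "UTF-32LE";
    if (starts_with("\x00\x00\xFE\xFF", 4)) return "UTF-32BE";
    if (starts_with("\xEF\xBB\xBF", 3))     return "UTF-8";
    if (starts_with("\xFF\xFE", 2))         return "UTF-16LE";
    if (starts_with("\xFE\xFF", 2))         return "UTF-16BE";
    return "no BOM";
}

// Hypothetical helper: read up to four bytes in binary mode and sniff them.
// An unreadable file yields zero bytes and therefore "no BOM".
std::string sniff_file(const std::string& path) {
    std::ifstream in(path, std::ios::binary);
    char buf[4] = {};
    in.read(buf, sizeof buf);
    return sniff_bom(std::string(buf, static_cast<std::size_t>(in.gcount())));
}
```

Calling `sniff_file` on the output file and comparing the result with the `encoding="..."` value in its XML declaration should show whether the two agree. A result of "no BOM" does not prove anything by itself, since UTF-8 files are frequently written without one.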
> -----Original Message-----
> From: Stephane Negri [mailto:[EMAIL PROTECTED]
> Sent: Tuesday, March 29, 2005 8:31 AM
> To: xerces-c-dev@xml.apache.org
> Subject: Xerces Unicode
>
> Hello All,
>
> I'm modifying an application which already uses Xerces to parse XML
> files, but now I would like to be able to handle Unicode files since
> I'm porting the application to Japanese.
>
> So, I think I've managed to change everything correctly; my output
> file is correctly written in UNICODE. When I try to read it, the parser
> (SAXParser) cannot parse it (but there is no error). If I change the
> output file encoding to UTF-8, I can then read it. It seems that the
> parser cannot parse a file saved as UNICODE, which is strange.
>
> I change the encoding using TextPad only, no change in the code.
>
> Maybe one simple solution is to save the output file with UTF-8
> encoding instead of UNICODE...
>
> Any idea on that?
>
> Maybe it's not a Xerces problem though, but if you have any idea...
> I'm using a wostream to save data...
>
> Thanks a lot,
>
> Stephane