What do you mean when you say the file is written in Unicode?  UTF-8 is
one of the three standard Unicode encodings (the other two being UTF-16
and UTF-32).  Does the encoding specified in the document match the
actual encoding?  It might be helpful for you to send a sample document
(as an attachment, not in the body of the message).

> -----Original Message-----
> From: Stephane Negri [mailto:[EMAIL PROTECTED] 
> Sent: Tuesday, March 29, 2005 8:31 AM
> To: xerces-c-dev@xml.apache.org
> Subject: Xerces Unicode
> 
> Hello All,
> 
> 
> I m modyfing an application which already uses xerces to parse XML 
> files, but now I would like to be able to handle unicode 
> files since I m 
> porting the application to japanese.
> 
> So, I think I ve managed to change everything correclty, my 
> output file 
> is correctly written in UNICODE. When I try to read it, the parser 
> (SAXParser) can not parse it (but there is no error). If I change the 
> output file encoding to UTF-8, I can, then, read it. It seems 
> that the 
> parser cannot parse UNICODE saved file, which is strange.
> 
> I change the encoding using Textpad only, no change in the code.
> 
> Maybe one simple solution is to save the output file with 
> UTF-8 encoding 
> instead of UNICODE...
> 
> Any idea on that ?
> 
> Maybe it s not a xerces problem thought, but if you have any idea ...
> I m using a wostream to save data ...
> 
> Thanks a lot,
> 
> Stephane

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to