UTF-8 is not ASCII. Character codes above 127 are encoded as multi-byte sequences in UTF-8, and 195 followed by 188 (0xC3 0xBC) is the correct two-byte encoding of 252 (ü, U+00FC). It sounds like what you really want is ISO-8859-1 encoding, which extends ASCII with the basic European characters (and some special characters) in the 128-255 range, one byte per character.
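As a quick check of the byte values discussed here, the following is a minimal sketch using only the standard Java library, not JiBX. The class and method names (`EncodingDemo`, `utf8Bytes`, `latin1Bytes`) are made up for illustration. It encodes the single character 252 (ü) both ways and prints the unsigned byte values.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical demo class, not part of JiBX.
class EncodingDemo {
    // Convert signed Java bytes to unsigned values in the range 0-255.
    static int[] unsigned(byte[] bytes) {
        int[] out = new int[bytes.length];
        for (int i = 0; i < bytes.length; i++) {
            out[i] = bytes[i] & 0xFF;
        }
        return out;
    }

    // Byte values produced when the string is encoded as UTF-8.
    static int[] utf8Bytes(String s) {
        return unsigned(s.getBytes(StandardCharsets.UTF_8));
    }

    // Byte values produced when the string is encoded as ISO-8859-1.
    static int[] latin1Bytes(String s) {
        return unsigned(s.getBytes(StandardCharsets.ISO_8859_1));
    }

    public static void main(String[] args) {
        String u = "\u00FC"; // the character ü, code point 252
        System.out.println(Arrays.toString(utf8Bytes(u)));   // [195, 188]
        System.out.println(Arrays.toString(latin1Bytes(u))); // [252]
    }
}
```

So the output described in this thread is what a correct UTF-8 encoder should produce; only an ISO-8859-1 encoding would write the single byte 252.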

 - Dennis

Chris Huisman wrote:

This is my marshalling/unmarshalling call:
- marshallingContext.marshalDocument(writeData, "UTF-8", null, bos);
where bos is a BufferedOutputStream and writeData is the data I want to write.


The ANSI character (I got the ANSI character numbers from TextPad) I am trying to write and read is: 252 (ü) and it gets transformed into the ANSI characters: 195 and 188 (Ã¼).
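To illustrate why the two bytes show up as two separate characters, here is a small sketch in plain Java (no JiBX involved; `MojibakeDemo` and its methods are hypothetical names). It assumes the viewer interprets the file with a single-byte encoding such as ISO-8859-1, which is a guess about how the editor was configured rather than something stated in this thread.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical demo class, not part of JiBX.
class MojibakeDemo {
    // Interpret the bytes as UTF-8 text.
    static String decodeAsUtf8(byte[] bytes) {
        return new String(bytes, StandardCharsets.UTF_8);
    }

    // Interpret the same bytes as ISO-8859-1 text (one character per byte).
    static String decodeAsLatin1(byte[] bytes) {
        return new String(bytes, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        // The two bytes reported in the output file.
        byte[] written = {(byte) 195, (byte) 188};

        // Print code points rather than characters so the result does not
        // depend on the console's own encoding.
        System.out.println(Arrays.toString(decodeAsUtf8(written).codePoints().toArray()));   // [252]
        System.out.println(Arrays.toString(decodeAsLatin1(written).codePoints().toArray())); // [195, 188]
    }
}
```

Read as UTF-8, the bytes give back the single character 252; read one byte at a time, they appear as the characters 195 (Ã) and 188 (¼).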

Thanks,

Chris.

Dennis Sosnoski wrote:

Can you spell out the details of how you're configuring the output and what the actual byte values are that are being written? That should help isolate what's going on.

 - Dennis

Chris Huisman wrote:

Hi,

I'm using UTF-8 to encode my documents, but I'm having a problem with 'ü': it comes up as 'Ã¼' instead. Does anyone else have any experience with this?
Thanks,


Chris.


