UTF-8 is not ASCII. Character codes above 127 are transformed into
multiple byte sequences in UTF-8. I haven't checked that 195+188 is
correct for 252, but it sounds about right. It sounds like what you
really want is ISO-8859-1 encoding, which extends ASCII with the basic
European characters (and some special characters) in the 128-255 range.
- Dennis
Chris Huisman wrote:
This is my marshalling/unmarshalling call:
- marshallingContext.marshalDocument(writeData, "UTF-8", null, bos);
where bos is a bufferedOutputStream and writeData is the data I want
to write.
the ansi character (I got the ansi character numbers from TextPad) I
am trying to write and read is: 252 (�) and it gets transformed into
the ansi characters: 195 and 188 (ü).
Thanks,
Chris.
Dennis Sosnoski wrote:
Can you spell out the details of how you're configuring the output
and what the actual byte values are that are being written? That
should help isolate what's going on.
- Dennis
Chris Huisman wrote:
Hi,
I'm using UTF-8 to encode my documents, but I'm having a problem
with '�', it comes up as: 'ü' instead. Does anyone else have any
experience with this?
Thanks,
Chris.
-------------------------------------------------------
This SF.net email is sponsored by: IT Product Guide on ITManagersJournal
Use IT products in your business? Tell us what you think of them. Give us
Your Opinions, Get Free ThinkGeek Gift Certificates! Click to find out more
http://productguide.itmanagersjournal.com/guidepromo.tmpl
_______________________________________________
jibx-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/jibx-users