Re: [jibx-users] page break characters

Dennis Sosnoski Fri, 14 Jan 2005 16:13:44 -0800

ASCII code 12 (form feed, 0x0C) is one of the characters forbidden by the XML 1.0 recommendation. Below 0x20 (space), the only characters allowed are 0x09 (tab), 0x0A (line feed), and 0x0D (carriage return), so switching to a different encoding won't help. This changes somewhat in the new 1.1 recommendation, but 1.1 is not very usable at present due to lack of parser and tool support.
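
To make the rule concrete, here is a minimal sketch of a check against the XML 1.0 character ranges. The class and method names (Xml10Chars, isLegal, firstIllegalIndex) are made up for illustration and are not part of JiBX or any parser.

```java
// Hypothetical helper, not part of JiBX: tests text against the
// XML 1.0 "Char" production.
final class Xml10Chars {
    private Xml10Chars() {}

    /** Returns true if the code point is allowed in an XML 1.0 document. */
    static boolean isLegal(int codePoint) {
        return codePoint == 0x09 || codePoint == 0x0A || codePoint == 0x0D
            || (codePoint >= 0x20 && codePoint <= 0xD7FF)
            || (codePoint >= 0xE000 && codePoint <= 0xFFFD)
            || (codePoint >= 0x10000 && codePoint <= 0x10FFFF);
    }

    /** Returns the index of the first forbidden character, or -1 if there is none. */
    static int firstIllegalIndex(String text) {
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            if (!isLegal(cp)) {
                return i;
            }
            i += Character.charCount(cp);
        }
        return -1;
    }
}
```

Running something like Xml10Chars.firstIllegalIndex(value) over the strings before marshalling should show where the page breaks (or any other forbidden codes) sit in the data.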

The only thing I can suggest is that you run your data through a filter (for example, a custom serializer/deserializer for the strings that can contain page breaks) and translate the page breaks to some character code that's not normally present in the data (perhaps '|', or one of the character codes above 0x80 in ISO-8859-1). You'd also need to translate any other forbidden character codes below 0x20 if they're present in your data. Of course, this means whatever you exchange documents with has to follow the same translation convention.
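
A rough sketch of such a filter follows, as a pair of static String-to-String methods. Everything here is an assumption for illustration: the class name ControlCharFilter is invented, and instead of '|' or a high ISO-8859-1 code it maps each forbidden control code to a Unicode private-use character (U+E000 plus the code), so that one rule covers all the forbidden codes below 0x20. It only works if those private-use characters never appear in the real data, and hooking the methods into a binding is not shown.

```java
// Hypothetical filter, not part of JiBX. Assumes U+E000..U+E01F never
// occur in the real data, so they are free to act as placeholders.
final class ControlCharFilter {
    private static final int OFFSET = 0xE000;

    private ControlCharFilter() {}

    /** True for codes below 0x20 that XML 1.0 does not allow. */
    private static boolean isForbiddenControl(char c) {
        return c < 0x20 && c != 0x09 && c != 0x0A && c != 0x0D;
    }

    /** Replaces forbidden control codes with placeholders before marshalling. */
    static String encode(String text) {
        if (text == null) {
            return null;
        }
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            out.append(isForbiddenControl(c) ? (char) (OFFSET + c) : c);
        }
        return out.toString();
    }

    /** Restores the original control codes after unmarshalling. */
    static String decode(String text) {
        if (text == null) {
            return null;
        }
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            boolean placeholder = c >= OFFSET && c < OFFSET + 0x20
                    && isForbiddenControl((char) (c - OFFSET));
            out.append(placeholder ? (char) (c - OFFSET) : c);
        }
        return out.toString();
    }
}
```

With this convention a page break goes out as U+E00C and comes back as 0x0C, and decode(encode(s)) returns the original string as long as the assumption about the data holds. Both ends of the exchange need to apply the same pair of methods.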

 - Dennis

Chisholm, Paul wrote:

Hi,
The data I am marshalling/unmarshalling contains page break (ASCII code 12) characters. I am mapping the data to a Java String object and using UTF-8 encoding. Processing fails when page break characters are in the data. Do I need to change to a different encoding?
Thanks,
Paul

