DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT <http://nagoya.apache.org/bugzilla/show_bug.cgi?id=19327>. ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND INSERTED IN THE BUG DATABASE.
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=19327 Character entities are escaped too aggressively ------- Additional Comments From [EMAIL PROTECTED] 2003-07-16 02:32 ------- Since we will support UTF-8 and UTF-16 only (for now) the Xerces based implementation was way too heavy. Therefore I have searched for an alternative and found http://czyborra.com/utf/. I have implemented the two encoders based on the presented algorithms. See attachment for a proof of concept. Steve: You said in #15133 we need to handle chars < 32. Do you have any further details for me? How should we treat ASCII0. Throw a runtime exception?
