DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT <http://nagoya.apache.org/bugzilla/show_bug.cgi?id=10915>. ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND INSERTED IN THE BUG DATABASE.
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=10915 copyright symbol isn't UTF-8? Summary: copyright symbol isn't UTF-8? Product: Xerces2-J Version: 2.0.2 Platform: PC URL: ftp://ftp.bind.ca/BIND/spec/xmldtd/BIND.dtd OS/Version: Linux Status: NEW Severity: Normal Priority: Other Component: SAX AssignedTo: [EMAIL PROTECTED] ReportedBy: [EMAIL PROTECTED] The referenced file above (BIND.dtd) has a copyright symbol contained in one of its comments. This character is causing me to receive: java.io.UTFDataFormatException: invalid byte 1 of 1-byte UTF-8 sequence (0xa9) Can you confirm whether or not this character _should_ be causing this error? I know that the Xerces FAQ addresses this error in particular, but as far as I can tell (having used od -hc) the character in question shouldn't be a problem: 0006260 7279 6769 7468 a920 3032 3130 4d20 756f y r i g h t � 2 0 0 1 M o u The copyright symbol is represented, I believe, by 'a9', which should fit easily into the acceptable range. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
