http://nagoya.apache.org/bugzilla/show_bug.cgi?id=10915

copyright symbol isn't UTF-8?

           Summary: copyright symbol isn't UTF-8?
           Product: Xerces2-J
           Version: 2.0.2
          Platform: PC
               URL: ftp://ftp.bind.ca/BIND/spec/xmldtd/BIND.dtd
        OS/Version: Linux
            Status: NEW
          Severity: Normal
          Priority: Other
         Component: SAX
        AssignedTo: [EMAIL PROTECTED]
        ReportedBy: [EMAIL PROTECTED]


The file referenced above (BIND.dtd) contains a copyright symbol in one of
its comments. Parsing the file fails at this character with:

java.io.UTFDataFormatException: invalid byte 1 of 1-byte UTF-8 sequence (0xa9)

Can you confirm whether or not this character _should_ be causing this error?

I know that the Xerces FAQ addresses this error in particular, but as far as I
can tell (having used od -hc) the character in question shouldn't be a problem:

0006260 7279 6769 7468 a920 3032 3130 4d20 756f
          y   r   i   g   h   t       ©   2   0   0   1       M   o   u

The copyright symbol is represented, I believe, by 'a9', which should fit easily
into the acceptable range.
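
One way to check how that byte is treated is to decode it strictly, outside the parser. The following is a minimal sketch, not anything taken from Xerces: the class and method names are made up for illustration, and it assumes a modern JDK (java.nio.charset.StandardCharsets, Java 7 or later). It decodes a lone 0xa9 as UTF-8 with error reporting enabled, and prints the bytes that UTF-8 uses for U+00A9.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class for illustration; not part of Xerces.
class CopyrightByteCheck {

    // Returns true only if the bytes are well-formed UTF-8.
    // REPORT makes the decoder throw instead of silently substituting U+FFFD.
    static boolean isValidUtf8(byte[] bytes) {
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    // Formats bytes as space-separated lowercase hex, e.g. "c2 a9".
    static String hex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("%02x", b & 0xff));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        byte[] loneA9 = { (byte) 0xa9 };

        // The single byte as it appears in the od dump above.
        System.out.println("lone 0xa9 is valid UTF-8: " + isValidUtf8(loneA9));

        // The same byte read as ISO-8859-1 maps to code point 169 (U+00A9).
        String latin1 = new String(loneA9, StandardCharsets.ISO_8859_1);
        System.out.println("0xa9 as ISO-8859-1 is code point: " + (int) latin1.charAt(0));

        // The bytes UTF-8 itself uses for the copyright sign.
        byte[] utf8 = "\u00a9".getBytes(StandardCharsets.UTF_8);
        System.out.println("U+00A9 encoded as UTF-8: " + hex(utf8));
    }
}
```

In this sketch the lone 0xa9 is rejected, because in UTF-8 that value is a continuation byte and the copyright sign is the two-byte sequence c2 a9. The single byte a9 is the copyright sign in ISO-8859-1, so a file containing it would presumably need either a matching encoding declaration or re-saving as UTF-8.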
