On Tue, Dec 13, 2005 at 03:55:11PM +0530, Arun S K (RBIN/EDM3) * wrote: > <?xml version="1.0" encoding="UTF8"?> > > The document has the character ß (Beeta) in it. The parser aborts with the > following message > -------------------------------------------------------------------- > :13: parser error : Input is not proper UTF-8, indicate encoding ! > Bytes: 0x80 0x20 0x3C 0x2F > <NAME>test_1ß</NAME> > -------------------------------------------------------------------- > > Is ß not a valid UTF8 character?
The character is part of unicode. But the sequence of bytes used to express it are not valid in UTF-8. It is a fatal XML error. > How can this be corrected. Replacing the wrong bytes in the instance by a sequence which is valid for UTF-8. Read the material pointed to at the beginning of http://xmlsoft.org/encoding.html Daniel -- Daniel Veillard | Red Hat http://redhat.com/ [EMAIL PROTECTED] | libxml GNOME XML XSLT toolkit http://xmlsoft.org/ http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/ _______________________________________________ xml mailing list, project page http://xmlsoft.org/ xml@gnome.org http://mail.gnome.org/mailman/listinfo/xml