On Tue, Dec 13, 2005 at 03:55:11PM +0530, Arun S K (RBIN/EDM3) * wrote:
> <?xml version="1.0" encoding="UTF8"?>
> 
> The document has the character ß (Beeta) in it. The parser aborts with the 
> following message 
> --------------------------------------------------------------------
> :13: parser error : Input is not proper UTF-8, indicate encoding !
> Bytes: 0x80 0x20 0x3C 0x2F
>                               <NAME>test_1ß</NAME>
> --------------------------------------------------------------------
> 
> Is ß not a valid UTF8 character?

  The character is part of Unicode, but the sequence of bytes used to
express it in this document is not valid UTF-8. That is a fatal XML error.
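
  To illustrate the difference between the character and its bytes, here
is a small Python sketch (not libxml2 code). The helper name is_valid_utf8
is made up for this example, and using Latin-1 as the "other" encoding is
only an assumption; the original mail does not say how the file was saved.

```python
# "ß" (U+00DF) is a Unicode character; what matters is how it is stored as bytes.
utf8_bytes = "ß".encode("utf-8")      # two bytes: 0xC3 0x9F
latin1_bytes = "ß".encode("latin-1")  # one byte: 0xDF (assumed legacy encoding)


def is_valid_utf8(data: bytes) -> bool:
    """Hypothetical helper: True if data decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# The byte sequence quoted in the parser error message above.
reported = bytes([0x80, 0x20, 0x3C, 0x2F])

print(utf8_bytes.hex(), is_valid_utf8(utf8_bytes))
print(latin1_bytes.hex(), is_valid_utf8(latin1_bytes))
print(reported.hex(), is_valid_utf8(reported))
```

  The reported sequence starts with 0x80, which in UTF-8 can only appear
as a continuation byte after a lead byte, so a UTF-8 decoder has to
reject it.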

> How can this be corrected?

  Replace the wrong bytes in the instance with a sequence which is
valid UTF-8 (for ß that is the two bytes 0xC3 0x9F).
   Read the material pointed to at the beginning of
     http://xmlsoft.org/encoding.html
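
  One possible way to do that replacement, sketched in Python under the
assumption that the file was really saved in a single-byte encoding such
as ISO-8859-1 (the mail does not confirm this; check what your editor or
export tool actually wrote). The function name reencode_to_utf8 and the
sample bytes are hypothetical.

```python
def reencode_to_utf8(raw: bytes, source_encoding: str = "iso-8859-1") -> bytes:
    """Decode raw using the encoding it was really written in,
    then re-encode the resulting text as UTF-8."""
    text = raw.decode(source_encoding)
    return text.encode("utf-8")


# Hypothetical sample: the element from the question, with ß stored as the
# single Latin-1 byte 0xDF (an assumption, not taken from the real file).
broken = b"<NAME>test_1\xdf</NAME>"
fixed = reencode_to_utf8(broken)

# The converted bytes now decode cleanly as UTF-8.
print(fixed.decode("utf-8"))
```

  The alternative hinted at by the error message ("indicate encoding") is
to leave the bytes alone and name the real encoding in the XML
declaration instead of UTF-8.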

Daniel

-- 
Daniel Veillard      | Red Hat http://redhat.com/
[EMAIL PROTECTED]  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/