Though my hex math isn't good enough to do it in my head, those first three
bytes (ef bb bf) look like the UTF-8 encoding of the Unicode byte order mark.
I know there was some controversy as to whether that was legal, since the BOM
is supposed to indicate byte order for Unicode, whereas UTF-8 has no byte
order. OTOH, if you encode Unicode in UTF-8, are you required to strip the
BOM, or do you just encode it as UTF-8 along with everything else?
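
For anyone who wants to check the hex math: the BOM is the code point U+FEFF,
and UTF-8 encodes code points from U+0800 to U+FFFF as three bytes. Here is a
minimal Python sketch of that arithmetic. The function name encode_three_byte
is made up for illustration, and it only covers that three-byte range.

```python
BOM_CODE_POINT = 0xFEFF  # the Unicode byte order mark


def encode_three_byte(cp: int) -> bytes:
    """Encode a code point in U+0800..U+FFFF using the 3-byte UTF-8 form.

    Bit layout: 1110xxxx 10xxxxxx 10xxxxxx
    """
    if not 0x0800 <= cp <= 0xFFFF:
        raise ValueError("only the three-byte range is handled here")
    return bytes([
        0xE0 | (cp >> 12),          # top 4 bits
        0x80 | ((cp >> 6) & 0x3F),  # middle 6 bits
        0x80 | (cp & 0x3F),         # low 6 bits
    ])


manual = encode_three_byte(BOM_CODE_POINT)
builtin = "\ufeff".encode("utf-8")  # cross-check against Python's own encoder

print(" ".join(f"{b:02x}" for b in manual))  # prints: ef bb bf
print(manual == builtin)                     # prints: True
```

So the three bytes in the dump are exactly what you get if U+FEFF is run
through a UTF-8 encoder like any other character.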

Anyway, it looks like that's it. I don't know whether the parser has been
updated to handle that. It used not to, because I never foresaw such a thing,
and would have thought it was illegal by the XML spec, which says it's either
Unicode with a BOM or the first thing must be <?xml. UTF-8 isn't Unicode,
though it can encode Unicode.
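
If the parser at hand does not skip a leading UTF-8 BOM, one workaround is to
drop those three bytes before handing the data over. A rough Python sketch,
assuming the whole file fits in memory. strip_utf8_bom is a hypothetical
helper, not anything in Xerces, and the sample bytes are the ones from the
dump in the original message.

```python
import codecs


def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM (ef bb bf), if one is present."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


# The bytes from the binary dump in the original message.
dump = bytes.fromhex("ef bb bf 3c 3f 78 6d 6c 20 76 65 72 73 69 6f 6e")

print(strip_utf8_bom(dump))  # prints: b'<?xml version'
```

Data that does not start with the BOM comes back unchanged, so it should be
safe to run on files from editors that don't write one.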

Of course, if that's not what it is, and it's just some random garbage, then
ignore all of this :-)

--------------------------
Dean Roddey
The Charmed Quark Controller
Charmed Quark Software
[EMAIL PROTECTED]
http://www.charmedquark.com

"If it don't have a control port, don't buy it!"


----- Original Message -----
From: "Brendan Reville" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Saturday, January 19, 2002 3:13 PM
Subject: "Invalid document structure" exception?


> hi all,
>
> I have a text .xml file which I saved in Windows Notepad as a UTF-8 file.
>
> However, when I try to parse it with Xerces, I get an "invalid document
> structure" exception on line 1, character 1.  Any idea why I would be
> getting this?
>
>
> The very beginning of the file looks like this:
>
> <?xml versio
>
> A binary dump of the same gives this:
>
> ef bb bf 3c 3f 78 6d 6c   20 76 65 72 73 69 6f 6e
>
> if I'm not mistaken.  I'm not sure what those first three bytes are, but I
> didn't expect to see them there; they don't show up in Notepad, that's for
> sure.
>
> thanks
>
> - Brendan
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>


