Though my hex math isn't good enough to do it in my head, it looks like the UTF-8 version of the Unicode byte order mark. I know that there was some controversy as to whether that was legal, since the BOM is supposed to indicate byte order for Unicode, whereas UTF-8 has no byte order. OTOH, if you encode Unicode in UTF-8, is it required to strip the BOM, or just encode it as UTF-8?
Anyway, it looks like that's it. I don't know if the parser was updated to handle that or not. It used to not, because I never forsaw such a thing, and would have thought it was illegal by the XML spec, which says its either Unicode with a BOM or the first thing must be <?xml. UTF-8 isn't Unicode, though it can encode Unicode. Of course, if that's not what it is, and its just some random garbage, then ignore all of this :-) -------------------------- Dean Roddey The Charmed Quark Controller Charmed Quark Software [EMAIL PROTECTED] http://www.charmedquark.com "If it don't have a control port, don't buy it!" ----- Original Message ----- From: "Brendan Reville" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Saturday, January 19, 2002 3:13 PM Subject: "Invalid document structure" exception? > hi all, > > I have a text .xml file which I saved in Windows Notepad as a UTF-8 file. > > However, when I try to parse it with Xerces, I get an "invalid document > structure" exception on line 1, character 1. Any idea why I would be > getting this? > > > The very beginning of the file looks like this: > > <?xml versio > > A binary dump of the same gives this: > > ef bb bf 3c 3f 78 6d 6c 20 76 65 72 73 69 6f 6e > > if I'm not mistaken. I'm not sure what those first three bytes are, but I > didn't expect to see them there; they don't show up in Notepad, that's for > sure. > > thanks > > - Brendan > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional commands, e-mail: [EMAIL PROTECTED] > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
