Re: are surrogate chars allowed in node names?

David Bertoni Wed, 30 May 2007 23:46:42 -0700

sushil kumar wrote:

following is the error message I am getting

F:\Xerces-Binary\xerces-c-windows_2000-msvc_60\bin>DOMCount F:\surrogateelem_new.xml

Fatal Error at file F:\surrogateelem_new.xml, line 8, char 4
  Message: Expected an element name

Errors occurred, no output available


Please reply to the list, and not to my email address.

F:\Xerces-Binary\xerces-c-windows_2000-msvc_60\bin>
XML is well formed and even validate in XML-SPY application if you add following DTD on top of below mentioned XML


Are you sure the XML is well-formed, and that XML Spy is correct?


<?xml version="1.0" encoding="UTF-16" standalone="yes"?>
<!DOCTYPE booklist [
    <!ELEMENT booklist (book*)>
    <!ELEMENT book (टाईटल, author+, 𪘚)>
    <!ELEMENT टाईटल (#PCDATA)>
    <!ELEMENT author (#PCDATA)>
    <!ELEMENT 𪘚 (#PCDATA)>
]>
<booklist>
    <book>
        <टाईटल>𪘚 Title</टाईटल>
        <author>Amit 𪘚</author>
        <author>𪘚.jpg</author>
        <author>Kumar 𪘚</author>
        <𪘚>टाईटल.jpg</𪘚>
    </book>
    <book>
        <टाईटल>𪘚.jpg</टाईटल>
        <author>Charu 𪘚𪘚𪘚𪘚𪘚</author>
        <author>Pankaj 𪘚𪘚𪘚𪘚</author>
        <𪘚>Pearson 𪘚𪘚𪘚𪘚 Press</𪘚>
    </book>
</booklist>

I just want to know whether Xerces support surrogate chars in tag name or not. As I have seen that XML-SPY is validating above mentioned XML.

Well, I pointed you to the XML recommendation, which describes the grammar for element names, and includes a list of the Unicode code points allowed. If you look at that, you'll see that there are no Unicode code points encoded in UTF-16 as surrogate pairs that are allowed as name characters in XML 1.0.
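
To make the surrogate-pair point concrete, here is a rough sketch in Python (not anything Xerces-C does internally). It shows how to tell whether a character needs a surrogate pair in UTF-16, which is the property that rules it out of names under the grammar referred to above. The helper names are made up for illustration, and U+20000 is just an arbitrary code point outside the Basic Multilingual Plane, not necessarily the character in the posted document.

```python
def utf16_code_units(ch):
    """Return the UTF-16 code units of a single character as a list of ints."""
    data = ch.encode("utf-16-be")  # big-endian, no byte order mark
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


def is_surrogate_pair_char(ch):
    """True if the character lies outside the BMP and so needs two UTF-16 code units."""
    return ord(ch) > 0xFFFF


def supplementary_chars(name):
    """Return the characters of a candidate name that need a surrogate pair."""
    return [c for c in name if is_surrogate_pair_char(c)]


devanagari = "\u091f"         # a BMP letter (DEVANAGARI LETTER TTA)
supplementary = "\U00020000"  # arbitrary non-BMP code point, for illustration only

print([hex(u) for u in utf16_code_units(devanagari)])     # ['0x91f']
print([hex(u) for u in utf16_code_units(supplementary)])  # ['0xd840', '0xdc00']
print(supplementary_chars("author"))                      # []
```

A name for which supplementary_chars() returns a non-empty list is one to check against the name productions in the recommendation before blaming the parser.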

Since I don't have your original document (pasting it into an email message doesn't work), I can't say for sure what the actual bytes are. However, I did copy-and-paste the text from your reply, and was able to reproduce the error message (when I saved the document as UTF-8 and updated the XML declaration). In addition, three other XML parsers also reported that the tag name in the DTD on line 4 is not a valid XML name.
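
For anyone who wants to repeat this kind of cross-check, the following is a minimal sketch using Python's xml.etree.ElementTree, which is backed by expat. This is an assumption for illustration: it is not one of the parsers mentioned above, and the one-element test documents and helper names are hypothetical. The sketch saves each document as UTF-8 with a matching XML declaration and reports whether the parser accepts it.

```python
import xml.etree.ElementTree as ET


def make_doc(element_name):
    """Build a hypothetical one-element document, encoded as UTF-8."""
    text = '<?xml version="1.0" encoding="UTF-8"?>\n<{0}>x</{0}>'.format(element_name)
    return text.encode("utf-8")


def well_formedness_error(xml_bytes):
    """Return None if the parser accepts the document, else the error text."""
    try:
        ET.fromstring(xml_bytes)
    except ET.ParseError as exc:
        return str(exc)
    return None


candidates = {
    "ASCII name": "title",
    "Devanagari (BMP) name": "\u091f\u093e\u0908\u091f\u0932",
    "non-BMP name": "\U00020000",  # arbitrary supplementary code point
}

for label, name in candidates.items():
    error = well_formedness_error(make_doc(name))
    print(label, "->", "accepted" if error is None else "rejected: " + error)
```

Running the same document through more than one parser, as described above, is a quick way to tell a parser bug from a document that really is not well-formed.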

Perhaps you should contact Altova and ask them why their XML parser does not reject documents that are not well-formed.


Dave
