Re: [Standards] <[CDATA[ in XMPP

Rachel Blackman Mon, 30 Jul 2007 19:34:24 -0700

CDATA is purely XML level and doesn't carry any semantic meaning.
And yes, the normal compliant XML parser doesn't even bother to tell
you how the data was encoded in the byte stream.
You are seriously confusing layers here.

Fair enough that I shouldn't have used spaces as the example; you're right that it's invalid, and I simply grabbed it as the sample due to the JID escaping thing.

JID escaping has, however, been put forward as a method of escaping characters to get them across the wire. I think I am failing to get my point across clearly, so I will try one last time. What I've been trying to address is, for instance:

So, we are talking about way to escape characters in an XML stream. My
view is that all way to escape characters are good, especially when they
are defined in XML, cited in XMPP RFC and are simple to implement (and
implemented by all parsers I know).


Read that carefully. "All way to escape characters are good."

If we are viewing CDATA as 'one more way to escape characters,' then we need to think about the implications. Because I will /guarantee/ you that if we recommend CDATA as an escaping method, then someone will do a <![CDATA[john&[EMAIL PROTECTED]> in an <item/> value, or whatever.

My point is that we need to /define/ things like this, rather than leaving them vague. Or else someone WILL go, 'Oh, well, when I send down john&[EMAIL PROTECTED] it disconnects me with a stream error saying there's an unescaped character there. I'll just make sure anything with unescaped characters goes into a CDATA block.' And if they do that, it will be valid XML across the wire, too! It should not pop them off with a stream error, right?
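To make the scenario concrete, here is a minimal sketch using Python's standard xml.etree.ElementTree parser. The <item/> fragments and the address john&doe@example.com are made up for illustration (they are not taken from the thread or from any XMPP schema). It shows that the bare ampersand is rejected as not well-formed, while the entity-escaped form and the CDATA form both parse and hand the application exactly the same text, so the layer that validates JIDs cannot tell which one was sent.

```python
import xml.etree.ElementTree as ET

# Hypothetical fragments: the element name and the address are assumptions
# chosen only to illustrate the point.
escaped = "<item>john&amp;doe@example.com</item>"
cdata = "<item><![CDATA[john&doe@example.com]]></item>"
raw = "<item>john&doe@example.com</item>"


def item_text(xml):
    """Parse one fragment and return the character data of its root element."""
    return ET.fromstring(xml).text


# Both well-formed encodings yield identical character data.
print(item_text(escaped) == item_text(cdata))

# The unescaped ampersand is not well-formed XML, so the parser refuses it.
try:
    item_text(raw)
except ET.ParseError as err:
    print("not well-formed:", err)
```

Because the parser reports only the decoded text, any rule about which characters a JID may contain has to be enforced on that decoded value, regardless of how it was escaped at the XML level.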

If we proclaim that all JIDs must adhere to the current rules and the characters we've discussed as visually useful but invalid to send across the wire as part of a node (namely, & and ' and so on) must be escaped using JID escaping, that's *fine*. Deciding to explicitly say that JIDs cannot contain those characters except as represented in JID escaping is a *valid and viable solution* to my concern.
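For reference, a rough sketch of what JID escaping of a node could look like, assuming the backslash-plus-hex mapping defined in XEP-0106 (for example & becomes \26 and ' becomes \27). The function name escape_node is hypothetical, and the sketch ignores the remaining XEP-0106 rules (such as the handling of leading and trailing spaces), so treat it as an illustration rather than a conforming implementation.

```python
# Character-to-escape mapping as defined in XEP-0106 (JID Escaping).
ESCAPES = {
    " ": r"\20", '"': r"\22", "&": r"\26", "'": r"\27", "/": r"\2f",
    ":": r"\3a", "<": r"\3c", ">": r"\3e", "@": r"\40",
}
# Hex codes that form an escape sequence when they follow a backslash.
CODES = {"20", "22", "26", "27", "2f", "3a", "3c", "3e", "40", "5c"}


def escape_node(node):
    """Escape the node (user) part of a JID; hypothetical helper."""
    out = []
    for i, ch in enumerate(node):
        if ch == "\\" and node[i + 1:i + 3] in CODES:
            # A backslash is escaped only when it would otherwise be
            # mistaken for the start of an escape sequence.
            out.append(r"\5c")
        else:
            out.append(ESCAPES.get(ch, ch))
    return "".join(out)


print(escape_node("john&doe"))
print(escape_node("d'artagnan"))
```

The escaped node contains none of the characters that need XML-level escaping, so it travels as ordinary character data and the question of CDATA versus entities never arises for it.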

If, however, we want to just leave it vague and make CDATA 'one more way to escape characters,' then people will most likely make assumptions about how things interact. Based on past experience, I suspect at least some of those assumptions will be wrong. My point is that if we want to include CDATA, then we need to make it clear where CDATA is /not/ an appropriate solution for escaping.

I hope that makes my concern clearer, but I will leave it alone at this point; I have realized I am arguing this point utterly alone; it may mean that I am utterly failing to communicate my concern clearly, or that I am seeing a problem where one does not exist. I will hope that it is the latter and my concerns are just motivated by my personal generally squidgy feelings about hazily-defined edges to standards, rather than being an actual problem. :)


--
Rachel Blackman <[EMAIL PROTECTED]>
Trillian Messenger - http://www.trillianastra.com/
