Jacob Lund wrote:
Thanks!

Could you then explain the relationship between UTF-8 and XML? Does it make
sense to have unescaped UTF-8 encoding in XML or should UTF-8 always be
escaped when used in XML?


XML frequently uses Unicode (more often than not, in fact), though a document can be stored in other character encodings (XML processors all (?) use Unicode internally). Most often, the Unicode is encoded as UTF-8. Using UTF-8 as-is in XML is sensible and a normal, safe approach. Of course, you still have to do _some_ escaping because it's XML: markup characters like < and & (which are, of course, Unicode characters too) must be escaped in text, and > is usually escaped along with them. But that's XML escaping (using entity references like &lt; and &amp;), not escaping the UTF-8.
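
To make the distinction concrete, here is a minimal sketch in Python (my choice of language purely for illustration; the sample string and the variable names are made up). It escapes only the markup characters, writes the rest out as plain UTF-8 bytes, and checks that a parser hands back the original text. The last two lines show that a numeric character reference such as &#233; is an optional alternative, not a requirement.

```python
from xml.sax.saxutils import escape
import xml.etree.ElementTree as ET

# Hypothetical sample text: non-ASCII letters plus XML markup characters.
text = "Blåbærgrød < 5 & café"

# XML escaping touches only the markup characters (&, <, >).
# The non-ASCII letters are left exactly as they are.
escaped = escape(text)

# Build a tiny document and encode it as UTF-8 bytes.
doc = '<?xml version="1.0" encoding="UTF-8"?><note>' + escaped + "</note>"
data = doc.encode("utf-8")

# A parser decodes the UTF-8 and undoes the XML escaping.
root = ET.fromstring(data)
assert root.text == text

# A numeric character reference is an equivalent, optional spelling of é.
ref_root = ET.fromstring(b"<note>caf&#233;</note>")
assert ref_root.text == "café"
```

Note that å ends up in the file as the two raw bytes C3 A5, with no escaping of any kind; only < and & were rewritten.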


Mike

