On Dec 18, 2009, at 13:25 , Cyril Concolato wrote: >> For "space characters", why did you add U+000B and U+000C? > I think this question is even more important if you note that XHTML 1 > indicates that U+000C is an invalid XML char (see > http://www.w3.org/TR/xhtml1/#C_15)
I don't think that looking at XHTML is the best idea if you want a normative definition for XML :) U+000C is indeed forbidden in XML 1.0 (all editions) but is allowed (IIRC only as a numeric character reference) in XML 1.1. P+C doesn't tie processors to a particular version of XML, and lists its white space characters accordingly (and defensively). If you're certain that you will only ever get content that comes from a conforming XML 1.0 implementation, then you probably don't need to check for this. -- Robin Berjon - http://berjon.com/
