On Wed, Jun 27, 2012 at 1:26 PM, Mark Rejhon <[email protected]> wrote:
> Technically, it is not the responsibility of the XML processor to do > normalization, and normalization should be done closer to the GUI end. > Then the indexes (p and n attributes) used by real time editing action > elements are always accurate. > Let me clarify this: From http://www.w3.org/TR/2006/REC-xml11-20060816/#sec-normalization-checking Section 2.1.3 Normalization Checking XML processors are encouranged to *check* normalization, but not do the actual normalizing. ---- XML processors must not transform the input to be in fully normalized form. XML applications that create XML 1.1 output from either XML 1.1 or XML 1.0 input should ensure that the output is fully normalized; it is not necessary for internal processing forms to be fully normalized. The purpose of this section is to strongly encourage XML processors to ensure that the creators of XML documents have properly normalized them, so that XML applications can make tests such as identity comparisons of strings without having to worry about the different possible "spellings" of strings which Unicode allows. ---- So, essentially, it's strongly encouraged for senders to normalize their Unicode strings *early* -- before the RTT encoder. Beyond this, for a compliant XML processor, it's no longer a technical problem against XEP-0301. Cheers, Mark Rejhon
