On Wed, Jun 27, 2012 at 1:26 PM, Mark Rejhon <[email protected]> wrote:

> Technically, it is not the responsibility of the XML processor to do
> normalization, and normalization should be done closer to the GUI end.
>  Then the indexes (p and n attributes) used by real time editing action
> elements are always accurate.
>

Let me clarify this:

From
http://www.w3.org/TR/2006/REC-xml11-20060816/#sec-normalization-checking
Section 2.1.3 Normalization Checking
XML processors are encouranged to *check* normalization, but not do the
actual normalizing.

----
   XML processors must not transform the input to be in fully normalized
form. XML applications that create XML 1.1 output from either XML 1.1 or
XML 1.0 input should ensure that the output is fully normalized; it is not
necessary for internal processing forms to be fully normalized.
    The purpose of this section is to strongly encourage XML processors to
ensure that the creators of XML documents have properly normalized them, so
that XML applications can make tests such as identity comparisons of
strings without having to worry about the different possible "spellings" of
strings which Unicode allows.
----

So, essentially, it's strongly encouraged for senders to normalize their
Unicode strings *early* -- before the RTT encoder.   Beyond this, for a
compliant XML processor, it's no longer a technical problem against
XEP-0301.

Cheers,
Mark Rejhon

Reply via email to