Re: Text in composed normalized form is king, right? Does anyone generate text in decomposed normalized form?

Philippe Verdy Wed, 06 Feb 2013 01:32:52 -0800

2013/2/5 Richard Wordingham <[email protected]>:
> On Tue, 5 Feb 2013 12:16:47 +0100
> Philippe Verdy <[email protected]> wrote:
>
>> A process can be FULLY conforming by preserving the canonical
>> equivalence and treating ALL strings that are canonically equivalent,
>> without having to normalize them in any recommanded form,...
>
> Try doing UCA collation with <U+0302 COMBINING CIRCUMFLEX ACCENT,
> U+0067 LATIN SMALL LETTER G> being a collation element (with arbitrary
> collation elements) without doing normalisation.


<0302, 0067> is defective, and its normalisation is still <0302,
0067>, it is NOT canonically equivalent to <0067, 0302>

I was not speaking about arbitrary collation elements containing
defective sequences, is is a real case ?

Consider how you
> would handle <U+011D LATIN SMALL LETTER G WITH CIRCUMFLEX, U+011D,
> U+011D>!

with which collation rule set ? including defective collection elements ?

Re: Text in composed normalized form is king, right? Does anyone generate text in decomposed normalized form?

Reply via email to