Eric Brunner-Williams wrote:

>> Anyway, Arabic strings are examples of exponential explosions
>> with large coefficients a lot easier to understand for most
>> of you than Chinese ones.
> 
> the density of variant (context dependent) characters in arabic script, 
> whether sampled as text, or sampled as domain names, is sparse,

While meaningful variations would be sparse, meaningful
capitalization of Latin is also sparse. That is, meaningful
capitalization of "mydomain" should be

        myDomain
        Mydomain
        MyDomain
        MYDOMAIN

However, meaningless capitalization such as:

        mYdOmAiN

should also be protected, which should be same for Arabic.

A protection could be rejection of domain registration. But it,
anyway, requires a document of complete and unambiguous
definition of extended case insensitivities.

And, the definition used for TLDs must be international one.

So, the requirement for complete and unambiguous specification
for case insensitivities or canonicalization is same.

> relative
> to the density of "variant" characters in the (unified) han script(s), 
> which is not quite 2^^n,

You seems to be thinking "variant" for Chinese mean simplified
and complex. But there are other types of "variant" Chinese
characters. It's somewhat like plain 'C' without ceddille and
'C' with ceddille, which, sometimes (here is dependency on
locale information), must be treated as identical characters.

> their answers were as i expected, and fail to support a
> "Arabic strings are examples of exponential explosions with
> large coefficients" claim.

They should, naturally, reject meaningless combinations.

                                                Masataka Ohta
_______________________________________________
DNSOP mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/dnsop

Reply via email to