Re: [precis] width mapping in draft-yoneya-precis-mappings

Peter Saint-Andre Thu, 27 Sep 2012 17:32:26 -0700

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi Martin, thanks for the helpful explanation. Comments at end.


On 9/22/12 5:06 AM, "Martin J. Dürst" wrote:
> Hello Peter, others,
> 
> On 2012/09/21 4:31, Peter Saint-Andre wrote: Section 3.1 of
> draft-yoneya-precis-mappings states:
> 
> Width mapping will increase backward compatibility with Stringprep 
> [RFC3454] and precis framework [I-D.ietf-precis-framework].
> Because in a Stringprep profile which specifies Unicode
> normalization form KC (NFKC) for normalization method,
> fullwidth/halfwidth characters are mapped into its compatible form.
> If a precis framework profile specified NFKC (which is not
> recommended), width mapping might not be useful.
> 
> Is backward compatibility the only reason to specify width
> mapping?
> 
>> I very much don't think so. The full-width/half-width distinction
>> that came to Unicode from East Asian Encodings is an artifact
>> from the age of fixed character cell widths. In high-quality
>> typesetting, there are various conventions for when to use one or
>> the other form (and the "half-width" form is set in proportional
>> type). But in everyday use, it's just mostly happenstance that
>> decides which form is used.
> 
> If so, then it would be good to say that width mapping is
> appropriate for technologies that are migrating from stringprep
> (with NFKC) to precis, but not for technologies that never had a
> stringprep profile.
> 
> If not, then it would be good to explain why it can be appropriate
> for a technology that uses precis (but never used stringprep) to
> specify width mapping. For example, perhaps width mapping helps to
> prevent violations of the "principle of least user surprise"
> because users in language communities with fullwidth and halfwidth
> characters might not be able to tell the difference between various
> widths on common devices.
> 
>> "Might not be able to tell the difference" is the wrong
>> expression. It's not too difficult to spot and identify the
>> difference. It's much more a "not being motivated to spot the
>> difference". An average American reader would be perfectly able
>> to tell the difference between serif and sans-serif type. But
>> when they read or write something, they (in most cases) just
>> don't care. Why should they?
> 
>> So (I wish it were not, but) indeed width mapping is way more
>> important than just for backwards compatibility.
> 
>> NFKC is in stringprep simply because the design team wanted to
>> have a width mapping for IDNA 2003, and specifying NFKC was the
>> easiest way to get there. In practice, width mapping is way more
>> important than many of the mappings in NFC. And it's definitely
>> the most important part of NFKC.
> 
> In my opinion, some discussion of the security implications of
> width mapping (e.g., the possibility of more "fase positives")
> would also be helpful.
> 
>> I don't think there will be false positives. Saying that e.g. IBM
>> and ＩＢＭ are should be two different things is simply crazy. So it
>> should be either width mapping, or the less basic forms (i.e.
>> full-width for Latin and half-width for Kana,...) should be
>> prohibited.

That all makes sense. It sounds as if we'll want to advise PRECIS
usages and subclasses to strongly consider performing width mapping.
Do you think it is so important that we ought to think about adding it
to the framework alongside casemapping and the other "first-class"
PRECIS rules?

Peter

- -- 
Peter Saint-Andre
https://stpeter.im/


-----BEGIN PGP SIGNATURE-----
Version: GnuPG/MacGPG2 v2.0.18 (Darwin)
Comment: Using GnuPG with Mozilla - http://www.enigmail.net/

iEYEARECAAYFAlBk5E4ACgkQNL8k5A2w/vxsDQCdHj7nRoX89MWQ36b0LdOAKhei
yzgAn17jS1E1yUxbOLvX30As1K0pge97
=SJnY
-----END PGP SIGNATURE-----
_______________________________________________
precis mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/precis

Re: [precis] width mapping in draft-yoneya-precis-mappings

Reply via email to