Re: filename and normalization (was gcc identifiers)

H. Peter Anvin Wed, 04 Dec 2002 08:41:17 -0800

Followup to:  <[EMAIL PROTECTED]>
By author:    Henry Spencer <[EMAIL PROTECTED]>
In newsgroup: linux.utf8
> 
> The main chance of difficulties with UTF-8 is if different programs take
> different approaches to normalization of filenames.  A standard for that
> would help, as would suitable code in libraries.
>


I expect that Unix systems will use normalization form C, and yes, we
need libraries to do all the various kinds of manipulation that one
can do on Unicode text (as opposed to "general localized text")
including producing the various normalization forms, querying
character properties, and converting to and from the various UTF
forms.

That being said, I consider the production of normalization form C to
be the responsibility of the user input system, and *perhaps* of
editing programs.  I don't believe we should attempt to insert
normalization everywhere, partially because it inevitably leads to
security holes.

        -hpa

-- 
<[EMAIL PROTECTED]> at work, <[EMAIL PROTECTED]> in private!
"Unix gives you enough rope to shoot yourself in the foot."
http://www.zytor.com/~hpa/puzzle.txt    <[EMAIL PROTECTED]>
--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Re: filename and normalization (was gcc identifiers)

Reply via email to