Followup to:  <[EMAIL PROTECTED]>
By author:    Glenn Maynard <[EMAIL PROTECTED]>
In newsgroup: linux.utf8
>
> On Wed, Dec 04, 2002 at 04:03:38PM +0100, Keld J�rn Simonsen wrote:
> > Well, users should not expect these two sequences to be identical,
> > they are not, according to ISO/IEC 10646.
> 
> Users expect that "�" == "�", and don't know or care about Unicode, and
> that's reasonable.
> 
> Programmers should care, of course, but programmers aren't the only ones
> who use filenames, and this problem, as Henry pointed out, is a more
> general one.
> 

The issue is where the normalization enters the picture.  It should be
done at input time, so that when a user presses the � key on their
keyboard they get U+00D6.  Problem solved.  If this is U+004F U+0308
then someone has entered something weird to begin with.  "ls" may
chose to display this as an anomaly by outputting it as O\u0308 or
something like that, but that's again a presentation issue.

          -hpa
-- 
<[EMAIL PROTECTED]> at work, <[EMAIL PROTECTED]> in private!
"Unix gives you enough rope to shoot yourself in the foot."
http://www.zytor.com/~hpa/puzzle.txt    <[EMAIL PROTECTED]>
--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to