Followup to: <[EMAIL PROTECTED]>
By author: [EMAIL PROTECTED]
In newsgroup: linux.utf8
>
> >Unlike UCS-2, UTF-8 can also encode the entire 31-bit Unicode space.
>
> 20.1-bit space. And UTF-16 (which has mostly supplanted UCS-2, even
> if the UTF-16 support is still a bit beta) can encode the same 20.1
> space. Those characters UTF-16 doesn't support aren't going to be
> used for Unicode characters.
>
Note that the 20.1-bitness exists *only* to make UTF-16 possible.
UTF-16 (and other things which have chosen to assume UTF-16
limitations) is the only thing which will break if 20.1 bits turns out
to be insufficient.
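
For anyone wondering where "20.1 bits" comes from, here is a rough sketch of the arithmetic. The function names are mine, purely for illustration; the byte-length table assumes the original 31-bit UTF-8 definition with sequences of up to six bytes.

```python
import math

# UTF-16 reaches U+0000..U+10FFFF: the BMP plus 16 supplementary planes
# addressed through surrogate pairs.
UTF16_MAX = 0x10FFFF
UTF16_CODE_POINTS = UTF16_MAX + 1          # 17 planes * 65,536


def from_surrogates(high, low):
    """Combine a high (D800..DBFF) and low (DC00..DFFF) surrogate."""
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


def original_utf8_length(cp):
    """Bytes needed under the original 31-bit UTF-8 scheme (illustrative)."""
    limits = (0x80, 0x800, 0x10000, 0x200000, 0x4000000, 0x80000000)
    for length, limit in enumerate(limits, start=1):
        if cp < limit:
            return length
    raise ValueError("beyond the 31-bit space")


# log2 of the number of code points UTF-16 can address
print(round(math.log2(UTF16_CODE_POINTS), 1))
# The largest surrogate pair lands exactly on the UTF-16 ceiling
print(hex(from_surrogates(0xDBFF, 0xDFFF)))
# The 31-bit scheme still has room far above that ceiling
print(original_utf8_length(0x7FFFFFFF))
```

The ceiling falls out of the surrogate mechanism itself, which is why only UTF-16 (and anything that adopted its limit) cares about it.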
> It shouldn't call it the Linux encoding; for most purposes, the Linux
> Unicode encoding is UTF-8, notably including filenames. Most of us
> on this list have UTF-8 named files, if only for the sake of making
> sure they work. Your filesystem is broken; the proper thing to do
> (albeit not necessarily easy or practical) is to replace it with
> a POSIX filesystem (which can be swapped transparently, for most
> purposes, and which handles UTF-8 transparently.)
>
> >The Linux encoding used by [ FS ] is a hack. It is not as space-
> >efficient as UTF-8, nor is it a de jure (official) standard.
>
> It's not a de facto standard, either. In the several years I've
> been on this list and [EMAIL PROTECTED], I've never heard of
> it.
>
> >The Linux encoding is used because the system is not ready for UTF-8.
>
> What is "the system" here? It sure as heck isn't Linux. Again I object
> to the misnomer "Linux Encoding".
>
> >The Linux encoding only uses US-ASCII, unlike UTF-8.
>
> You just explained how it encoded U+0080-U+00FF as bytes. So it doesn't use
> US-ASCII, it uses Latin-1; according to your description, it only works
> right under Latin-1.
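
The US-ASCII versus Latin-1 point in the quoted text can be checked with a small sketch. It assumes only what the quote describes (U+0080-U+00FF stored as single bytes); the helper names are hypothetical and say nothing about how the filesystem in question actually works.

```python
# Bytes 0x80..0xFF, i.e. the range the quoted description stores directly.
HIGH_BYTES = bytes(range(0x80, 0x100))


def is_ascii(data):
    """True if every byte is in the 7-bit US-ASCII range."""
    try:
        data.decode("ascii")
        return True
    except UnicodeDecodeError:
        return False


def latin1_code_points(data):
    """Decode as Latin-1: each byte maps to the code point of equal value."""
    return [ord(ch) for ch in data.decode("latin-1")]


print(is_ascii(HIGH_BYTES))                                            # not US-ASCII
print(latin1_code_points(HIGH_BYTES) == list(range(0x80, 0x100)))      # byte == code point
# UTF-8 spends two bytes on the same characters, e.g. U+00E9:
print("\u00e9".encode("utf-8"), "\u00e9".encode("latin-1"))
```

A byte-per-character scheme covering U+0080-U+00FF is therefore Latin-1 in all but name, and such bytes do not form valid UTF-8 on their own.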
I am completely confused about what this so-called "Linux encoding" is
supposed to be...
-hpa
--
<[EMAIL PROTECTED]> at work, <[EMAIL PROTECTED]> in private!
"Unix gives you enough rope to shoot yourself in the foot."
http://www.zytor.com/~hpa/puzzle.txt <[EMAIL PROTECTED]>
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/