FdC> However, the original goal of UTF-8 was to be able to use it
FdC> "behind the back" of UNIX, just as we do now with ISO 8859. But
FdC> because UTF-8 is not C1-safe, this idea breaks down
Yes.
FdC> It would have been possible to design a C1-safe UTF-8 (and in
FdC> fact is has been done as a proof-of-concept), but it's too late
FdC> now.
Yes. We need to make do with the current definition of UTF-8.
FdC> It's an interesting problem.
It's a difficult problem, but one that we have no choice but to solve.
As you point out, there is no perfect solution; a solution will
require having one more mode in the terminal driver, ensuring that in
UTF-8 locales the terminal driver is set to that mode by default,
ensuring that applications know how to send C1 characters in the
current locale, etc. All of this is pretty yucky.
Juliusz
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/