, yes,
but not surprising.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
, as the Unicode spec points out, you really want to do
case-insensitive comparisons by first mapping characters to equivalence
classes -- not by mapping to a particular case and assuming that all
equivalent letters will map to the same character.
Henry
to this confusion?
I think you've overlooked the fact that *now* he is talking about tolower(),
not towlower(). tolower() deals with char, not wchar_t, and in most char
character sets, there is no dotless i.
Henry Spencer
. It would help if you could
supply more context: why do you want to do this, as part of what?
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive
thing.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
will differ considerably from the keystrokes needed to
enter it. It's a protocol of some kind, and somebody needs to define how
that protocol works and what its backspace operation does. Unicode
assigns no semantics to codes 8 and 127.
Henry
you might want editing, whether it be a terminal emulator or
an X server. So the problem has user-land solutions in principle; I'm not
saying it would be particularly simple...
Henry Spencer
with any of the languages where these issues get serious.
But the potential is there.)
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http
efficiency issue any more.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
of the error is possible,
or (c) it's important to preserve the original data even if it is
malformed.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive
.)
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
, but the helper function is only part of an implementation and
should not be mislabeled as being the whole thing.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
.)
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
to
differentiate between kilobits and kilobytes with kb and kB.
Changing hyphens and case doesn't make distinctions or avoid confusion.
Yes, it would be better to call the more general encoding, say, UTF-P.
Henry Spencer
, again to avoid confusion. Call it UTF-P, or UTF-8P, or UTF-9,
but not utf8, please.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http
+FEFF is now discouraged.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
!). Arguably, the same issue arises for things
like U+, which are guaranteed to be non-characters and hence should
never appear in input.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8
techniques, in particular, are available only
if the desired sequences are exactly known in advance.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive
.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
all it needs are encoded as precomposed...
As I understand it, the usual written forms of Vietnamese explicitly need
multiple marks per letter; there are no precomposed forms for that.
Henry Spencer
and reversibly.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
not ASCII-compatible...
It might be worth mention, because Java's not the only thing using it.
It's actually quite convenient to be able to make applications
NUL-transparent without having to recode all the string operations.
Henry
boldface as the verbatim
font with verbatim processing, that will go a long way toward doing the
right thing. Bold does not see much other use in traditional manpages.
Henry Spencer
.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
font should perhaps lead to
warning messages for the author.
Hmm, yes, that's probably the best approach.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
On Wed, 4 Dec 2002, Glenn Maynard wrote:
When --help is printed, I want to see two hyphens, not a dash.
You probably want to see two minus signs, not two hyphens...
Henry Spencer
On 10 Sep 2002, H. Peter Anvin wrote:
The only sane way to deal with this is to do a console daemon in
userspace...
As the re-invention of X proceeds apace... :-)
Henry Spencer
raw 8-bit in *headers*, even as a future direction. For the
moment, and probably for a long time to come, mail headers have to use RFC
2047 encodings.
Henry Spencer
[EMAIL PROTECTED
is using right now means that existing notebooks, binders,
shelves, etc. do not suddenly become unusable, as they often do if the
new size is *larger* in one or both dimensions.
Henry Spencer
politics of standardization here, not about right and wrong.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
changes and additions.
Quite apart from any *technical* merit that has, it means that the existing
design's current vendors have to make changes too; this helps sell the
new standard to people who will have to retool completely for it.
Henry
exactly the same proportions as the original
sheet.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
to believe that there is no distinction to be made, that
hyphen is proper for all purposes.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http
.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
, people will use setenv...
For small values of people. :-) Only the experts will. The experts
presumably can get the case of a locale name right.
Henry Spencer
[EMAIL PROTECTED
. :-(
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
at least
a strong historical presence in Latin-alphabet texts, are unreadable to a
lot of Latin-alphabet users, and were nevertheless unified.
Henry Spencer
[EMAIL PROTECTED]
--
Linux
and scholars -- people who *did* expect to have to
deal with it on a day to day basis -- were involved in the design and
implementation of Han unification. This was not some hideous Western plot
foisted on Japan from abroad.
Henry Spencer
until afterward.)
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
on the comments of one person
who clearly has strong opinions on the matter himself.
Henry Spencer
[EMAIL PROTECTED]
--
Linux-UTF8: i18n of Linux on all levels
Archive: http
small companies
building IT hardware. It's not as easy for a small company to get into
the hardware business as it used to be, but it is still feasible;
moreover, encouraging this is important.
Henry Spencer
.
Henry Spencer
[EMAIL PROTECTED]
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/
nd
tolower() was redefined to do that. (A stupid move -- they should have
changed the name when they changed the behavior.)
Henry Spencer
[EMAIL PROTECTED]
-
Linux-UTF8: i18n of Linux on all leve
not be the best way to do that.
Henry Spencer
[EMAIL PROTECTED]
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/
value is unknown or
unrepresentable in Unicode". That is, it marks the place where something
untranslatable used to be.
Henry Spencer
[EMAIL PROTECTED]
-
Linux-UTF8: i18n of Li
of a decoder see no advantage from this behavior, since they are
canonicalizing anyway.
Henry Spencer
[EMAIL PROTECTED]
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/
49 matches
Mail list logo