I am involved a project that support web users input in different language,
and do some sort things on database. I've encountered some difficulties on
the decision of different user's encoding, different user's input in
different language, and parse the info in servlet and then do sort on
On the cover of my French driver's license, it says ``Driving
license'' in 10 languages (all the EU languages at the time it was
printed). The titles are ordered alphabetically by the name of the
language in the language itself. The Portuguese don't seem to mind.
(Fair enough,
Actually, in the case of the 10 EU languages being referred to, I do not
think there would be any dissention as to the order, would there be?
Admittedly if Lithuania was in the EU and there were countries that
started
with a "Y" there as well, there would be problems with people who did not
I admit to nitpicking because in this particular case, the language
names,
we may be just lucky so that there are no collation conflicts.
I believe this is an accurate statement... .we ARE lucky, so far.
But believing that there is a collation order that works across all
[EMAIL PROTECTED] wrote:
But believing that there is a collation order that works across all the
European (Latin script, let's not even go to Cyrillic and Greek) languages
is a very hopeless fallacy:
Quite true. But there is a *default* collation that works *fairly* well,
plus machinery for
Assuming that ISO 8859 is anything official (which is not what I think),
then (use of) Latin-1 (and Latin-3 Latin-5) have not been deprecated,
but
rather a note have been added in the text of these standards to explain
that
there are small restrictions. Then, a note in a informative
At 07:53 AM 06/15/2000 -0800, Michael Kaplan (Trigeminal Inc.) wrote:
Eventually someone will have a language name that does not fit
or a language like German will inist on sorting sooner, under Deutsch rather
than under German, etc. (which I personally think makes more sense than
making a
There is a new ISO standard coming out for a default collation,
namely ISO 14651, and a Unicode technical report too, which
should be equivalent technically. This should also be apllicable
to subsets of 10646, like the one you are indication (which I
read as 8859-1-ish). Nowadays I would
I have tried without success to find information on how to view multilingual
Web pages with a Macintosh and which multilingual fonts are available, so I
have documented the things I have discovered by a process of trial and
error, and produced a new page in my collection of Unicode information
Lars Marius Garshol [EMAIL PROTECTED] wrote
I would be glad if people here could read through it and tell me if
they see any mistakes (or other kinds of things that could be
improved).
This seems to have a nice chatty style. Just a couple of points:
- When defining characters, you omit
10 matches
Mail list logo