At 06:46 AM 9/7/2000 -0800, Antoine Leca wrote:
When we were developping ISO 15924, Michael E. insisted on having a
different code for Gaelic Latin as opposed to normal Latin.
In TrueType Open / OpenType, there are different codes for Traditionnal
vs. "Reformed" Malayalam. All of these appear to
Please ignore my previous message (subj "[EMAIL PROTECTED]", to Antoine,
cc [EMAIL PROTECTED]). Sorry about that.
Antoine Leca wrote:
[EMAIL PROTECTED] wrote:
[...]
In ordinary cases, a ZW[N]J inside a consonant cluster does
not prevent
matra reordering. E.g., in Devanagari:
"John Hudson" [EMAIL PROTECTED] wrote:
we need be clear about what is being tagged -- script variants,
orthographic variants or language specific practices --, so as to avoid the
kind of inconsistencies that exist in ISO 639.
I'd add regional (and/or country) specific variants to this list.
Tuesday, September 12, 2000
Last Friday was International Literacy Day here at LC. SIL was among
those distributing literature here. From it I gather their goal is to
define and implement writing systems for many presently unwritten
languages
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]
What is confusing is that sometimes "surrogates" refer to
certain code units (for UTF-16) that are reserved as code points,
and sometimes "surrogates" is used to refer to 'characters
on planes 01-10'. I think the latter is a misuse.
I thnk there are codes given to entities in the Ethnologue list that aren't
languages in the sense that we need to identify languages in IT and in
Bibliography (which is what the codes are for). I think that it is not
mature for International Standardization. It is a work in progress, subject
to
Michael Everson wrote:
I don't see how. It isn't difficult to learn the Cyrillic alphabet, and it
isn't difficult to learn the Latin alphabet. Look at Greece: everybody
knows both the Greek and Latin alphabets. It's on all the street signs.
Quite right. They are all just local variants of a
On 09/12/2000 12:18:37 PM Michael Everson wrote:
I thnk there are codes given to entities in the Ethnologue list that
aren't
languages in the sense that we need to identify languages in IT and in
Bibliography (which is what the codes are for).
Perhaps there is a cat that needs to be let out of
"Christopher J. Fynn" wrote:
I think a clear distinction may need to be made between those languages which
are commonly written and those which are (largely) only spoken. Outside the
realm of specialised applications for linguists, most applications currently
only deal with written
[EMAIL PROTECTED] wrote:
- BMP characters: characters in the BMP; note that d800-dfff are not
characters; fffe and are also not characters
Not in the Glossary, but "BMP" is.
- "astral"/supplementary/extended-plane/?? characters: everything in planes
1 - 16 (excluding anything ending
We do need to clean up terminology, and we need to do so in a way that
incorporates understanding of UTR-17. I think we need:
- BMP characters: characters in the BMP; note that d800-dfff are not
characters; fffe and are also not characters
- "astral"/supplementary/extended-plane/??
Peter noted:
We do need to clean up terminology, and we need to do so in a way that
incorporates understanding of UTR-17. I think we need:
- BMP characters: characters in the BMP; note that d800-dfff are not
characters; fffe and are also not characters
-
Oh Michael...
I think there are codes given to entities in the Ethnologue list that
aren't languages in the sense that we need to identify languages in IT
and in Bibliography
ISO 639, and every other "standard" for language/locale codes also has this problem,
and from what I remember of the
For what it's worth, I've been referring to characters between 0x1 and
0x10 as "higher-plane" characters as distinguished from BMP characters.
Seems to work well in a general way. For plane 1, I use "plane=1"
characters, etc.
Murray
Does anyone know if the JavaScript function parseFloat is
locale sensitive? I am using German Windows 2000 and the
function parseFloat still will not work if the input is not
in English format (with period as the decimal separator).
Thanks,
Paul
Paul Deuter
Internationalization Manager
[EMAIL
Le Mardi, Septembre 12, 2000, à 11:36 AM, Misha Wolf a écrit :
I can't stand "astral planes". The term suggests, to me at
least, that these planes (and, hence, the characters in them)
are not as "real" as the BMP.
I guess it doesn't bother me much since I'm used to talking about
John Le Mardi, Septembre 12, 2000, à 11:36 AM, Misha Wolf a écrit :
I can't stand "astral planes". The term suggests, to me at least, that
these planes (and, hence, the characters in them) are not as "real" as
the BMP.
John I guess it doesn't bother me much
beard-
any clue ?
Paul Deuter wrote:
Does anyone know if the JavaScript function parseFloat is
locale sensitive? I am using German Windows 2000 and the
function parseFloat still will not work if the input is not
in English format (with period as the decimal separator).
Thanks,
Paul
Hmmm, I hope this is tongue in cheek, the math flashbacks are scary here!
We could have some fun in the future with "hostile characters", "psychotic
characters", "apathetic characters", and other variations. :-)
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
Can anyone point me to an existing list of languages that is more
comprehensive and better researched than the Ethnologue?
If there is no such list, then we don't need to consider any
alternatives, right?
I'm not qualified to judge the merits of one list over another
but there certaily
I still think one plane should be delegated entirely to Mr Everson and he
should be left to get on with populating it to his heart's content :-) This
could save everybody concerned lots of time and expense.
- Chris
- Original Message -
From: "Kenneth Whistler" [EMAIL PROTECTED]
To:
Mischa Wolf wrote:
I can't stand "astral planes". The term suggests, to me at
least, that these planes (and, hence, the characters in them)
are not as "real" as the BMP.
Stars are real.
Christopher J. Fynn wrote:
I still think one plane should be delegated entirely to Mr Everson
On 09/12/2000 02:59:38 PM Kenneth Whistler wrote:
[snip]
I think Ken's comments on planes is good.
3. The term "surrogate character" should be eschewed altogether, because
of the confusion is causes. "Surrogate code point" can continue to
be used as it currently is, and the term
I guess it doesn't bother me much since I'm used to talking about
"imaginary"
and "irrational" numbers, too. (My thirteen-year-old refuses to believe
that
imaginary numbers are "real." *sigh*)
In a recent email to someone giving a long explanation to how multilingual
technologies work and why
On 09/12/2000 09:23:01 PM "James Kass" wrote:
I can't stand "astral planes". The term suggests, to me at
least, that these planes (and, hence, the characters in them)
are not as "real" as the BMP.
Stars are real.
This may be true, but it doesn't help everyday users to understand the
On Tue, 12 Sep 2000, John Cowan wrote:
"Christopher J. Fynn" wrote:
In short I favour inclusion of codes for written languages in the Ethnolouge
list which are currently missing in ISO 639 (and the requirement for a certain
number of publications does not seem too onerous) - but do not
26 matches
Mail list logo