In a message dated 2001-07-02 12:08:57 Pacific Daylight Time,
[EMAIL PROTECTED] writes:
Latin Extended A and B are not l33tish enough, although we
can expect them to turn up in the names of rock bands.
Which reminds me that, once again, another minor revision of Unicode will be
released
I don't think there's any point in encoding 64 hexagrams; especially when
we have the pieces already. Use the pieces of three and position them with
a drawing program. We don't have combining thingies for putting chess
pieces on board squares, either.
Rick
On Mon, 2 Jul 2001, Martin Kotulla wrote:
Can anyone give me some information on the Slovenian and Croat letters
in the Unicode range U+0200 to U+0217?
They are used to mark the tone in phonetics:
double grave marks a short high voice
inverted breve marks a long high voice
Along with them,
27 Jun 2001 13:38:33 +0100, Gaute B Strokkenes [EMAIL PROTECTED] pisze:
I would be indebted if any of the experts who hang out on the
unicode list could sort out this confusion.
I would be glad if the resolution allowed UTF-8 and UTF-32 encoders and
decoders to not worry about surrogates at
From: "Marcin 'Qrczak' Kowalczyk" [EMAIL PROTECTED]
It's a pity that UTF-16 doesn't encode characters up to U+F, such
that code points corresponding to lone surrogates can be encoded as
pairs of surrogates.
Unfortunately, we would then be stuck with what happens when two such
surrogate
Tue, 3 Jul 2001 01:50:56 -0700, Michael (michka) Kaplan [EMAIL PROTECTED] pisze:
It's a pity that UTF-16 doesn't encode characters up to U+F, such
that code points corresponding to lone surrogates can be encoded as
pairs of surrogates.
Unfortunately, we would then be stuck with what
At 08:04 + 2001-07-03, Marcin 'Qrczak' Kowalczyk wrote:
27 Jun 2001 13:38:33 +0100, Gaute B Strokkenes [EMAIL PROTECTED] pisze:
I would be indebted if any of the experts who hang out on the
unicode list could sort out this confusion.
I would be glad if the resolution allowed UTF-8 and
Hi Everyone,
I was wondering if anyone could help?
We have created a test oracle database and used character set UTF8.
Unfortunately when we go to svrmgrl and look for the language parameter, it
comes back with American. Is this Correct?
How can I check and find out which character set is
Hi all,
I'd like to ask about the encoding status of the Japanese Jindai
scripts, which are mentioned in older documents[1], and until a certain
point in time, versions of the Roadmap.
Here's a scan of a partial table of over a dozen Jindai scripts (except
the rightmost column, which is
Looks
interesting. How are you approaching the complication that transliteration
is between pairs of languages?
E.g.
Russian to English, Russian to French, Russian to German, and Russian to
Finnish, all these are slightly different (as far as I know), because the
goal of transliteration is
Rick McGowan wrote:
I don't think there's any point in encoding 64 hexagrams; especially when
we have the pieces already. Use the pieces of three and position them with
a drawing program. We don't have combining thingies for putting chess
pieces on board squares, either.
Hi Rick,
I
Thomas Chan wrote...
I'd like to ask about the encoding status of the Japanese Jindai
scripts, which are mentioned in older documents[1], and until a certain
point in time, versions of the Roadmap.
Do you have a paper on the topic? You say over a dozen 'Jindai'
scripts. What does this
Looks interesting. How are you approaching the complication that transliteration is
between pairs of languages?
I know what you mean: Gorbachev is Gorbatschow in German.
I think that the rules that we have in ICU are probably English-centric where it makes
a difference.
Note that some of
As Markus says, one can do that right now, by making your own (say)
German-Serbian transliterator, one that is different from Latin-Cyrillic,
Latin-Serbian, or German-Cyrillic. In ICU 2.0, we are examining the
possibility of a lookup heirarchy, similar to the resource heirarchy, that
would allow
I trust that 'moving' a name or a term between languages would be called
transcription, not transliteration. Transliteration just tries to 'move' from script
to script.
Markus Scherer writes:
Looks interesting. How are you approaching the complication that transliteration
is between
I don't think Jindai-Moji
(script of the period when Japan was
created by the gods from heaven)
can be treated as real scripts.
It is very doubtful they really existed (in history).
Actually, there is no proof they ever had been used
except by some nationalist scholars (but even there not in
At 04:16 PM 7/2/2001, Michael Everson wrote:
At 12:33 -0700 2001-07-02, Edward Cherlin wrote:
Has anyone proposed the following for inclusion in Unicode? If so, what
is their status?
Daoist Hexagrams, 64 forms (the trigrams are already included, but with
no combining mechanism)
You're welcome
At 11:18 PM 7/2/2001, Rick McGowan wrote:
I don't think there's any point in encoding 64 hexagrams; especially when
we have the pieces already. Use the pieces of three and position them with
a drawing program. We don't have combining thingies for putting chess
pieces on board squares, either.
They are fake old syllabaries for Japanese, made in the very nationalistic
kokugaku circles in the 18th century (Hirata Atsutane/Tsurumine Shigenobu/
Ookuni Takamasa/Ochiai Naozumi etc.), many obviously under Korean
influence. They were claimed by their discoverers/makers to go back to the
time
Richard Cook wrote:
--A: They are compositionally formed from the 8 trigrams.
Rebuttal: By this reasoning, the 8 trigrams themselves ought not to have
been encoded, since the 8 trigrams can be generated from simple broken
and unbroken lines. This alone is not a reason to encode them, but
On Tue, 3 Jul 2001, Rick McGowan wrote:
Thomas Chan wrote...
I'd like to ask about the encoding status of the Japanese Jindai
scripts, which are mentioned in older documents[1], and until a certain
point in time, versions of the Roadmap.
Do you have a paper on the topic? You say over
I know what you mean: Gorbachev is Gorbatschow in German.
Gorbatsov in Finnish transliteration, the ch would be very unwieldy
for a Finnish mouth. (The s is used solely in transliteration, not
in Finnish proper.)
I think that the rules that we have in ICU are probably
English-centric
Another list member mentioned (off-list) the system of 9 bigrams and 81 tetragrams.
These appear in the text of a book called [U+592a][U+7384][U+7d93]
Tai Xuan Jing by [U+63da][U+96c4] Yang Xiong.(c.53BC-c.18AD).
Where the 64 hexagrams are based on a binary system,
the 81 tetragrams are based
From: ext [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED]]
Sent: Tuesday, July 03, 2001 2:56 PM
To: [EMAIL PROTECTED]; [EMAIL PROTECTED]
Subject: RE: Unicode transliterations (and other operations)
I know what you mean: Gorbachev is Gorbatschow in German.
Gorbatsov in Finnish
At 14:35 -0400 2001-07-03, Thomas Chan wrote:
I'm just puzzled by the disappearance of mentions of them in what I can
find on the publically available parts of the unicode.org and WG2
websites, even if its just to say that not enough is known about them to
do anything, e.g., WG2 N1955
I think the absence of the 64 hexagrams is a mistake, and the idea of
composing them out of the trigrams (or of composing the trigrams out
of pieces either) is extremely silly. Sorry, Rick, but there are
things one can decompose and things one cannot. These are semantic
entities, regardless
At 11:40 AM 7/3/2001, Christopher John Fynn wrote:
Richard Cook wrote:
--A: They are compositionally formed from the 8 trigrams.
Rebuttal: By this reasoning, the 8 trigrams themselves ought not to have
been encoded, since the 8 trigrams can be generated from simple broken
and
At 23:18 -0700 2001-07-02, Rick McGowan wrote:
I don't think there's any point in encoding 64 hexagrams; especially when
we have the pieces already. Use the pieces of three and position them with
a drawing program.
That isn't plain text.
We don't have combining thingies for putting chess
Thank you Kay Genenz. This web page is helpful. I was not aware of any
of this info. I'm not surprised they disappeared from the roadmap.
Should one consider the Chinese oracle bone
inscriptions (1200 BC) for entry to the unicode list?
They really did exist.
They are unified with the
At 13:33 -0700 2001-07-03, Edward Cherlin wrote:
The conclusion seems to be that jindai moji are historical fakes
that passed into real, if limited, use. Realler than Klingon,
apparently. :-) So the question is, do any historians want to create
an electronic corpus for study, and do they need
At 13:59 -0700 2001-07-03, Edward Cherlin wrote:
But I thought proposals for characters with decompositions into existing
characters are no longer being accepted.
True for accented letters where the combining marks already exist,
but I don't think we want to have two sets of trigrams, one
From the Irish Times, 2001-07-03
Finnish Academic releases Elvis hits sung in Sumerian
A FINNISH academic whose quirky recordings of Elvis Presley songs in
Latin have gained cult status has now put the King of Rock 'n' Roll
back a few thousand more years -- with a record in the ancient
At 8:07 PM +0200 7/3/01, Genenz wrote:
Should one consider the Chinese oracle bone
inscriptions (1200 BC) for entry to the unicode list?
They really did exist.
As a rule, historical scripts (in which I'll include OBI, even though
their descendant is with us today), are encoded when the
John Cowan wrote:
Rick McGowan scripsit:
I don't think there's any point in encoding 64 hexagrams; especially when
we have the pieces already. Use the pieces of three and position them with
a drawing program. We don't have combining thingies for putting chess
pieces on board
Michael Everson wrote:
At 13:59 -0700 2001-07-03, Edward Cherlin wrote:
But I thought proposals for characters with decompositions into existing
characters are no longer being accepted.
True for accented letters where the combining marks already exist,
but I don't think we want to
John H. Jenkins wrote:
At 8:07 PM +0200 7/3/01, Genenz wrote:
Should one consider the Chinese oracle bone
inscriptions (1200 BC) for entry to the unicode list?
They really did exist.
As a rule, historical scripts (in which I'll include OBI, even though
their descendant is with us
So if I was trying to write my fake name in Polish, or for a Pole to read, I would
write it as "Tendou Rjuud{U+017E}i"?
That would be transliteration, right?
$B$i$s$^(B $B!z$8$e$&$$$C$A$c$s!z(B
$B!!!_$"$+$M(B
$B!(B: Re: Unicode transliterations (and other operations)
I
In a message dated 2001-07-03 21:06:50 Pacific Daylight Time, [EMAIL PROTECTED]
writes:
So if I was trying to write my fake name in Polish, or for a Pole to read,
I
would write it as Tendou Rjuud{U+017E}i?
That would be transliteration, right?
Maybe not. This is the part I got wrong
38 matches
Mail list logo