Hello all,
Im implementing a new feature that support unicode characters
for printer drivers in UI. I have been going through some of the
documents in this regard.
1) I wanted to know what are diacritical characters?
2) In which of the languages are these characters used?
3) Which of the
Hi,
I am
presently working to write a few algorithms to match 2 input Japanese strings
and see if they match properly.
The
problem i am facing is that input strings can be written in any Japanese scripts
(Kanji, Katakan, Hiragana)and 2 strings written in different Japanses
scripts can
At 21:11 -0500 2002-01-23, Patrick Andries wrote:
In the first edition of this dictionary it was said that in many
compounds whose second element begins with h the h is silent unless
the accent falls on the syllable that it begins; thus philhellenic
and philharmonic should not sound the h; in
This is the kind of mess that has discouraged anybody from doing a
systematic survey of simplifications for the Unihan database.
Part of this is because there is the orthogonal complexity of
variant TC forms. Before converting TC to SC, one should resolve
all TC variants to the most
A 08:13 2002-01-23 -0500, John Cowan a écrit :
Middle French spelling is very unphonemic. This is the so-called
aspirated h, which still blocks liaison even though it is
quite silent now.
[Alain] Not only quite, but absolutely mute, one must not be so shy. We
use the word aspirated to
A 16:18 2002-01-23 -0800, Yves Arrouye a écrit :
Obviously (I advocate in French changing the spelling of common foreign
words so that there would be more consistency).
Le ouiquende?
That would be pronounced wikãd... To respect the English pronunciation
you would have to write it
From: Dinesh Agarwal [EMAIL PROTECTED]
5. Following suggestions from this list, a Unicode
Hindi-specific font (Mangal) was obtained (from
private sources) and installed in Control Panel -
Fonts.
Please be aware of the fact that those private sources have perhaps
engaged in piracy by giving
Many have responded:
Meanwhile, it is true that there are simplified characters which
correspond to more than one traditional form.
...
This is the kind of mess that has discouraged anybody from doing a
systematic survey of simplifications for the Unihan database.
...
Before converting TC
On Thursday, January 24, 2002, at 09:39 AM, [EMAIL PROTECTED] wrote:
Currently on the IDN mailing list there is a big debate over this topic.
It
is well known that ASCII-based domain names are matched in the DNS in a
case-insensitive manner. Many people recognize that Chinese readers
-Original Message-
From: Robin Sackmann [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, January 22, 2002 4:24 AM
To: [EMAIL PROTECTED]
Subject: Unicode Tibetan
Dear Sirs,
when I try to use the Tibetan characters contained in the Arial Unicode
MS
font in Microsoft Word 2002, the vowel
-Original Message-
From: Whit Gurley [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, January 23, 2002 8:47 AM
To: [EMAIL PROTECTED]
Subject: Running out of options...
Hello!
[...]
Now, I understand how ASCII text encoding works and therefore have a
pretty good understanding of how
On Thu, 24 Jan 2002, John H. Jenkins wrote:
However, this is already a problem in Unicode. shuowen.org will have to
register both U+8AAAU+6587.org and U+8AACU+6587.org; Jingwa,
Inc., will need both U+4E3CU+86D9 and U+4E95U+86D9.
U+8AAA and U+8AAC are given on p. 265 of TUS3.0 as an
At 21:32 1/23/2002, Dinesh Agarwal wrote:
Is Microsoft doing anything to improve their publically distributed
version of Arial Unicode MS? The UTC should press them to do so. Also,
does anyone know if the Mangal font is available in the public domain?
Microsoft should be asked by the UTC to
[...] but do we rob the rich man because he is rich?
(I hope everyone thinks the answer here is no!).
I have been told that this mailing list is not for politics, so let's stick
to bare logics:
1. we rob the poor man because he is poor
2. we rob the poor man because he is rich
3. we rob the
Hi,
I was refreshing my memory about Maya numerals: http://www.michielb.nl/maya/math.html
and started thinking how would one do these in Unicode, given the funny stacking.
NOTE: the following is most definitely an encoding suggestion/request for
the Maya numerals, firstly
This is because Arial Unicode MS does not have any of the OpenType
information that would be necessary for the shaping in the Tibetan script. I
do not know of any fonts that currently have this information, but I assume
it is only a matter of time before there are some.
MichKa
Michael Kaplan
Doug Ewell wrote:
Currently on the IDN mailing list there is a big debate over
this topic. It is well known that ASCII-based domain names
are matched in the DNS in a case-insensitive manner. Many
people recognize that Chinese readers who are familiar with
both TC and SC consider text
We rob NO ONE. We behave with honor and we wish others to do the same with
us.
Its a respect thing.
MichKa
Michael Kaplan
Trigeminal Software, Inc. -- http://www.trigeminal.com/
- Original Message -
From: Marco Cimarosti [EMAIL PROTECTED]
To: 'Michael (michka) Kaplan' [EMAIL
Hi, Michael
By 'we', do you mean Microsoft? If so, I'm surprised you identify yourself
so closely with that corporation - I thought you were an independent
contractor...
As for the question of whether Microsoft has robbed anyone, I would say that
there are a couple of cases before the courts
On Thursday, January 24, 2002, at 11:44 AM, Thomas Chan wrote:
On Thu, 24 Jan 2002, John H. Jenkins wrote:
However, this is already a problem in Unicode. shuowen.org will have
to
register both U+8AAAU+6587.org and U+8AACU+6587.org; Jingwa,
Inc., will need both U+4E3CU+86D9 and
At 10:44 1/24/2002, Marco Cimarosti wrote:
I have been told that this mailing list is not for politics, so let's stick
to bare logics:
1. we rob the poor man because he is poor
2. we rob the poor man because he is rich
3. we rob the rich man because he is poor
4. we rob the rich man because he
From: Marco Cimarosti [EMAIL PROTECTED]
I have been told that this mailing list is not for politics, so let's
stick
to bare logics:
1. we rob the poor man because he is poor
2. we rob the poor man because he is rich
3. we rob the rich man because he is poor
4. we rob the rich man
I would encode Mayan using the same model we will use for Egyptian.
The individual characters cluster groups much as Egyptian does,
though the font rendering is much more complex.
--
Michael Everson *** Everson Typography *** http://www.evertype.com
From: Rick Cameron [EMAIL PROTECTED]
By 'we', do you mean Microsoft?
No, I mean we as in we, the members of the human race on this spaceshhip
Earth. WE behave appropriately as that as what a good person does. Is we
act badly, then we should be ashamed and keep it to ourselves.
If so, I'm
Hi, MichKa et al.
I certainly didn't intend to suggest that it's OK to rob someone (or
something) if you believe they (or it) has robbed others. My statement was
predicated on the erroneous assumption that you, MichKa, were asserting that
Microsoft has not robbed anyone; and I just wanted to
I would just encode the 20 numerals. However, nobody has yet come up with
a comprehensive proposal, so I would defer any discussion to the point at
which some expert(s) have an opinion about the script in general.
Rick
On Thursday, January 24, 2002, at 12:29 PM, John Cowan wrote:
John H. Jenkins wrote:
{TC1, SC1, SC2, TC2, TC3, SC3} constitute a Han simplification
class (HSC), and are all the same when appearing in IDNs.
Correct?
Oui.
The caveat is that this must be understood to be a first-order,
Can someone tell me what the current status of encoding Sanskrit (Vedic)
characters in the UCS is? Have any formal proposals been made yet or are any
being planned? Also, what about the alternate shapes for certain Devanagari
numerals like 9 that are mentioned at:
Yup, encoding the 20 numerals would probably be the easiest solution.
-Original Message-
From: ext Rick McGowan [mailto:[EMAIL PROTECTED]]
Sent: Thursday, January 24, 2002 14:41
To: [EMAIL PROTECTED]
Subject: Re: Maya numerals
I would just encode the 20 numerals. However, nobody has
John H. Jenkins wrote:
Well, first of all, the UTC is already on record as refusing to encode
new SC separately.
Secondly, we would break IDN equivalence. If we add a new SC which is
equivalent to two TC,
By your previous graf, that can't happen; so it must be adding a new TC
(off a
Please be aware of the fact that those private sources have perhaps
engaged in piracy by giving you this font, which they have no right to
redistribute. This list hosts the company who was the victim of the
piracy -- a big company, to be sure, but do we rob the rich man because he
is rich? (I
Let's assume I want to transliterate a large Wade-Giles database into
pinyin. It this a purely algorithmic process? For all nouns ? Common and
proper (cf. Chiang Kai-Shek vs Jiang Jeshi )? Even for dialectal words?
Would any data in the Unicode database help me in this process?
Patrick
Maurice Bauhahn wrote :
No, FrameMaker 6.0 does not support Unicode.
So if I get it right for FrameMaker6.0+SGML to support Chinese I need a Chinese OS?
And to support French (spelling, hyphenation) and Chinese? Can I use a French
FrameMaker+SGML
on a Chinese OS?
This is a sore point
Patrick,
Let's assume I want to transliterate a large Wade-Giles database into
pinyin. It this a purely algorithmic process? For all nouns ? Common and
proper (cf. Chiang Kai-Shek vs Jiang Jeshi )? Even for dialectal words?
Mostly, but not completely. (Note that Kai-Shek is not standarard
Unicoders:
The Unicode 3.2 BETA directory has been updated again,
to complete filling some last minute gaps. In particular:
ftp://www.unicode.org/Public/BETA/Unicode3.2/BidiMirroring-3.2.0d2.txt
ftp://www.unicode.org/Public/BETA/Unicode3.2/UnicodeData-3.2.0d8.txt
These complete the drafting of
Patrick Andries scripsit:
Let's assume I want to transliterate a large Wade-Giles database into
pinyin. It this a purely algorithmic process? For all nouns ? Common and
proper (cf. Chiang Kai-Shek vs Jiang Jeshi )? Even for dialectal words?
Chiang Kai-Shek isn't Wade-Giles; it isn't even
As Unicode continues to grow, I wonder if we can expect another book-- or
multiple volumes -- at some stage, or if the standard will become a purely
electronic document? Has any decision been taken about this?
John Hudson
Tiro Typeworks www.tiro.com
Vancouver, BC [EMAIL
John Cowan wrote:
[EMAIL PROTECTED]">
Patrick Andries scripsit:
Let's assume I want to "transliterate" a large Wade-Giles database into pinyin. It this a purely algorithmic process? For all nouns ? Common and proper (cf. Chiang Kai-Shek vs Jiang Jeshi )? Even for "dialectal" words?
-BEGIN PGP SIGNED MESSAGE-
Alain LaBonté wrote:
A 08:13 2002-01-23 -0500, John Cowan a écrit :
Middle French spelling is very unphonemic. This is the so-called
aspirated h, which still blocks liaison even though it is
quite silent now.
[Alain] Not only quite, but absolutely
40 matches
Mail list logo