Re: Counting rods alternate forms

2016-08-17 Thread Richard Cook
On Aug 13, 2016, at 2:06 PM, eduardo marin wrote: > > It is well known that the southern song style of counting rods, had different > forms for the digits 4, 5 and 9 https://en.wikipedia.org/wiki/Counting_rods , > however currently there is no way to represent such forms, 〤[U+3024] 〥[U+3025] 〩

Re: Emoji characters for food allergens

2015-07-28 Thread Richard Cook
On Jul 28, 2015, at 8:56 AM, Asmus Freytag wrote: > >> On 7/28/2015 8:07 AM, Richard Cook wrote: >>> On Jul 28, 2015, at 7:53 AM, Doug Ewell wrote: >>> Richard Cook wrote: >>> >>>> And, what is the emotion playfully expressed by 🍔🍟 ? >>&

Re: Emoji characters for food allergens

2015-07-28 Thread Richard Cook
On Jul 28, 2015, at 7:53 AM, Doug Ewell wrote: > > Richard Cook wrote: > >> And, what is the emotion playfully expressed by 🍔🍟 ? > > "I'm having a burger and fries for lunch but can't be bothered to type > all that into this text message lol" >

Re: Emoji characters for food allergens

2015-07-28 Thread Richard Cook
On Jul 28, 2015, at 6:00 AM, Michael Everson allegedly wrote: > > Emojis are not for labelling things. They’re for the playful expression of > emotions. Is that what they're for? I thought they were (encoded) to satisfy certain device manufacturers. And, what is the emotion playfully expresse

Re: vexillology, was: Adding RAINBOW FLAG to Unicode

2015-07-07 Thread Richard Cook
On Jul 7, 2015, at 7:53 AM, Richard Cook wrote: > > Ken Whistler wrote: >>> vexillology > > >> Garth Wallace wrote: >> >> Tangentially, I recently ran across something called International >> Flag Identification Symbols. It's a symbolic notat

vexillology, was: Adding RAINBOW FLAG to Unicode

2015-07-07 Thread Richard Cook
Ken Whistler wrote: >> vexillology > Garth Wallace wrote: > > Tangentially, I recently ran across something called International > Flag Identification Symbols. It's a symbolic notation for vexillology > that describes their use of flags and some aspects of their design but > not enough to reprod

Re: Adding RAINBOW FLAG to Unicode

2015-06-30 Thread Richard Cook
> On Jun 30, 2015, at 9:11 AM, Garth Wallace wrote: > > I don't think display of U+1F308 as a rainbow flag would be expected > behavior. It risks turning a text like "It's a beautiful day! " into a > political statement. Garth, Any statement can be a political statement, in the right con

Re: Adding RAINBOW FLAG to Unicode

2015-06-29 Thread Richard Cook
Ken, I know that U+1F308 is RAINBOW ... because my nameslist lookup tool tells me so ... T C UTF-8 Codepoint : Name : Annotations 1 🌈 C2_A0 1F308 RAINBOW ... but could 🌈 also be a 'rainbow (flag)'

Biang,was: And what happened to...

2014-10-07 Thread Richard Cook
On Oct 7, 2014, at 5:23 PM, Mark E. Shoulson wrote: > > The infamous Biang-Biang Noodle Mark, You seem to know as much as anyone about biang. All I can say is, biang is attested in tones 2, 4 and 1, and enshrined (along with a glyph variant) in Wenlin CDL PUA at U+E999, with 51 or 57 strokes

Re: Is this the oldest d20 on Earth?

2014-09-20 Thread Richard Cook
On Sep 20, 2014, at 5:35 PM, Jonathan Coxhead wrote: > > Here's an icosahedral dice from the Ptolemaic period: > > http://www.metmuseum.org/collection/the-collection-online/search/551070 > > I find myself idly wondering whether the identities of the characters are all > known and encoded ...

Re: Discrepancies between kTotalStrokes and kRSUnicode in the Unihan database - repost all ascii

2014-09-09 Thread Richard COOK
>> On Sep 9, 2014, at 8:28 AM, Richard COOK wrote: > On Sep 8, 2014, at 12:03 PM, John Armstrong > wrote: Mr. Armstrong, I see that my reply to your message bounced from the main Unicode list, due to length constraints. At any rate, the message did go through on the Unihan

Re: Corrigendum #9

2014-07-04 Thread Richard COOK
On Jul 3, 2014, at 1:48 PM, Asmus Freytag wrote: > On 7/3/2014 11:02 AM, Richard COOK wrote: >> On Jul 2, 2014, at 8:02 AM, Karl Williamson wrote: >> >>> Corrigendum #9 has changed this so much that people are coming to me and >>> saying that inputs may very w

Re: Corrigendum #9

2014-07-03 Thread Richard COOK
On Jul 2, 2014, at 8:02 AM, Karl Williamson wrote: > Corrigendum #9 has changed this so much that people are coming to me and > saying that inputs may very well have non-characters, and that the default > should be to pass them through. Since we have no published wording for how > the TUS wil

Re: Bidi Brackets for Dummies

2014-04-24 Thread Richard COOK
On Apr 24, 2014, at 2:16 PM, Whistler, Ken wrote: > Given the incredible level of interest shown on this list during > the last week, I am glad that I can finally announce the publication > of Bidi Brackets for Dummies: > Dear Dr. Ken, Thanks ever so much

Re: CJK stroke order data: kRSUnicode v. kRSKangXi

2014-03-12 Thread Richard COOK
On Mar 12, 2014, at 2:59 AM, Adam Nohejl wrote: >> >> Since kRSUnicode is a Normative property, a formal proposal to modify that >> data is required, for review in WG2. I have added notes on the items you >> mention below, for consideration in that process, and in the meantime, if >> you identi

Re: ?MP = Multi*lingual* plane?

2014-03-10 Thread Richard COOK
On Feb 27, 2014, at 7:23 AM, Michael Everson wrote: > On 27 Feb 2014, at 02:32, Shriramana Sharma wrote: > >> Given that Unicode encodes scripts and not languages, how appropriate is it >> to call the BMP and the SMP as the multi*lingual* planes? > > You are more than two decades late in aski

Re: CJK stroke order data: kRSUnicode v. kRSKangXi

2014-03-10 Thread Richard COOK
Mr. Nohejl, About the property data you mention below. kRSUnicode property data permits multiple/variant (space-delimited) radical/stroke values, and I think we will see important variants added in the future. Where a specific value attested in a specific Kangxi edition is missing from kRSUnico

Re: Simplified Chinese radical set in Unihan

2004-12-19 Thread Richard Cook
On Dec 16, 2004, at 3:20 PM, Tom Emerson wrote: Ah, I don't have my copy of the Comprehensive ABC here at home with me. If you have Wenlin, you have it in electronic form. Wenlin does the typesetting (and sub-licensing) for ABC, and the ABC data is accessible from within the Wenlin app. But on t

Re: Unicode for words?

2004-12-07 Thread Richard Cook
On Dec 5, 2004, at 07:02 PM, Doug Ewell wrote: A word-based encoding for English could automatically assume spaces where they are appropriate. The sentence: "What means this, my lord?" would have seven encodable elements: the five words, the comma, and the question mark. Spaces would be automatic

Re: Unicode for words?

2004-12-05 Thread Richard Cook
On Dec 5, 2004, at 12:27 AM, Tim Finney wrote: my co-worker suggested encoding entire words in Unicode. The "word" is considerably less well-defined than the character. The set of words is open-ended. If you'd like to see where you go when you start trying to encode words, take a look at CJK Exte

script complexity, was Re: OpenType vs TrueType (was current version of unicode-font)

2004-12-04 Thread Richard Cook
On Dec 4, 2004, at 12:15 PM, John Hudson wrote: I think Peter's point was that complex script require font layout tables Script complexity is not so easily quantified. Has anyone tried to sort scripts by complexity? In terms of the present discussion, Han would be viewed as a simple script, and

Re: current version of unicode-font

2004-12-02 Thread Richard Cook
On Thu, 2 Dec 2004, John Cowan xiele: > Paul Hastings scripsit: > > > speaking of which, *are* there any open source fonts that come even > > close to Arial Unicode MS? > > In what, breadth of coverage or aesthetics? The GNU Unifont has very > wide coverage though it is a bitmap font; James Kass'

RE: Ideograph?!?

2004-11-30 Thread Richard Cook
On Mon, 29 Nov 2004, Kenneth Whistler opined contemplatively: > Allen Haaheim provided some further detailed clarification: > > > Note that Han characters are logographic, not ideographic. That is, > > they are graphemes that represent words (or at least morphemes), > > not ideas. > > This correct

Re: Ideograph?!?

2004-11-29 Thread Richard Cook
The term ideograph has special meaning in Unicode/ISO usage. "Ideograph" is short for "CJK Unified Ideograph", and is one of the characters with mapping or reference data in the Unihan.txt database. Likewise, "Radical" has special meaning. CJK Radicals are found in two places, in the "Kangxi Radic

Re: outside decomposed, inside precomposed

2004-10-14 Thread Richard Cook
On Oct 13, 2004, at 1:42 PM, Eric Muller wrote: Going back to the original scenario, to make my point clearer: System A, a subset of FileMaker, has {U+0065, U+0303, U+1EBD} as its repertoire. When presented with the input , it produces the output . System B, my rendering system, has {U+0065, U+0

Re: outside decomposed, inside precomposed

2004-10-13 Thread Richard Cook
Jon, Thanks for your reply. On Oct 13, 2004, at 3:15 AM, you wrote: imported UTF-8 sequences like [U+0065][U+0303] get remapped internally to [U+1ebd] LATIN SMALL LETTER E WITH TILDE. Is this kind of behavior what one would expect? That's conformant, if it causes problems with any other process (

outside decomposed, inside precomposed

2004-10-12 Thread Richard Cook
Using a certain newly "Unicode-aware" database application which shall remain nameless (FileMaker 7): imported UTF-8 sequences like [U+0065][U+0303] get remapped internally to [U+1ebd] LATIN SMALL LETTER E WITH TILDE. Is this kind of behavior what one would expect? It's problematic (and buglik

RE: Doulos SIL (was: French typographic thin space)

2004-04-07 Thread Richard Cook
On Wed, 7 Apr 2004, Peter Constable wrote: > They were encoded that way some while before they were accepted in > Unicode. Also, until Unicode 4.1 is published, there is a possibility > that codepoints may change. I see. I assumed the codepoint assignments were already firm.

Re: Tai Xuan Jing Symbols, any background information ?

2003-10-12 Thread Richard Cook
, 2003, at 15:55 US/Pacific, Richard Cook wrote: The English TXJ names come from Michael Nylan's book. You'll have to find that book to learn what she meant. Or better, get a copy of the Chinese original. -Richard On Saturday, Oct 11, 2003, at 13:28 US/Pacific, Patrick Andries wrote:

Re: Tai Xuan Jing Symbols, any background information ?

2003-10-11 Thread Richard Cook
The English TXJ names come from Michael Nylan's book. You'll have to find that book to learn what she meant. Or better, get a copy of the Chinese original. -Richard On Saturday, Oct 11, 2003, at 13:28 US/Pacific, Patrick Andries wrote: Would anyone know where I could find some background in

Re: TAI NÜA , TAI LE

2003-09-11 Thread Richard Cook
Gedney says "nuea"/"nü" is a Thai word for 'north/northern' ... looks as if the syllable in this name gets written many different ways ... le, lu, lü, lüe, lue, nü, nüa, nüe, neua, nuea ... at least it's possibly the same syllable. Here are some references: Gedney, William J. 1976. "Notes on T

Re: TAI NÜA , TAI LE

2003-09-11 Thread Richard Cook
On Thursday, Sep 11, 2003, at 10:45 US/Pacific, Michael Everson wrote: At 10:02 -0700 2003-09-11, Richard Cook wrote: I'm guessing that "Tai Le" would be the exonym (Chinese name), while "TAI NÜA" is the autonym. Don't guess. The Chinese name is Dehong Dai. Wel

Re: TAI NÜA , TAI LE

2003-09-11 Thread Richard Cook
On Thursday, Sep 11, 2003, at 09:42 US/Pacific, Michael Everson wrote: At 11:04 -0400 2003-09-11, Patrick Andries wrote: Does TAI LE, encoded in Unicode 4.0, refer to the same language as TAI NÜA ? Yes. If so, isn't TAI NÜA the most frequently used form of this language ? According to the Ethnol

Re: missing .GIF's for ideographs on unicode.org?

2003-07-16 Thread Richard Cook
"Ostermueller, Erik" wrote: > > I apologize if you all have already discussed this. > > At unicode.org, when I click this link, > > http://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=2 > > I'm expecting to see a little square GIF that displays U+2. > Instead, I see "N/A". > > Sh

Re: Chinese language support for Unicode

2003-07-08 Thread Richard Cook
Sourav, You wrote: > > Hi All, > > Does Unicode support both Simplified as well as Traditional Chinese ? > Yes, it does, though the Simplified support is rather lacking in comparison with the Traditional, since the Traditional characterset is rather large, if not completely open-ended, and simp

Re: Problem with Arial Unicode MS font for BOLD/ITALICS in PDF

2003-06-20 Thread Richard Cook
On Friday, June 20, 2003, at 02:44 , Kenneth Whistler wrote: What is true is that use of italicized text is unusual in Chinese or Japanese body text--certainly not with the frequency or same range of functions as occurs in Latin typography. Bold text is not that unusual, however. In precomputer Ch

Re: Unicode not in Quark 6

2003-06-20 Thread Richard Cook
Michael Everson wrote: > > I wonder what Quark would do if we all wrote to [EMAIL PROTECTED] to > ask for Unicode support. > Good idea. I just did. But, Quark is just the tip of the iceberg. I still need a good (Mac OS X) database that can do Unicode Chinese (including supplemental planes). Any r

Re: Ext-B fonts updated

2001-10-17 Thread Richard Cook
James Kass wrote: > > Richard Cook wrote: > > > > > > > > Are there any instructions for reporting errata such as the glyphs > > > > at U+29FD7 and U+29FCE being identical? > > > > > > [U+29FD7] and [U+29FCE] are not identical. They

Re: Ext-B fonts updated

2001-10-17 Thread Richard Cook
> On Tuesday, October 16, 2001, at 08:00 PM, James Kass wrote: > > > Are there any instructions for reporting errata such as the glyphs > > at U+29FD7 and U+29FCE being identical? > > [U+29FD7] and [U+29FCE] are not identical. They are (admittedly rather close) graphical variants. If you want to

Re: Erratum in Unicode book

2001-07-09 Thread Richard Cook
Thomas Chan wrote: > > On Mon, 9 Jul 2001, Richard Cook wrote: > > > On a related note, I have 9000 word/char frequencies from Hanyu Pinlu > > Cidian (a mainland text; I typed the entries in back in the early 90's, > > and this is the freq data currently

Re: What should be radicals

2001-07-09 Thread Richard Cook
"Becker, Joseph" wrote: > > > Unicode is going to stick with the KangXi radical system > > There Unicode goes again, flouting the will of the people ... while > meanwhile in another thread an esteemed Unicode elder has proposed the death > radical. It's time to bring this system into the 21st C

Re: Erratum in Unicode book

2001-07-09 Thread Richard Cook
"Michael (michka) Kaplan" wrote: > > From: "John H. Jenkins" <[EMAIL PROTECTED]> > > > >Has the UNIHAN.TXT file been updated to include radical-stroke data > > >for Plane Two characters? > > > Yes. Ever since Unicode 3.1 was released. (We still don't have an > > Extension B font, however.) >

Re: Erratum in Unicode book

2001-07-09 Thread Richard Cook
"John H. Jenkins" wrote: > > At 11:29 AM -0400 7/9/01, Thomas Chan wrote: > >On Sun, 8 Jul 2001, James Kass wrote: > > > >> An ideal index for the casual or non-CJK user might be quite > >> different in approach. Perhaps the first component drawn in > > > >For the less than proficient user, I

Re: Erratum in Unicode book

2001-07-08 Thread Richard Cook
James Kass wrote: > > Richard Cook wrote: > > > "John H. Jenkins" wrote: > > > > > > It is on occasion something of an art figuring out the correct > > > radical/stroke position for a character in this kind of an index, sad > > >

Re: Erratum in Unicode book

2001-07-08 Thread Richard Cook
"John H. Jenkins" wrote: > > It is on occasion something of an art figuring out the correct > radical/stroke position for a character in this kind of an index, sad > to say. I'd say, when 2 radicals are possible, put it under both. When 3, well ... you probably get the idea ...

Re: Shavian (was: Re: UTF-17)

2001-07-04 Thread Richard Cook
Michael Everson wrote: > > At 11:10 -0700 2001-07-04, Richard Cook wrote: > >Michael Everson wrote: > >> > >> UTC approved it and there's a new document from John Jenkins and me > >> on Shavian for WG2, so it should get approved for ballotting

Re: Shavian (was: Re: UTF-17)

2001-07-04 Thread Richard Cook
Michael Everson wrote: > > UTC approved it and there's a new document from John Jenkins and me > on Shavian for WG2, so it should get approved for ballotting at the > next meeting of WG2. Hi Michael, I'm new to the idea that anyone would care to have Shavian encoded. Will you enlighten me? Bes

Re: status of Jindai scripts?

2001-07-03 Thread Richard Cook
"John H. Jenkins" wrote: > > At 8:07 PM +0200 7/3/01, Genenz wrote: > >Should one consider the Chinese oracle bone > >inscriptions (1200 BC) for entry to the unicode list? > >They really did exist. > > > > As a rule, historical scripts (in which I'll include OBI, even though > their descendant i

Re: New characters query (Hexagrams)

2001-07-03 Thread Richard Cook
Michael Everson wrote: > > At 13:59 -0700 2001-07-03, Edward Cherlin wrote: > > >>But I thought proposals for characters with decompositions into existing > >>characters are no longer being accepted. > > > >True for accented letters where the combining marks already exist, > >but I don't thin

Re: New characters query

2001-07-03 Thread Richard Cook
John Cowan wrote: > > Rick McGowan scripsit: > > > I don't think there's any point in encoding 64 hexagrams; especially when > > we have the pieces already. Use the pieces of three and position them with > > a drawing program. We don't have combining thingies for putting chess > > pieces on bo

Re: New characters query

2001-07-03 Thread Richard Cook
Another list member mentioned (off-list) the system of 9 bigrams and 81 tetragrams. These appear in the text of a book called [U+592a][U+7384][U+7d93] by [U+63da][U+96c4] Yang Xiong.(c.53BC-c.18AD). Where the 64 hexagrams are based on a binary system, the 81 tetragrams are based on a trinary s

Re: New characters query

2001-07-03 Thread Richard Cook
Rick McGowan wrote: > > I don't think there's any point in encoding 64 hexagrams; especially when > we have the pieces already. Use the pieces of three and position them with > a drawing program. We don't have combining thingies for putting chess > pieces on board squares, either. > Hi Rick,

Re: New characters query

2001-07-02 Thread Richard Cook
"John H. Jenkins" wrote: > > At 7:07 PM -0700 7/2/01, Richard Cook wrote: > >Evidence? There's ample evidence, starting c. 1000 BC, with > >[U+5468][U+6613] _Zhou Yi_ (aka _Yi Jing_ aka _I Ching_ aka _The Book of > >Changes_), an artifact of the Zhou Dy

Re: 64 Hexagrams, was re: New characters query

2001-07-02 Thread Richard Cook
"John H. Jenkins" wrote: > > At 7:07 PM -0700 7/2/01, Richard Cook wrote: > >Evidence? There's ample evidence, starting c. 1000 BC, with > >[U+5468][U+6613] _Zhou Yi_ (aka _Yi Jing_ aka _I Ching_ aka _The Book of > >Changes_), an artifact of the Zhou Dy

Re: New characters query

2001-07-02 Thread Richard Cook
Michael Everson wrote: > > At 12:33 -0700 2001-07-02, Edward Cherlin wrote: > >Has anyone proposed the following for inclusion in Unicode? If so, > >what is their status? > > > >Daoist Hexagrams, 64 forms (the trigrams are already included, but > >with no combining mechanism) > > You're welcome

Re: Book review: Cang Method

2001-06-29 Thread Richard Cook
Edward Cherlin wrote: > > I use Cangjie to access my character database, since it is usually much > faster than radical and stroke count, and I usually don't know the Chinese > pronunciation of characters I need to look up. The database gives me > Radical number, Stroke count, Chinese, Japanese,

Re: Not the Roadmap was Re: UTF-17

2001-06-23 Thread Richard Cook
Michael Everson wrote: > > At 14:52 -0700 2001-06-22, Yves Arrouye wrote: > >Isn't UTF-17 just a sarcastic comment on all of this UTF- discussion? > > I think UTF-11digit would be clearly sarcastic. UTF-17, well, I don't > know. I've been deleting the threads. Not my area. > > Didj'all like

Re: Why call kanji/hanji/hanja 'ideographs' when almost none are?

2001-06-02 Thread Richard Cook
Jon Babcock wrote: > > The Asia/East Asian/CJK thread reminded me of one of my own pet peeves, > the use of 'ideograph' to refer to kanji. > > Perhaps some of the professionals on this list can enlighten me here. I > thought that an ideograph meant that the graph stood for an idea, not a > soun

Re: Why call kanji/hanji/hanja 'ideographs' when almost none are?

2001-06-02 Thread Richard Cook
"John H. Jenkins" wrote: > > At 4:16 PM -0600 6/1/01, Jon Babcock wrote: > >The Asia/East Asian/CJK thread reminded me of one of my own pet > >peeves, the use of 'ideograph' to refer to kanji. > > > >Perhaps some of the professionals on this list can enlighten me > >here. I thought that an ideogr

Re: Radical of U+4E71

2001-05-30 Thread Richard Cook
Marco Cimarosti wrote: > So it seems there is in fact an etymological "hook" radical here, although > its modern shape has become identical to "second". The placement of [U+4e82] under a [U+4e59] [U+90e8][U+9996] goes back at least as far as [U+8aaa][U+6587] (121 AD).

ex presidents

2001-05-28 Thread Richard Cook
Anyone know which US president is [U+704c][U+6027][U+704c] ? Someone told me this (admittedly silly) joke in Japanese, with [U+85ea][U+6027][U+85ea]

Re: name this hanzi

2001-05-27 Thread Richard Cook
Thomas Chan wrote: > > On Sat, 26 May 2001, Richard Cook wrote: > > > Gaspar Sinai wrote: > > > On Sat, 26 May 2001, Richard Cook wrote: > > > > Here's a puzzle: Any idea 1.) what this character is, and 2.) if > > > > it's in Unico

Re: name this hanzi

2001-05-26 Thread Richard Cook
Japanese) > >U+9ECD > > Gaspar Sinai <[EMAIL PROTECTED]> > http://www.yudit.org/ > > On Sat, 26 May 2001, Richard Cook wrote: > > > Here's a puzzle: Any idea 1.) what this character is, and 2.) if it's in Unicode? > > > > http://linguistics.berkeley.edu/~rscook/bishop/Picture1.gif > > > >

name this hanzi

2001-05-26 Thread Richard Cook
Here's a puzzle: Any idea 1.) what this character is, and 2.) if it's in Unicode? http://linguistics.berkeley.edu/~rscook/bishop/Picture1.gif

[Fwd: www.perl.com - Larry Wall Apocalypse Two]

2001-05-03 Thread Richard Cook
at http://www.perl.com/pub/2001/05/03/wall.html Larry Wall writes: > > Perl 6 programs are notionally written in Unicode, and assume > Unicode semantics by default even when they happen to be > processing other character sets behind the scenes. Note that > when we say that Perl is written in Un

[unicode] Re: What is Unicode?

2001-03-23 Thread Richard Cook
Another web page, for your collective amusement: http://linguistics.berkeley.edu/~rscook/html/Unicode-tetralog.html

[unicode] Re: Spam being sent to the list?

2001-03-22 Thread Richard Cook
I thought Sarasvati was immune to this. Parvati?

Re: Unicode complaints

2001-03-15 Thread Richard Cook
Kenneth Whistler wrote: > > > In hunting around for negative opinions about Unicode, I've found that > > the majority of complaints relate to CJK character sets. Would listers > > agree that this is the largest area of unrest? Or is it just that people > > involved with CJK are vocal? > > Maybe

Re: Unicode market acceptance

2001-03-09 Thread Richard Cook
Tex Texin wrote: > > not the same as work for execs. The success of Unicode is obvious > to us (techies) is not clear to them. Tex, Recently looking at and talking about this http://i18n.homepage.com/UnicodeBenefits.html with some people, initiated and uninitiated, I quickly wrote this: http

Oh Unicode

2001-03-01 Thread Richard Cook
Is the Unicode anthem from the CD on a server somewhere, hopefully in mp3?

Re: CJKV ideographic, - Cantonese

2001-02-27 Thread Richard Cook
"akerbeltz.alba" wrote: > > Not kvite, the Cantonese for Kanji is "Hòn Jih" - although the term is > rather uncommon, sounding rather outlandish to Cantonese ears. "Jung Màhn > Jih" [U+4E2D] [U+6587] [U+5B57] is a lot more common. > > Cantonese, highly conservative in it's sound system generally,

Re: CJKV ideographic, was Re: Perception that Unicode is 16-bit

2001-02-27 Thread Richard Cook
Jungshik Shin wrote: > > On Tue, 27 Feb 2001, Thomas Chan wrote: > > > On Tue, 27 Feb 2001, Richard Cook wrote: > > > > > * 'chunom' in Vietnamese [similar to (i.e., analogical) Chinese characters]. > > > > If one is going to talk about Vie

Re: Kana, was re: CJKV ideographic, was Re: Perception that Unicode is

2001-02-27 Thread Richard Cook
All this talk of ideographs made me think that people on-list might enjoy this (Acrobat 4) PDF: http://linguistics.berkeley.edu/~rscook/pdf/HanKana.fp3.pdf It illustrates the evolution of the Hiragana and Katakana from Chinese characters, with kaishu (simplified and traditional) and small seal f

Re: CJKV ideographic, was Re: Perception that Unicode is 16-bit

2001-02-27 Thread Richard Cook
Thomas Chan wrote: > > There is also a similar phenomena in Chinese, called fangyanzi '"dialect" > character', which may be considered analogous to the above, the most well > known being the Cantonese ones, although others (Wu, Hakka, etc) do exist. > > [1] There is a small chance that they migh

Re: CJKV ideographic, was Re: Perception that Unicode is 16-bit

2001-02-27 Thread Richard Cook
Thomas Chan wrote: > > But is a romanized version of U+6F22 U+5B57 based on the Cantonese > pronunciation ever used in English writing the way (based on > Mandarin pronunciation) is? it could be ... it might even be used as a special term to distinguish "Cantonese Ideographs" ... > > For those

Re: CJKV ideographic, was Re: Perception that Unicode is 16-bit (was:

2001-02-27 Thread Richard Cook
Kenneth Whistler wrote: > > Doug Ewell asked, on this hopelessly wandering thread: > > > (Is > > there an English-language term for the subset of the CJK ideographic script > > that is used by a given language, say, Japanese?) > > Well, since "kanji" by now has been borrowed into English, at le

Re: Question on Unicode data files

2001-02-26 Thread Richard Cook
"John H. Jenkins" wrote: > > At 7:57 AM -0800 2/26/01, Richard Zhang wrote: > >Hello, Marco, > > > >Unihan is the official site I think. You can visit www.unihan.com.cn for > >more information about this, if you know Chinese :). Knowing Chinese is not enough. You and your browser need to know Si

Re: bijective (was re: An Absurdly Brief Introduction to Unicode (was

2001-02-24 Thread Richard Cook
Tom Lord wrote: > >> I think I'd like bijective too, if I knew what it meant. Someone? > > It would be a lot more fun to answer this question in plain-text > Unicode (using math notation) than in ASCII. > > Informally: > > "Bijective" describes a mapping between two sets. Every element of > t

Re: bijective (was re: An Aburdly Brief Introduction to Unicode (was Re:

2001-02-23 Thread Richard Cook
Mark Davis wrote: > > that must be made about what counts as an abstract character and what > > does not; and the generally acknowledged desirability of supporting > > bijective mappings between a variety of older character sets and > > while I like bijective, it is not a commonly understood ter

Re: Benefits of Unicode

2001-02-23 Thread Richard Cook
Sorry, I tuned out for a moment: is there a URL for the final version of Tex's tabulation of benefits? Also, I'd appreciate any similar links that might be used in a page of info for the uninitiated. Best, Richard

Re: Radical Index online? (was Re: Chemistry on chinesse. (CJK))

2001-01-29 Thread Richard Cook
Just a correction. Someone previously asked about http://www.wenlin.com/ and its support for Vertical Ext. A. It turns out that this support has not yet made it into the public release ... Best, Richard

Re: Benefits of Unicode

2001-01-27 Thread Richard Cook
Has anybody played devil's advocate to this, with a list of "Failings of Unicode"? Are there any? :-) This question might in fact result in a longer Benefits list > Tex Texin wrote: > > I was asked to produce a list of the benefits of Unicode to be used > as a sidebar with an article refere

Re: Radical Index online? (was Re: Chemistry on chinesse. (CJK))

2001-01-26 Thread Richard Cook
Kenneth Whistler wrote: > > > > > I cannot check now if these characters are included in Unicode as I don't > > > > have TUS handy in this moment. > > > > > > http://www.unicode.org/unicode/uni2book/u2.html (The Online Edition) > > > > > > and > > > > > > http://www.unicode.org/charts/draftunico

Re: Radical Index online? (was Re: Chemistry on chinesse. (CJK))

2001-01-25 Thread Richard Cook
John Jenkins wrote: > > On Thursday, January 25, 2001, at 03:14 AM, Pierpaolo BERNARDI wrote: > > > I was talking about the index for the hanzi's ordered by radical+strokes > > which can be found at the end of the book, since I wanted to check > > whether > > high numbered elements were there. I

Re: Radical Index online? (was Re: Chemistry on chinesse. (CJK))

2001-01-24 Thread Richard Cook
> > Kenneth Whistler wrote: > > > > > > > > I could not find the radical index. Has this been put online too? > > > > > > No. The CJK radical index was generated and printed with custom > > > software from the Unihan database. It was too much effort to try > > > to convert that software to produce

Re: Radical Index online? (was Re: Chemistry on chinesse. (CJK))

2001-01-24 Thread Richard Cook
Richard Cook wrote: > > Kenneth Whistler wrote: > > > > > > I could not find the radical index. Has this been put online too? > > > > No. The CJK radical index was generated and printed with custom > > software from the Unihan database. It was too much

Re: Radical Index online? (was Re: Chemistry on chinesse. (CJK))

2001-01-24 Thread Richard Cook
Kenneth Whistler wrote: > > > > I could not find the radical index. Has this been put online too? > > No. The CJK radical index was generated and printed with custom > software from the Unihan database. It was too much effort to try > to convert that software to produce a postable .pdf file, so t

Re: Transcriptions of "Unicode"

2001-01-16 Thread Richard Cook
Michael Everson wrote: > > >The stress is definitely on the first syllable. One does hear some normal > >generative English variations such as ˈjunəˌkoË*d. (schwa instead of > >short-i), > > The pronuncuation ['juni:ko:d] with [i:] or [i] instead of schwa irritates > me a lot. No one would pr

Re: Representation of aspiration (was: Re: Transcriptions of "Unicode")

2001-01-12 Thread Richard Cook
Kenneth Whistler wrote: > > Richard Cook surmised: > > > BTW, in a very close transcription, if one is using superscription > > (position above baseline) and relative size reduction to indicate > > aspiration, I suppose that degree of superscription or the size or bot

Re: Transcriptions of "Unicode"

2001-01-12 Thread Richard Cook
[EMAIL PROTECTED] wrote: > > On 01/12/2001 10:33:48 AM Marco Cimarosti wrote: > > >Is that k aspirated? > > It is for any English speakers I've ever met. > I think the question about aspiration of this k relates to the fact that it is at the onset of a word-medial unstressed syllable. So, alth

Re: Transcriptions of "Unicode"

2001-01-12 Thread Richard Cook
Thomas Chan wrote: > > On Thu, 11 Jan 2001, Richard Cook wrote: > > > I see 2 Traditional Chinese translations here: > > > http://www.macchiato.com/unicode/Unicode_transcriptions.html > > Which one do people like? > > > > >

Re: Transcriptions of "Unicode"

2001-01-11 Thread Richard Cook
Jon Babcock wrote: > > At first glance, I agreed. But then if the U_Chinese3.gif, gets > shortened to the last three characters, wanguo ma, as I suspect it > would in practice, I'd favor it slightly over the three-character > tongyi ma of U_Chinese2.gif. FWIW. To me, wanguo ma emphasizes the > mu

Re: Transcriptions of "Unicode"

2001-01-11 Thread Richard Cook
John Jenkins wrote: > > On Thursday, January 11, 2001, at 10:25 AM, Richard Cook wrote: > >> Which one do people like? >> >> >http://my.ispchannel.com/~markdavis//unicode/Unicode_transcription_images/U_Chinese2.gif > > Is much better. "Unified

Re: Transcriptions of "Unicode"

2001-01-11 Thread Richard Cook
I see 2 Traditional Chinese translations here: > http://www.macchiato.com/unicode/Unicode_transcriptions.html Which one do people like? http://my.ispchannel.com/~markdavis//unicode/Unicode_transcription_images/U_Chinese2.gif http://my.ispchannel.com/~markdavis//unicode/Unicode_transcription_ima

Linguistic Transcription list

2000-12-21 Thread Richard Cook
Greetings Unicoders, As people on the Unicode list obviously have considerable experience in such matters, we would like to invite your recommendations and suggestions on the following. A new Linguistic Transcription mailing list has been set up, preliminary description and subscription info as

Re: Mongolian and Uighur (was Re: I have a drem one day...)

2000-12-19 Thread Richard Cook
Kenneth Whistler wrote: > > Thus the Uighur script is the direct ancestor of the Mongolian > script, and is also a term used for the modern Mongolian script > itself, to distinguish it from Mongolian written in one of the other > scripts (including Latin and Tibetan). And the Uighur script has i

Re: Information about curly-tailed phonetic letters

2000-12-17 Thread Richard Cook
"J%ORG KNAPPEN" wrote: > > The curly-tail consonants t, d, n, l, c, z are also included in the > TeX IPA (tipa fonts). The documentation of those fonts is available > on > > ftp://ftp.dante.de/texarchive/fonts/tipa/tipaman.ps.gz > > --J"org Knappen Hi J"org, It looks as if you sent the wrong u

Chinese Support

2000-12-13 Thread Richard Cook
I've been meaning to mention this program on-list. Tom Bishop's Wenlin at http://www.wenlin.com/ is a self-contained, Mac/Win means of editing Unicode Chinese. I've heard Unicoders speak well of it before. At the last conference one presenter said in his presentation, concluding his praise of We

Re: curly-tailed phonetic letters

2000-12-08 Thread Richard Cook
This table has undergone some further revision: http://stedt.berkeley.edu/pdf/curly-tail-table3.pdf Please note in the center of the table: U+0291/U+0293 and U+0255/U+0286 These 4 may in fact be 2 pairs of functional equivalents (synographs), pointing to the same place of articulation. Accordi

Re: curly-tailed phonetic letters

2000-12-05 Thread Richard Cook
With regard to the curly-tail character set, here's a link to an IPA-style chart of this I made: http://stedt.berkeley.edu/pdf/curly-tail-table2.pdf The curly-tail series is in red. As always, comments, suggestions and corrections are welcome. Richard S.

  1   2   >