-BEGIN PGP SIGNED MESSAGE-
Kenneth Whistler wrote:
And StandardizedVariants.html has been updated again, with more
of the missing glyphs provided.
I can't see any difference between plain U+2278 (either in the draft
code chart or StandardizedVariants.html) and U+2278 with VS1.
Is
Hi Rajat,
Any solutions to handle the same ( or in other words to compare 2 Japanese
strings written in different scripts or by mixture of two scripts) ??
It is definitely a non-trivial task.
If you just want to transform a katakana string into hiragana (or vice
versa), it is very easy. But
Michael Kaplan wrote:
We rob NO ONE. We behave with honor and we wish others to do the same with
us.
Its a respect thing.
For sure. But you understand that this is politics as well. Many aspects of
copyright and intellectual property, and even the very concept of private
property, are still
From my database with roughly 50.000 lexical
entries (compounds) I get a
number of 1431 compounds with
at least two readings and 71 with at least three
readings.
If I include also 60.000 personal and local names
I get 6840 compounds with two or more, 1344 with three
or more and 348 with four or
On Thu, 24 Jan 2002, Patrick Andries wrote:
John Cowan wrote:
Patrick Andries scripsit:
Let's assume I want to transliterate a large Wade-Giles database into
pinyin. It this a purely algorithmic process? For all nouns ? Common and
proper (cf. Chiang Kai-Shek vs Jiang Jeshi )? Even for
Dear Prof. Genenz,
From my database with roughly 50.000 lexical
entries (compounds) I get a
number of 1431 compounds with
at least two readings and 71 with at least three
readings.
Taking only into account compounds with multiple readings.
But imagine this: If a program had merely access to
At 12:46 -0800 2002-01-24, Kenneth Whistler wrote:
This heads immediately into a rathole where any scheme of dynamic
notation for anything whatsoever becomes a candidate for character
encoding.
Any candidate for encoding has to meet certain criteria. Like Klingon
didn't. One of those
Hm, I still don't see 'em all -- letf column images starting at U+2A3C are
still missing.
Bob
On 22-01-2002 05:02:24 Mark Davis wrote:
I sent the message out somewhat prematurely -- the images in the first
column (the normal representative glyphs) should be there tomorrow.
Mark
At 06:29 AM 1/24/02 +, David Hopwood wrote:
Kenneth Whistler wrote:
And StandardizedVariants.html has been updated again, with more
of the missing glyphs provided.
I can't see any difference between plain U+2278 (either in the draft
code chart or StandardizedVariants.html) and U+2278
Michael Everson wrote:
Any candidate for encoding has to meet certain criteria. Like Klingon
didn't. One of those criteria would be doable. Another would be
meets user requirements. A priori rejection of things makes me
nervous, though.
Yeah. I agree that a priori rejection of Labanotation,
- Original Message -
From: Michael Everson [EMAIL PROTECTED]
To: Patrick Andries [EMAIL PROTECTED]
Cc: David Starner [EMAIL PROTECTED]; [EMAIL PROTECTED]
Sent: Wednesday, January 23, 2002 12:35 AM
Subject: [Very-OT] Re: ü
snip
Garçon in Oxford English Dictionary but garconnière
At 06:29 AM 1/24/02 +, David Hopwood wrote:
Kenneth Whistler wrote:
And StandardizedVariants.html has been updated again, with more
of the missing glyphs provided.
Can anyone send me the URL for this chart? I can't seem to find it.
Bob,
Hm, I still don't see 'em all -- letf column images starting at U+2A3C are
still missing.
They're all there. I just checked. Try reloading the page.
--Ken
Bob
On 22-01-2002 05:02:24 Mark Davis wrote:
I sent the message out somewhat prematurely -- the images in the first
Yuri Yarmola has written to me again to say that he is working on 4-byte
cmap support, but needs an existing font with such a cmap in order to test
his import function. Does anyone have a font with Plane One characters
encoded in such a cmap? If so, please contact Yuri directly at
[EMAIL
Asmus Freytag scripsit:
While VARIATION SELECTOR is the formal name of the character (and therefore
fixed), referring to the selected thing as a 'variation' sounds really
odd, that's why the more common term 'variant' is used all over the place.
Perhaps we ought to make them formally
From: "Michael \(michka\) Kaplan" [EMAIL PROTECTED]
To: "$B$m!;!;!;!;(B $B$m!;!;!;(B" [EMAIL PROTECTED]
Subject: Re: Microsoft's Japanese IME has no Unicode option
Date: Fri, 25 Jan 2002 10:15:32 -0800
You are wrong.
$B$o$!$!$!$!$!$C$C$C$C!*!*!*(B
If that is so, how to I get the thing
If you use a scripting language like VBScript or JScript, it is converted to
Unicode for of the strings in your code. I have explained that you can
actually see Unicode in a particular scenario, but it is not going to
convert your pages for you or anything like that.
Rather than complaining about
John Hudson asked,
As Unicode continues to grow, I wonder if we can expect another book--
or
multiple volumes -- at some stage, or if the standard will become a
purely
electronic document? Has any decision been taken about this?
Speaking in my official capacity as editor, the answer is yes,
From: "$B$m!;!;!;!;(B $B$m!;!;!;(B" [EMAIL PROTECTED]
Okay, here's the scoop: I have a page with some (poorly
written) Japanese in it, and it is in Unicode. I want to be
able to edit the page without having to port the whole
doggone thing into Unipad and then curse when I can't
use my
Julie said:
As to the form and timing of 5.0,
that would be pure speculation at this point. Someone else on the
committee might be willing to speculate, but I won't!
Ummm...
Unicode 5.0 will be published on December 22, 2007,
in DVD3 holographic format, complete with a remastered
Unicode
On Fri, Jan 25, 2002 at 11:31:19AM -0800, Julie Allen wrote:
Speaking in my official capacity as editor, the answer is yes, you can
expect another book. The editorial committee is already hard at work on
4.0, which we expect to publish as one volume.
So are you worried about 4.0 being 2,000
Ken let the cat out of the bag:
Unicode 5.0 will be published on December 22, 2007...
complete with a remastered Unicode hymn...
It's true. We've already booked an Abbey Road studio for five days in
March 2007, and we've signed 75 of the hottest young voices in the world to
be in the
On 01/23/2002 02:50:58 AM John Hudson wrote:
The problem for Win 9x users, even with current browsers, is lack of a
system installed Devanagari font with OpenType layout tables. The version
of Arial Unicode that ships with pre-XP versions of Windows does not
contain layout tables for Indic
At 12:56 1/25/2002, [EMAIL PROTECTED] wrote:
The version of Arial Unicode MS on my system does have layout tables for
Devanagari. I don't know with what product this version was introduced to
my system -- I've got Win2K, IE5.5 and Office XP.
Interesting. What's the file date on that font?
John
There is a quite simple way to do what you want:
If you want to input directly into an HTML form on the Geocities site, all
you have to do is pull down your "view" menu (I presuppose IE here) and
choose "UTF-8" from the Encoding submenu. Since Geocities doesn't send a
META tag, your browser will
- Original Message -
From: Marco Cimarosti [EMAIL PROTECTED]
To: 'Berthold Frommann' [EMAIL PROTECTED]; Rajat Bawa
[EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]
Sent: den 25 januari 2002 12:06
Subject: RE: Multiple script Handling (kanji - kana)
In plain-text unicode, furigana might be
At 11:31 AM 1/25/02 -0800, Julie Allen wrote:
John Hudson asked,
As Unicode continues to grow, I wonder if we can expect another book--
or
multiple volumes -- at some stage, or if the standard will become a
purely
electronic document? Has any decision been taken about this?
There are
On Fri, Jan 25, 2002 at 11:31:19AM -0800, Julie Allen wrote:
Speaking in my official capacity as editor, the answer is yes, you
can
expect another book. The editorial committee is already hard at work
on
4.0, which we expect to publish as one volume.
So are you worried about 4.0 being
In my Love Hina vol 7, $B@iG/(B has furigana $B%_%l%K%"%`(B.
Just thought you might wanna know.
_
$B%a!<%k%5!<%S%9$O!"@$3&(B No.1 $B$N(B MSN Hotmail
$B$G!*(Bhttp://www.hotmail.com/JA/
- Original Message -
From: ろ ろ〇〇〇 [EMAIL PROTECTED]
To: [EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]
Sent: den 25 januari 2002 23:23
Subject: Furigana can be katakana
In my Love Hina vol 7, 千年 has furigana ミレニアム.
In cases such as ?瑞典?スウェーデン? (is the furigana encoded correctly?)
Julie We're estimating that 4.0 will be roughly 1500 pages, which the
Julie publisher says is not a problem for one volume. Now whether you can
Julie carry it with one hand is a different question. :-)
We Unicode accolytes have a rule that requires using both hands when carrying
the
Here I am in the list
-Original Message-
From: Stefan Persson [mailto:[EMAIL PROTECTED]]
Sent: Friday, January 25, 2002 4:49 PM
To: Marco Cimarosti; 'Berthold Frommann'; Rajat Bawa
Cc: [EMAIL PROTECTED]
Subject: Re: Multiple script Handling (kanji - kana)
- Original Message -
- Original Message -
From: [EMAIL PROTECTED]
The version of Arial Unicode MS on my system does have layout tables for
Devanagari. I don't know with what product this version was introduced to
my system -- I've got Win2K, IE5.5 and Office XP.
I guess the question becomes, which
Hi all,
Does anybody know if there exists mappings from any of the iMode
(DoCoMo) vendor-specific character codes to Unicode? The iMode
characters (164 of them) exist in Shift-JIS from 0xF89F to 0xF9AF.
Most of them would be considered dingbats. Yes, I could eyeball the
glyphs and come up
-BEGIN PGP SIGNED MESSAGE-
Asmus Freytag wrote:
At 06:29 AM 1/24/02 +, David Hopwood wrote:
Kenneth Whistler wrote:
And StandardizedVariants.html has been updated again, with more
of the missing glyphs provided.
I can't see any difference between plain U+2278 (either in
On 01/25/2002 04:45:04 PM Mark Leisher wrote:
Mind you, revealed wisdom should never exceed 4.5 kilograms in weight (in
Earth-normal gravity)
Uh, Mark? Kilograms are units of mass, not weight, so something that's 4.5
kilograms or less will be 4.5 kilograms or less whether in Earth-normal
Hello Unicode list members,
Unicode now has a serious competitor. Please read
about it at www.bytext.org. Everyone on this list
should find it extremely interesting.
I hope people concerned with Unicode see this as an
opportunity for growth. I think you will find Bytext
to be a superior
At 13:48 1/25/2002, Julie Allen wrote:
We're estimating that 4.0 will be roughly 1500 pages, which the
publisher says is not a problem for one volume. Now whether you can
carry it with one hand is a different question. :-)
Please try to ensure a sturdy binding. The binding of 3.0 is a little
Unicode now has a serious competitor. Please read
about it at www.bytext.org. Everyone on this list
should find it extremely interesting.
Goll dang! Just what ah've bin waitin' fer!
Code points is gettin' way too expensive in Unicode,
so I sure hope bytext is sellin' 'em cheaper.
Yer ol' pal,
On 01/25/2002 03:17:59 PM John Hudson wrote:
The version of Arial Unicode MS on my system does have layout tables for
Devanagari. I don't know with what product this version was introduced
to
my system -- I've got Win2K, IE5.5 and Office XP.
Interesting. What's the file date on that font?
Nov
From Bernard's personal site:
There are so many people smarter than me
Indeed.
But few who are so presumptuous to believe that they are a serious
competitor on such a basis? Though I can offer you a deal on personalized
tutorials to help you with your misconceptions on Unicode, though it may
On Fri, Jan 25, 2002 at 08:33:15PM -0800, Bernard Miller wrote:
Unicode now has a serious competitor. Please read
about it at www.bytext.org. Everyone on this list
should find it extremely interesting.
Let's see. Bytext has no corporate supporters, nor is it supported by
any standards
In a message dated 2002-01-25 20:45:46 Pacific Standard Time,
[EMAIL PROTECTED] writes:
Unicode now has a serious competitor. Please read
about it at www.bytext.org. Everyone on this list
should find it extremely interesting.
I just downloaded the PDF file and spent about 10 minutes
At 10:58 PM 1/24/02 +, David Hopwood wrote:
One possibility is to make VS1 specify what is now the reference glyph,
and VS2 specify the alternate glyph. Unmarked would mean either.
Boy, great minds do think alike. I proposed that in a paper to the UTC
last year. ;-)
You realize that this
So, is there a script -- something along the lines of the dialectizer
recently mentioned here -- that automatically generates 'Competitor to
Unicode' websites? I wonder, because they all make the same set of claims,
display the same confusion about or misrepresentation of Unicode, and offer
At 08:33 PM 2002-01-25 -0800, Bernard Miller wrote:
Hello Unicode list members,
Unicode now has a serious competitor. Please read
about it at www.bytext.org. Everyone on this list
should find it extremely interesting.
Juvenal said It is hard NOT to write satire.
But in future I suggest you
Unicode now has a serious competitor.
Kllhk!! Kllhk!! Kllhk! Whoa! Almost choked on my tofu burger!
Oh dewd, you have it so, like, all wrong... Universal character encoding
isn't about Competition and Marketing, it's about everybody doin' it in the
road, all together like, in love, peace,
47 matches
Mail list logo