On Fri, 23 Mar 2001, Jonathan Coxhead wrote:
It would be very entertaining to do the same job with the ideographs (down
to the radical level) and count the number of atoms. I suspect the resulting
"character set" would contain less than 2000 atoms altogether.
Please do feel free
Jonathan Coxhead wrote:
A while ago, I tried to perform a similar exercise: work out which
characters in Unicode are "atomic", [...]
The result is at http://www.doves.demon.co.uk/atomic.html.
I find that it is a very interesting work. You are too humble if you
consider it just a joke.
I wrote:
I am afraid that it would be a very long work, as
entertaining as cooking stuffed arms.
Ooops! I meant "stuffed ants" ("fourmies farcis"?).
The macabre image was totally unintentional.
_ Marco
-Original Message-
From: Marco Cimarosti [mailto:[EMAIL PROTECTED]]
I wrote:
I am afraid that it would be a very long work, as
entertaining as cooking stuffed arms.
Ooops! I meant "stuffed ants" ("fourmies farcis"?).
The macabre image was totally unintentional.
I didn't
[Hoping the shubnet doesn't got this one too . . .]
WTF-8 could potentially be as compact or more compact than UTF-8 (for
Greek, Arabic ...), since much of the Latin-1 and Latin Extended A blocks
aren't needed in WCode. If you moved the other characters down to
fill that space, you might win
From: "Jonathan Coxhead" [EMAIL PROTECTED]
It would be very entertaining to do the same job with the ideographs
(down
to the radical level) and count the number of atoms. I suspect the
resulting
"character set" would contain less than 2000 atoms altogether.
More than just entertaining,
It would be very entertaining to do the same job with the ideographs (down
to the radical level) and count the number of atoms. I suspect the resulting
"character set" would contain less than 2000 atoms altogether.
MichKa replied ...
More than just entertaining, one would definitely find
John Cowan wrote:
The result is a back-to-the-principles "WCode", nicely streamlined:
- no compatibility or precomposed characters
But less compact. Without precomposed characters, the overhead of
conversion from old character sets grows considerably.
True. Compactness was not a goal
8 matches
Mail list logo