Received: from china.com (TCE-E-7-182-12.bta.net.cn [202.106.182.12])
by unicode.org (8.9.3/8.9.3) with SMTP id AAA03398
for [EMAIL PROTECTED]; Fri, 23 Mar 2001 00:11:40 -0500
Received: from china.com([10.1.7.101]) by china.com(JetMail 2.5.3.0)
with SMTP id jm93abafcca;
[Feed another to the shubnet . . .]
I have a copy of Shellbear's Practical Malay Grammar that I'm preparing
to transcribe for Project Gutenberg. Unfortunately, he represents the
Malaysian alphabet in a Latin transliteration that includes ng as a
single ligatured form, and I don't know how to
[Hoping the shubnet doesn't got this one too . . .]
WTF-8 could potentially be as compact or more compact than UTF-8 (for
Greek, Arabic ...), since much of the Latin-1 and Latin Extended A blocks
aren't needed in WCode. If you moved the other characters down to
fill that space, you might win
At 20:56 -0500 2001-03-22, Sarasvati wrote:
Here by popular demand is the poll of the day...
http://www.unicode.org/~sarasvati/poll.html
Not Found
The requested URL /~sarasvati/Democratic-Process was not found on this server.
Sarasvati wrote:
TO FIND OUT WHO IS ON A LIST:
Send a message to the listar account on the server with
a subject
of "who [listname]". You will receive a list of people
subscribed
to the list who are not hidden. Admins will be able to see
everbody, including
Ar 23 Mar 2001, ag 1:44 scrobh Sarasvati
fn bhar "Re: Moving mail lists":
At this moment, there are 691 addresses subscribed to
the Unicode mail list. At least 24 of those entities
are points of further fan-out to local lists elsewhere.
If you can gather a list of at least 346 current
Am 2001-03-22 um 14:31 h MEZ hat Tomas McGuinness geschrieben:
I am currently developing a product that will support UCS-2
For a new project, it would be better to support UTF-16, rather
than UCS-2, from the very beginning. There are already characters
accepted for standardization that can not
On Fri, 23 Mar 2001, Sean O Seaghdha wrote:
I'll take the above private response as a request for everyone opposed to or
in favour of this piece of nonsense to e-mail Sarasvati [EMAIL PROTECTED]
with your opinion. I trust Sarasvati will keep us apprised of the tally, the
proportion of
Touché by all of you who've corrected my reliance on dictionaries for tech
definitions.
Jeff,
A byte is the least addressable portion of memory. The IBM 1401 for example
has 6 bit bytes + a word mark. Parity bits don't count. A lot of systems
in the 50's and early 60's had 6 bit bytes. That is why octal became so
popular.
Bytes were not used for systems like the IBM 1620 which
Marco,
I find that people often understand it better when you get away from bytes,
octets etc. and describe Unicode strings as an array of unsigned short (16
bit unsigned integers) in the same manner as single byte characters are an
array of 8 bit integers. This way the only time you have to
Do you know CJK (kandji) dictionary with unicode codings?
(This range's english meaning:
4E00CJK Ideograph, First
9FA5CJK Ideograph, Last
F900CJK Compatibility Ideograph, First
FA2DCJK Compatibility Ideograph, Last)
Thanks.
Gza
A byte may be 8 bits now but it was not always 8 bits.
Au contraire!
It was the designers of System/360 who invented the word "byte" to mean the
smallest addressable unit of storage, in their case 8 bits. It is others who
have appropriated the word for their own purposes, as has happened with
At 09:21 -0800 2001-03-23, Michael \(michka\) Kaplan wrote:
After all, no one at all is claiming incompetence on the part of our
ever-vigilant Bubble Queen of the River Ganga, but some people are
talking about how much they preferred the way our effervescent but
bitwise conservative used to do
Sarasvati had wriiten:
TO FIND OUT WHO IS ON A LIST...
Marco had written:
O, thank you! This is a great great utility for spammers!
[EMAIL PROTECTED] wrote:
Marco: you evidently missed this line in her message:
The list of subscribers is not available.
Because of this very sentence, I
Marco,
Thank you for your valuable and candid opinion.
I appreciate your confidence in my intelligence.
Let me point out, however, that I specifically
wrote in my NOTES about the helpful info:
The list of subscribers is not available.
as Peter Constable has already pointed out.
Cheery
Unfortunately, anyone who felt strongly about things could easily (and
reasonably) take it another way.
After all, no one at all is claiming incompetence on the part of our
ever-vigilant
Bubble Queen of the River Ganga, but some people are talking about how much
they preferred the way our
Because of this very sentence, I have tested the Who command, and
guess what? On Fri, 23 Mar 2001 08:45:18 -0500 (EST), Listar
[EMAIL PROTECTED] happily sent me a list of 686 subscribers
to the Unicode list.
Paranoid, I just tried the same and got:
SNIP
List context changed to
Another web page, for your collective amusement:
http://linguistics.berkeley.edu/~rscook/html/Unicode-tetralog.html
Ar 21 Mar 2001, ag 11:58 scrobh [EMAIL PROTECTED]
fn bhar "[unicode] Re: Moving mail lists":
Those whom can filter their mail also can alter the subject line easily
with, for example, small perl script.
Since this is so easy, could you send me one?
At 08:13 23-03-2001 -0800, Carl W. Brown wrote:
Adam,
I think that the poll was not arrogant but a little fun to break the
tension.
I'd have probably been more amused had I not been logged off the Internet
at the time I was reading the message announcing the poll: I logged on,
loaded the
Those whom can filter their mail also can alter the subject line easily
with, for example, small perl script.
Sean Since this is so easy, could you send me one?
% perl -ne 's/\[unicode\]// if (/^Subject:/);' messagefile
Oops. I missed a spot.
% perl -ne 's/\[unicode\]// if (/^Subject:/); print;' messagefile
-
Mark Leisher Times are bad. Children no longer obey
Computing Research Labtheir parents, and
From: "Jonathan Coxhead" [EMAIL PROTECTED]
It would be very entertaining to do the same job with the ideographs
(down
to the radical level) and count the number of atoms. I suspect the
resulting
"character set" would contain less than 2000 atoms altogether.
More than just entertaining,
It would be very entertaining to do the same job with the ideographs (down
to the radical level) and count the number of atoms. I suspect the resulting
"character set" would contain less than 2000 atoms altogether.
MichKa replied ...
More than just entertaining, one would definitely find
At Fri, 23 Mar 2001 00:13:33 -0800, Rick McGowan [EMAIL PROTECTED] wrote:
David Starner wrote:
I have a copy of Shellbear's Practical Malay Grammar that I'm preparing
to transcribe for Project Gutenberg. Unfortunately, he represents
the
Malaysian alphabet in a Latin transliteration that
I taught myself to read a bit of SJIS mojibake, partly
from studying the scrambled output of my clock program
with fullwidth digits. (glitch plus O = 0. Glitch plus
P = 1, etc., I think) Anyone else here can read mojibake?
What is the English word for mojibake?
Isn't Unicode mojibake three
27 matches
Mail list logo