At Fri, 23 Mar 2001 00:13:33 -0800, Rick McGowan <[EMAIL PROTECTED]>
wrote:
>David Starner wrote:
>
>> I have a copy of Shellbear's Practical Malay Grammar that I'm preparing
>> to transcribe for Project Gutenberg. Unfortunately, he represents
>>the
>> Malaysian alphabet in a Latin transliterati
I taught myself to read a bit of SJIS mojibake, partly
from studying the scrambled output of my clock program
with fullwidth digits. (glitch plus O = 0. Glitch plus
P = 1, etc., I think) Anyone else here can read mojibake?
What is the English word for mojibake?
Isn't Unicode mojibake three mojibak
At Fri, 23 Mar 2001 00:13:33 -0800, Rick McGowan <[EMAIL PROTECTED]> wrote:
>David Starner wrote:
>
>> I have a copy of Shellbear's Practical Malay Grammar that I'm preparing
>> to transcribe for Project Gutenberg. Unfortunately, he represents
>>the
>> Malaysian alphabet in a Latin transliteratio
It would be very entertaining to do the same job with the ideographs (down
to the radical level) and count the number of atoms. I suspect the resulting
"character set" would contain less than 2000 atoms altogether.
MichKa replied ...
> More than just entertaining, one would definitely find
From: "Jonathan Coxhead" <[EMAIL PROTECTED]>
>It would be very entertaining to do the same job with the ideographs
(down
> to the radical level) and count the number of atoms. I suspect the
resulting
> "character set" would contain less than 2000 atoms altogether.
More than just entertaining
>> Have Marco or Roozbeh looked at the ATSUI
>> docs at the Apple dev site? http://developer.apple.com
>> It's interesting how their approach corresponds to and
>> differs from that one.
>
> Would you please tell me where should I look? Trying to search for ATSUI
> at the search box provided at
> WTF-8 could potentially be as compact or more compact than UTF-8 (for
> Greek, Arabic ...), since much of the Latin-1 and Latin Extended A blocks
> aren't needed in WCode. If you moved the other characters down to
> fill that space, you might win what you lost to C1 compatibilty.
A while
On Fri, 23 Mar 2001, jgo wrote:
> Let's move on. Have Marco or Roozbeh looked at the ATSUI
> docs at the Apple dev site? http://developer.apple.com
> It's interesting how their approach corresponds to and
> differs from that one.
Would you please tell me where should I look? Trying to search
Perhaps Roman Czyborra's Unifont would work well for this
application. It is explained and available at:
http://www.czyborra.com/unifont/
Magda Danish forwarded:
> -
>
> We are a company that builds electronic equipment. Ou
> At 2001-03-23 11:27:46 EST [EMAIL PROTECTED] wrote:
> The [unicode] tag...
I like it; got attuned to it on the Carbon list;
been wanting it for months.
Let's move on. Have Marco or Roozbeh looked at the ATSUI
docs at the Apple dev site? http://developer.apple.com
It's interesting how their a
On Sat, 24 Mar 2001, Sean O Seaghdha wrote:
> Ar 21 Mar 2001, ag 11:58 scríobh [EMAIL PROTECTED]
> fán ábhar "[unicode] Re: Moving mail lists":
>
> > Those whom can filter their mail also can alter the subject line easily
> > with, for example, small perl script.
>
> Since this is so easy, co
Oops. I missed a spot.
% perl -ne 's/\[unicode\]// if (/^Subject:/); print;' messagefile
-
Mark Leisher Times are bad. Children no longer obey
Computing Research Labtheir parents, and ev
>> Those whom can filter their mail also can alter the subject line easily
>> with, for example, small perl script.
Sean> Since this is so easy, could you send me one?
% perl -ne 's/\[unicode\]// if (/^Subject:/);' messagefile
At 08:13 23-03-2001 -0800, Carl W. Brown wrote:
>Adam,
>
>I think that the poll was not arrogant but a little fun to break the
>tension.
I'd have probably been more amused had I not been logged off the Internet
at the time I was reading the message announcing the poll: I logged on,
loaded the bro
Ar 21 Mar 2001, ag 11:58 scríobh [EMAIL PROTECTED]
fán ábhar "[unicode] Re: Moving mail lists":
> Those whom can filter their mail also can alter the subject line easily
> with, for example, small perl script.
Since this is so easy, could you send me one?
`~:.,.:'^`~:.,.:'^`~:.,.:'^`~:.,.:'^`
Ar 23 Mar 2001, ag 13:44 scríobh Sarasvati
fán ábhar "[unicode] Re: Helpful info":
> One user quoted my offer to put feature (2)
> to a vote, but neglected to quote the other part
> of my note, which mentioned that this feature
> is often requested, and failed to mention also
> I said that I am
Another web page, for your collective amusement:
http://linguistics.berkeley.edu/~rscook/html/Unicode-tetralog.html
Dear list subscribers,
I received this query by fax. I could use help from the list to respond
to it.
Thanks for your time.
Magda Danish
Administrative Director
The Unicode Consortium
650-693-3921
-
We are a company that buil
Dear Uni-cadets --
Thanks to all of you for your input.
Over the years, there have been two features
that are requested time and again by users:
1. digest mode
2. tags in the subject line.
Both of these features are available in
"Listar", the new mail list software, and so
I have enabled them.
Mike,
They did define it as "the smallest addressable unit of storage" but even on
the S/360 it was not always 8 bits. The S/360 also ran 1401/1410 emulation
and in that mode the byte was 6 bits of data + a word mark. Rather than
EBCDIC is was BCD.
Carl
-Original Message-
From: [EMAIL
> Because of this very sentence, I have tested the Who command, and
> guess what? On Fri, 23 Mar 2001 08:45:18 -0500 (EST), Listar
> <[EMAIL PROTECTED]> happily sent me a list of 686 subscribers
> to the Unicode list.
Paranoid, I just tried the same and got:
List context changed to '
Unfortunately, anyone who felt strongly about things could easily (and
reasonably) take it another way.
After all, no one at all is claiming incompetence on the part of our
ever-vigilant
Bubble Queen of the River Ganga, but some people are talking about how much
they preferred the way our efferve
Marco,
Thank you for your valuable and candid opinion.
I appreciate your confidence in my intelligence.
Let me point out, however, that I specifically
wrote in my NOTES about the helpful info:
> The list of subscribers is not available.
as Peter Constable has already pointed out.
Cheery regar
Sarasvati had wriiten:
> TO FIND OUT WHO IS ON A LIST...
Marco had written:
> O, thank you! This is a great great utility for spammers!
[EMAIL PROTECTED] wrote:
> Marco: you evidently missed this line in her message:
> > The list of subscribers is not available.
Because of this very sentence, I
Doug Ewell wrote:
> And there is the problem somebody mentioned with "Re:
> [unicode] Re: [unicode] Re:..."
The list software seems to be smart enough to rewrite the Subject header so
that there is only one [unicode]. My mailer is prepending RE: to the
"[unicode] [OT]..." subject, but I doubt th
At 09:21 -0800 2001-03-23, Michael \(michka\) Kaplan wrote:
>After all, no one at all is claiming incompetence on the part of our
>ever-vigilant Bubble Queen of the River Ganga, but some people are
>talking about how much they preferred the way our effervescent but
>bitwise conservative used to
>
> A byte may be 8 bits now but it was not always 8 bits.
>
Au contraire!
It was the designers of System/360 who invented the word "byte" to mean the
smallest addressable unit of storage, in their case 8 bits. It is others who
have appropriated the word for their own purposes, as has happened wi
From: "Carl W. Brown" <[EMAIL PROTECTED]>
> I think that the poll was not arrogant but a little fun to break the
> tension.
Unfortunately, anyone who felt strongly about things could easily (and
reasonably) take it another way.
After all, no one at all is claiming incompetence on the part of ou
In a message dated 2001-03-23 4:50:21 Pacific Standard Time,
[EMAIL PROTECTED] writes:
> That Sarasvati wants to do things her way is fine by me. That she made
> me log into the Internet, launch my browser, just to get to the mockery
> of the poll.html and no Democratic-Process, is a slap in
Do you know CJK (kandji) dictionary with unicode codings?
(This range's english meaning:
4E00
9FA5
F900
FA2D)
Thanks.
Géza
-
Rozsa Geza
432-8279, (30)996-0007;
SMSre a subject: [EMAIL PROTECTED]
Adam,
I think that the poll was not arrogant but a little fun to break the
tension.
Carl
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On
Behalf Of G. Adam Stanislav
Sent: Friday, March 23, 2001 4:38 AM
To: Michael Everson; [EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]
Marco,
I find that people often understand it better when you get away from bytes,
octets etc. and describe Unicode strings as an array of unsigned short (16
bit unsigned integers) in the same manner as single byte characters are an
array of 8 bit integers. This way the only time you have to dea
Jeff,
A byte is the least addressable portion of memory. The IBM 1401 for example
has 6 bit bytes + a word mark. Parity bits don't count. A lot of systems
in the 50's and early 60's had 6 bit bytes. That is why octal became so
popular.
Bytes were not used for systems like the IBM 1620 which
Touché by all of you who've corrected my reliance on dictionaries for tech
definitions.
On 03/23/2001 04:58:39 AM unicode-bounce wrote:
[Marco has that old Unicode bounce in him again! :-)]
>Sarasvati wrote:
>> TO FIND OUT WHO IS ON A LIST...
>O, thank you! This is a great great utility for spammers!
Marco: you evidently missed this line in her message:
>The list of subscriber
On Fri, 23 Mar 2001, Sean O Seaghdha wrote:
> I'll take the above private response as a request for everyone opposed to or
> in favour of this piece of nonsense to e-mail Sarasvati <[EMAIL PROTECTED]>
> with your opinion. I trust Sarasvati will keep us apprised of the tally, the
> proportion
Am 2001-03-22 um 14:31 h MEZ hat Tomas McGuinness geschrieben:
> I am currently developing a product that will support UCS-2
For a new project, it would be better to support UTF-16, rather
than UCS-2, from the very beginning. There are already characters
accepted for standardization that can not
Otto Stolz scripsit:
> Of course, if you allow for
> names comprising several characters, it would be better to spell out the
> multiplication operator; U+22C5, U+2219, U+00B7, and 00D7 coming to mind.
Yes, of course. But that wasn't at all the case I had in mind. Letting x
and y be two rando
At 09:48 23-03-2001 +, Michael Everson wrote:
>> http://www.unicode.org/~sarasvati/poll.html
>
>
>Not Found
>The requested URL /~sarasvati/Democratic-Process was not found on this
server.
That was the whole point, I believe. Since there were only two choices,
and both identical, there is
Ar 23 Mar 2001, ag 1:44 scríobh Sarasvati
fán ábhar "Re: Moving mail lists":
> At this moment, there are 691 addresses subscribed to
> the Unicode mail list. At least 24 of those entities
> are points of further fan-out to local lists elsewhere.
> If you can gather a list of at least 346 current
Sarasvati wrote:
> TO FIND OUT WHO IS ON A LIST:
> Send a message to the listar account on the server with
> a subject
> of "who [listname]". You will receive a list of people
> subscribed
> to the list who are not hidden. Admins will be able to see
> everbody, includin
At 20:56 -0500 2001-03-22, Sarasvati wrote:
>Here by popular demand is the poll of the day...
>
> http://www.unicode.org/~sarasvati/poll.html
Not Found
The requested URL /~sarasvati/Democratic-Process was not found on this server.
--
[Hoping the shubnet doesn't got this one too . . .]
WTF-8 could potentially be as compact or more compact than UTF-8 (for
Greek, Arabic ...), since much of the Latin-1 and Latin Extended A blocks
aren't needed in WCode. If you moved the other characters down to
fill that space, you might win wh
John Cowan schrieb:
:Hmm. If you multiply x-bar by y-bar, surely you want the bars to be
:separated, not run together into a single bar (which would be the mean
:of x times y), no? In that case COMBINING MACRON would be better.
:Or should x-bar times y-bar be written with a THIN SPACE separatin
David Starner wrote:
> I have a copy of Shellbear's Practical Malay Grammar that I'm preparing
> to transcribe for Project Gutenberg. Unfortunately, he represents the
> Malaysian alphabet in a Latin transliteration that includes ng as a
> single ligatured form, and I don't know how to transcribe
[Hoping the shubnet doesn't got this one too . . .]
WTF-8 could potentially be as compact or more compact than UTF-8 (for
Greek, Arabic ...), since much of the Latin-1 and Latin Extended A blocks
aren't needed in WCode. If you moved the other characters down to
fill that space, you might win wh
[Feed another to the shubnet . . .]
I have a copy of Shellbear's Practical Malay Grammar that I'm preparing
to transcribe for Project Gutenberg. Unfortunately, he represents the
Malaysian alphabet in a Latin transliteration that includes ng as a
single ligatured form, and I don't know how to t
John, thank you. China.com subscribers have been
removed pending resolution of the problem.
-- Sarasvati
> Received: from china.com (TCE-E-7-182-12.bta.net.cn [202.106.182.12])
> by unicode.org (8.9.3/8.9.3) with SMTP id AAA03398
> for <[EMAIL PROTECTED]>; Fri, 23 Mar 2001 00:11:40 -0500
> Received: from china.com([10.1.7.101]) by china.com(JetMail 2.5.3.0)
> with SMTP id jm93abafcc
49 matches
Mail list logo