Robert Palais wrote:
I will be doing so, and apologize if my inquiry intruded on your
work, and at the same time, appreciate the many thoughtful
considerations on the matter of process of symbol standardization
that I received.
and later:
I apologize again if my misunderstanding that I
juuitchan wrote:
1. I have a Geocities page now. I do not know what encoding
Geocities uses,
but I think it's unicode. What I did for the Japanese text on
it was not
think about encodings and just type it in with Microsoft's
IME (and do some
swearing at the IME at the process). And
Marco Cimarosti wrote,
juuitchan wrote:
1. I have a Geocities page now. I do not know what encoding
Geocities uses,
URL?
Perhaps it's:
http://www.geocities.co.jp/AnimeComic-Pen/9973
...which has the following in its HTML header:
meta http-equiv=Content-Type content=text/html;
* Addison Phillips
|
| According to Lunde (p. 205), the range is through F9FC. There are
| real characters in the range FA40 - FC4B, at least in CP932, which
| may be causing you some confusion, since these have concrete
| mappings to Unicode (not just a mapping in the U+E000 range).
A very
* Lars Marius Garshol
|
| I've just discovered that it seems that Shift-JIS encodes a number
| of User-Defined Characters in the 0xF040 to 0xFCFC range, and that
| these
* Markus Scherer
|
| Yes, and every implementor may assign characters to them as they see
| fit.
I know. My problem is
* Thomas Chan
|
| Besides CP932 use of Shift-JIS UDC codepoints, one might also want to
| consider (used separately or in conjunction with CP932 UDC usage):
| - encodings of JIS X 0213 in Shift-JIS using UDC codepoints
Is this commonly used on the web?
--Lars M.
* David Hopwood
|
| Presumably the NT 4.0 mapping at http://www.autumn.org/etc/unidif.html
| (in Japanese, but the table is readable by non-Japanese-speakers).
|
| That mapping is a superset of CP932
| (ftp://ftp.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP932.TXT),
| with additional
Here is a proposal for a Unicode Musical Symbols font (forwarded with the
agreement of the originator).
Alan Wood
-Original Message-
From: William Will [SMTP:[EMAIL PROTECTED]]
Sent: Thursday, January 17, 2002 7:29 PM
To: [EMAIL PROTECTED]; [EMAIL PROTECTED]
Cc: [EMAIL
Thomas Chan wrote:
- NTT-DoCoMo pictographs[1] in webpages for cell phones
http://www.nttdocomo.co.jp/i/tag/emoji/ .
Something is definitely weird in this page.
They suggests to use shift-JIS codes for numerical character references, but
such a thing is not allowed by the current definition
If you want to duplicate the IE mappings, you could write a quick little
program to see what the COM APIs map the PUA SJIS characters to.
Mark
—
Πόλλ’ ἠπίστατο ἔργα, κακῶς δ’ ἠπίστατο
πάντα — Ὁμήρου Μαργίτῃ
[For transliteration, see http://oss.software.ibm.com/cgi-bin/icu/tr]
On Fri, 18 Jan 2002 [EMAIL PROTECTED] wrote:
For my part at least, I feel it is important to explain to proponents WHY
their proposed characters may not be suitable for encoding, rather than
simply telling them No.
I thought that had been done quite well.
I think the Unicode Consortium
* Lars Marius Garshol
|
| The problem is what to do about the rest of the range. Lunde
| suggests mapping to the Unicode PUA, but I don't think this is what
| the people using these characters in web pages expect that mapping.
Well, that's what I thought, anyway. But now my Japanese colleagues
At 10:06 AM 1/18/2002 -0700, Robert Palais wrote:
Which seems to make Unicode a defender of the status quo. Inaction is
as political as action. We are holders of the standards
for the technology for encoding symbols, and we won't admit new symbols
until they are widely used... not necessarily the
Marco Cimarosti wrote,
juuitchan wrote:
1. I have a Geocities page now. I do not know what encoding
Geocities uses,
URL?
Perhaps it's:
http://www.geocities.co.jp/AnimeComic-Pen/9973
That was a page which I wrote when I was having a hard time with homepages,
and in a very BAD
At 10:06 -0700 2002-01-18, Robert Palais wrote:
Which seems to make Unicode a defender of the status quo. Inaction is
as political as action. We are holders of the standards
for the technology for encoding symbols, and we won't admit new symbols
until they are widely used... not necessarily the
On Fri, 18 Jan 2002 [EMAIL PROTECTED] wrote:
I think the Unicode Consortium and WG2 do understand this, and that is why
they are so reluctant to encode symbols that do not have established usage,
as in the case of 2 pi, or seek to make a social or political statement that
the Consortium
Title: Message
-Original Message-From: Mohammed Shakeel
[mailto:[EMAIL PROTECTED]] Sent: Thursday, January 17, 2002 4:36
AMTo: [EMAIL PROTECTED]Subject: Require
Information
sir/madam
we are development firm who require to build a site
that is to display information in multiple
Hello Robert and others,
I think the Unicode Consortium and WG2 do understand this, and that is why
they are so reluctant to encode symbols that do not have established usage,
as in the case of 2 pi, or seek to make a social or political statement that
the Consortium and WG2 do not intend,
Robert Palais wrote:
Which seems to make Unicode a defender of the status quo. Inaction is
as political as action. We are holders of the standards
for the technology for encoding symbols, and we won't admit
new symbols
until they are widely used... not necessarily the intent,
but possibly
At 01:45 PM 1/18/2002 -0500, you wrote:
The limitation of characters to those that are in current use is related
in large part to the code point limitations
What limitations? We have over a million codepoints to play with.
There is plenty of room.
I've always been under the impression that
At 10:06 AM 1/18/02 -0700, Robert Palais wrote:
Which seems to make Unicode a defender of the status quo. Inaction is
as political as action. We are holders of the standards
for the technology for encoding symbols, and we won't admit new symbols
until they are widely used... not necessarily the
For those that have not heard about the Unicode standard, you may want to
download the pdf file that describes it at
http://www.unicode.org/charts/PDF/U1D100.pdf
That isn't the first place I would say describes the encoding. That is
just the final code chart and name list. (Which is, of
At 11:02 -0800 2002-01-18, Barry Caplan wrote:
There are plenty of characters which exist in the literature that
are not ended in Unicode, and in fact are specifically excluded:
those of written but dead languages.
They are not only not excluded, they are included: Runic and Deseret
are just
Just 1 week to go!
Twentieth International Unicode Conference (IUC20)
Unicode and the Web: The Global Connection
http://www.unicode.org/iuc/iuc20
January 28-31, 2002
Washington, DC, USA
Just an aside on terminolgy:
At 08:02 PM 1/18/02 +0100, Marco Cimarosti wrote:
3) A newly added operator (ZWL) which allows joining two characters into a
it's CGJ for Combinign Grapheme Joiner
4) A set of operators called Ideographic Description Character (IDC) for
They are for Ideographic
On Fri, Jan 18, 2002 at 11:35:44AM -0800, Asmus Freytag wrote:
Furthermore, there is a small cost of 'carrying a character on the books',
as each character added will incrementally grow the size of support files
that Unicode implementations will need.
They will also end up in fonts that
R. Palais wrote...
Which seems to make Unicode a defender of the status quo. Inaction is
as political as action. We are holders of the standards
for the technology for encoding symbols, and we won't admit new symbols
until they are widely used... not necessarily the intent, but possibly
the
Which seems to make Unicode a defender of the status quo. Inaction is
as political as action. We are holders of the standards
for the technology for encoding symbols, and we won't admit new symbols
until they are widely used... not necessarily the intent, but possibly
the impact - that
This is the same situation as having one person in town be the mural
painter and another be the news photographer. Is every news photographer
required to paint murals, too, or be otherwise accused of hampering
artistic evolution?
That seems to be the wrong analogy. The question is
Oh my! I have to agree, the discussion on the impact of symbol
uniformization IS extremely enlightening to me, although I'm
somewhat apologetic again from distracting everyone from more
serious and practical issues. Thank you all for your thoughtful
responses, both on and off-group!
On Fri, 18
David Starner wrote:
If the symbols in Unicode make a political
statement by being there, then Unicode supports Christianity (U+2626 and
others), anti-Christianity (U+FB29), Islam (U+262a), Hippies (U+262e),
Communism (U+262d), and Dharma (U+2638).
Ahem... Not to mention Turtles. ;-)
I'm wondering what U+05C4 HEBREW MARK UPPER DOT is supposed to be used
for. Specifically, I'm trying to make sense of two characters that I know
are used and that are similar in appearance, and determine which (if
either) is 05C4. There's a hundreds mark - a dot placed over consonants to
At 11:02 AM 1/18/02 -0800, Barry Caplan wrote:
I've always been under the impression that one of the original goals of
the Unicode effort was to do away with he sort of multi-width encodings we
are all too familiar with (EUC, JIS, SJIS, etc.). this was to be
accomplished by using a fixed width
At 11:36 AM 1/18/02 -0800, Rick McGowan wrote:
It is our job as a standarizing organization to standardize what is IN USE
so that (as a goal) people can standard-ly communicate those symbols
internationally without ambiguity. It is _NOT_ our job, and never will be
our job, to invent new symbols
Robert Palais wrote,
I'd even support the inclusion of a copyleft symbol ahead of \newpi!
Has there been any consideration of practical alternatives, such as
selecting a lookalike or similar character from the plethora of those
already encoded and promoting its use to represent the newpi
Ron Tucker wrote,
I am new to this entire area. I have an application that requires both
English and Arabic. Besides having the UNICODE Standard, what do I
need to implement UNICODE into a Visual C++ application and where
can I find it?
A good, general article includes some useful info:
I think it is both.
The upper dot has been used in Hebrew for a number of purposes. The
exact shape does not matter.
Jony
-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED]] On Behalf Of
[EMAIL PROTECTED]
Sent: Friday, January 18, 2002 11:56 PM
To: [EMAIL
1. I have a Geocities page now. I do not know what encoding Geocities
uses,
but I think it's unicode. What I did for the Japanese text on it was not
think about encodings and just type it in with Microsoft's IME (and do
some
swearing at the IME at the process). And it comes out fine, for the
38 matches
Mail list logo