I have heard a rumour (i.e. my source is not involved in the reported
activity) that:
quote
SAP, PeopleSoft, Siebel, Oracle and others are actually
in the process of proposing a new format of UTF that will cause a UTF-16
surrogate pair to become two 3-byte UTF-8 codepoints so that UTF-8 will
Using UTF-8 to handle characters in the supplementary planes by way of using
two separate code points in the surrogate range is NOT considered
acceptable.
Currently it is legal to interpret them but *not* to generate them (multople
refs on the Unicode site). Therefore, I hope you are mistaken
In a message dated 2001-02-05 5:19:59 Pacific Standard Time,
[EMAIL PROTECTED] writes:
I have heard a rumour (i.e. my source is not involved in the reported
activity) that:
quote
SAP, PeopleSoft, Siebel, Oracle and others are actually
in the process of proposing a new format
The topic came up in a UTC meeting some time ago, a "UTF-8S". The motivation
was for performance (having a form that reproduces the binary order of
UTF-16). We have yet to see a formal proposal for this, though.
Mark
- Original Message -
From: "J M Sykes" [EMAIL PROTECTED]
To: "Unicode
A communication with someone offlist (though I think he is on the list)
suggested that Unicode is not supported at all in Macintosh OS8.6 or OS9,
not even to the degree that it is supported in Windows 9x, except by means
of Windows emulation (if I'm characterizing the message correctly; it is on
Within a String, the encoding of char values is practically irrelevant. It is a
hidden encoding that is never exposed to the user...or developer. When you access
String char values, you use an index to 16-bit Unicode values. To my knowledge,
Sun does not claim that its internal encoding of String
John O'Conner wrote:
Within a String, the encoding of char values is practically irrelevant. It is a
hidden encoding that is never exposed to the user...or developer. When you access
String char values, you use an index to 16-bit Unicode values. To my knowledge,
Sun does not claim that its
John,
It does impact developers.
The API for DataInputStream defines FSS_UTF, which includes the funky
null behavior.
http://java.sun.com/products/jdk/1.2/docs/api/java/io/DataInputStream.html
Since this API and other use this UTF, it gets into file formats and
applications
end up supporting
-Original Message-
From: William Palmer [mailto:[EMAIL PROTECTED]]
Sent: Thursday, February 01, 2001 3:45 PM
To: '[EMAIL PROTECTED]'
Subject: Question on Unicode enabled product Support.
Hello...!
Question:
Has IBM OS/390 Ver 2.6, 2.7 or 2.9 MVS/ESA Operating support for
Perhaps the methods readUTF and writeUTF should be deprecated in favor of
read/writeString. I will submit an RFE (request for enhancement) for this.
I noticed that although the Data{Input,Output} interface clearly says that the
write/readUTF handles a "Java modified UTF-8". The actual javadoc in
John,
I am not clear from your comments which is the bug, since the doc
goes both ways. Are the doc bugs that they say
it is UTF-8, or that they say it is modified UTF-8?
It would be great to learn that the functions are actually unmodified
UTF-8, as I know of some interfaces that are writing
Tex Texin wrote:
I am not clear from your comments which is the bug, since the doc
goes both ways. Are the doc bugs that they say
it is UTF-8, or that they say it is modified UTF-8?
It uses modified UTF-8, modified in three ways:
1) U+ is encoded in two bytes as 0xc0 0x80;
2) values
-- "P. T. Rourke" [EMAIL PROTECTED] is rumored to have mumbled on Montag, 5.
Februar 2001 8:47 Uhr -0800 regarding Macintosh OS8.6, OS9:
A communication with someone offlist (though I think he is on the list)
suggested that Unicode is not supported at all in Macintosh OS8.6 or OS9,
It's not
Mon, 5 Feb 2001 08:20:43 -0800 (GMT-0800), Mark Davis [EMAIL PROTECTED] pisze:
The topic came up in a UTC meeting some time ago, a "UTF-8S". The
motivation was for performance (having a form that reproduces the
binary order of UTF-16).
This is unfair: it slows down the conversion UTF-8 -
Thanks to everyone who responded, especially Mr. Hagedorn; as it is
precisely the Extended Greek, Basic Greek, and Combining diacriticals blocks
that interest me, this was very important information.
Patrick Rourke
Netscape, Internet Explorer and Icab (another browser for the MacOs)
use UTF8
Unfortunately, the issue at this point is that some companies have either
already accepted it or in the process of accepting it now.
MichKa
Michael Kaplan
Trigeminal Software, Inc.
http://www.trigeminal.com/
- Original Message -
From: "Marcin 'Qrczak' Kowalczyk" [EMAIL PROTECTED]
To:
The MacOs did support Unicode in 8.6 and 9.x, see
Apple Type Services for Unicode Imaging Reference
http://developer.apple.com/techpubs/macos8/TextIntlSvcs/ATSUI/ATSUI_ref/index.html
Handling Unicode Text Editing With Multilingual Text Engine
Here's what I see about the Java API docs:
1. The Data{Input, Output}Stream methods {read, write}UTF could be named better. More
appropriate names are {read, write}String. Strictly speaking, this is not a bug, but
it could
be better. That's why I call it an RFE (request for enhancement).
2.
18 matches
Mail list logo