Re: [idn] Javascript code charts, unicode converter, show-characters

2000-11-16 Thread Mark Davis
That agrees with the results I get on http://www.macchiato.com/unicode/convert.html. Mark - Original Message - From: J. William Semich To: Mark Davis ; Rick H Wesson Cc: Unicore ; Unicode ; w3c-i18n-ig Sent: Wednesday, November 15, 2000 22:46 Subject: Re:

Re: Unicode not approved by China

2000-11-16 Thread Brendan Murray/DUB/Lotus
Bjorn Stabell [EMAIL PROTECTED] wrote: According to this news item (in Chinese), China rejected HK's application to use Unicode, and instead says they have to use ISO 10646-1:2000 or GB18030. Apparently they don't like to standardize on a standard controlled by an organization of

Serious problems with Arabic

2000-11-16 Thread Roozbeh Pournader
Dear All, I have serious problems with Unicode Arabic. The main problem is with the Arabic shaping rules in TUS 3.0, pages 192--197. I think these should be changed in some suggested ways. Would someone please guide me on how should I prepare an official suggestion? 1. "Bidi and Cursive

Re: Persian decimal separator

2000-11-16 Thread Michael \(michka\) Kaplan
The only question I have here is that no one was *ever* suggesting a slash be used for the decimal separator. The suggestion was made for the CURRENCY separator. Since the question was asked (and all of the answers were made) mentioning the decimal separator only, I would have to wonder whether

Re: Java and Unicode

2000-11-16 Thread Elliotte Rusty Harold
At 4:44 PM -0800 11/15/00, Markus Scherer wrote: In the case of Java, the equivalent course of action would be to stick with a 16-bit char as the base type for strings. The int type could be used in _additional_ APIs for single Unicode code points, deprecating the old APIs with char. It's

Re: Persian decimal separator

2000-11-16 Thread Roozbeh Pournader
On Thu, 16 Nov 2000, Michael (michka) Kaplan wrote: The only question I have here is that no one was *ever* suggesting a slash be used for the decimal separator. The suggestion was made for the CURRENCY separator. Since the question was asked (and all of the answers were made) mentioning

Re: Java and Unicode

2000-11-16 Thread Valeriy E. Ushakov
On Thu, Nov 16, 2000 at 05:58:27 -0800, Elliotte Rusty Harold wrote: public char charAt(int index) This method is used to walk strings, looking at each character in turn, a useful thing to do. Clearly it would be possible to replace it with a method with a String return type like this:

Re: Java and Unicode

2000-11-16 Thread Elliotte Rusty Harold
At 7:26 AM -0800 11/16/00, Valeriy E. Ushakov wrote: On Thu, Nov 16, 2000 at 05:58:27 -0800, Elliotte Rusty Harold wrote: public char charAt(int index) This method is used to walk strings, looking at each character in turn, a useful thing to do. Clearly it would be possible to replace

Re: Persian decimal separator

2000-11-16 Thread Michael \(michka\) Kaplan
I would openly encourage you to help "perpetuate the evidence" so that this change can be evaluated and the right thing can be done in MS products (which as I said currently have the slash as a currency sign). Cites and sources for the investigation, etc. would be very important here. I do not

Re: Persian decimal separator

2000-11-16 Thread Roozbeh Pournader
On Thu, 16 Nov 2000, Michael (michka) Kaplan wrote: I would openly encourage you to help "perpetuate the evidence" so that this change can be evaluated and the right thing can be done in MS products (which as I said currently have the slash as a currency sign). Cites and sources for the

Re: Persian decimal separator

2000-11-16 Thread Michael \(michka\) Kaplan
There is no legal Farsi 32-bit version of Windows available in Iran, but there was once a 16-bit one available. BTW, it depends on how do you interpret legal. Iran is not in the international copyright agreement, so doing a localized Windows is considered legal here. There are more

[OT] Persian version of Windows (was Re: Persian decimal separator)

2000-11-16 Thread Roozbeh Pournader
On Thu, 16 Nov 2000, John Cowan wrote: If the copy cannot show a lawful chain of licensing from Microsoft, then it cannot be sold in Berne Convention countries (i.e. most of the world). This seems reasonable. I think the point is that some day there will be a licensed Persian Windows,

Re: Java and Unicode

2000-11-16 Thread Thomas Chan
On Thu, 16 Nov 2000, Markus Scherer wrote: The ICU API was changed this way within a few months this year. Some of the higher-level implementations are still to follow until next summer, when there will be some 45000 CJK characters that will be infrequent but hard to ignore - the Chinese and

Re: Java and Unicode

2000-11-16 Thread Markus Scherer
Juliusz Chroboczek wrote: I believe that Java strings use UTF-8 internally. .class files use a _modified_ utf-8. at runtime, strings are always in 16-bit unicode. At any rate the internal implementation is not exposed to applications -- note that `length' is a method in class String (while

Okay, I didn't mean to send that... (fwd)

2000-11-16 Thread addison
... but I have fat fingers today, and the difference betwixt Ctrl-C (cancel) and Ctrl-X (send) is pretty small. In any case, if you bothered to read my ruminations, your comments on the dilemma are welcome. Best Regards, Addison ===

string vs. char [was Re: Java and Unicode]

2000-11-16 Thread addison
Normally this thread would be of only academic interest to me... ...but this week I'm writing a spec for adding Unicode support to an embedded operating system written in C. Due to Mssrs. O'Conner and Scherer's presentations at the most recent IUC, I was aware of the clash between internal

RE: [OT] Localized Persian Windows (was Re: Persian decimal separator)

2000-11-16 Thread Houman Pournasseh
Microsoft never disclosed Windows source code for localization into Farsi. We once developed a Farsi version of Windows 3.1 entirely in-house and shipped it to Iran. We used a vendor for localization but source bits were never sent to Iran. That product was protected by a dongle which probably

Re: string vs. char [was Re: Java and Unicode]

2000-11-16 Thread Mark Davis
We have found that it works pretty well to have a uchar32 datatype, with uchar16 storage in strings. In ICU (C version) we use macros for efficient access; in ICU (C++) version we use method calls, and for ICU (Java version) we have a set of utility static methods (since we can't add to the Java