Re: Unicode in web pages

2000-09-05 Thread Stephen Toner
Does that mean that inputted code from a web-page must be changed from its UTF-8 encoding to UCS-2 for storage in SQL server? If so are there any converters out there? Can UCS-2 be used as the encoding for a web-page, or must conversion be done between the two encodings. From: "Michael

RE: Same language, two locales

2000-09-05 Thread Michael Everson
Ar 16:40 -0800 2000-09-03, scríobh John Cowan: On Sun, 3 Sep 2000, Alistair Vining wrote: Except that the Oxford dictionaries (and hence many UK users) have gone over to -ize spellings, so you have to learn to ignore the false negatives and search for the false positives... In this case it

RE: Same language, two locales

2000-09-05 Thread John Cowan
On Tue, 5 Sep 2000, Michael Everson wrote: Oxford notes that it is in imitation of French orthography. Meaning the use of "-ise" everywhere, I suppose; a straight parse of what you said would be "Oxford notes that Oxford is in imitation..." which seems to contradict what you meant. -- John

Re: Plane 14 redux (was: Same language, two locales)

2000-09-05 Thread Peter_Constable
Ar 11:51 -0800 2000-09-02, scríobh John Cowan: Its only real competitor is the SIL set, and an effort is underway to incorporate it en masse (or nearly so) into the RFC 1766 registry. I assure you that this will not happen. We have procedures and they must be followed, and dumping thousands of

Re: Plane 14 redux (was: Same language, two locales)

2000-09-05 Thread Michael Everson
Ar 09:38 -0500 2000-09-05, scríobh [EMAIL PROTECTED]: Well, Michael, *something* has to happen. There are a significant number of users for whom the status quo isn't adequate. Come on, Peter. Which users are they and what languages do they need? Be specific. Are they candidates for encoding in

Re: Unicode in web pages

2000-09-05 Thread Michael \(michka\) Kaplan
Well, I do not understand the issues underlying the technologies you have chosen. In ASP (for example) you have a CodePage property that you set to have this done automatically. One would hope that there is a similar solution in your case? I would not recommend encoding a web page as UCS-2, as

Re: Unicode in web pages

2000-09-05 Thread Vinod Balakrishnan
Hi Stephen, I had a similar problem last year when I was using an older version of the JDK with a java tool. This is my tip. Enter some Japanese text in the form and check the values of the characters in the each stage like, data from the browser (make sure you are getting UTF8), data

Re: FW: Char sets up-to-date

2000-09-05 Thread Misha Wolf
Received from ietf-charsets. Misha FYI: IETF-Charsets The following are recent changes and registrations to the Character Sets registry: Name: IBM00858Name: IBM01144 MIBenum: 2089 MIBenum: 2095 Name: IBM00924Name: IBM01145 MIBenum: 2090

IANA Character Set registration for SCSU

2000-09-05 Thread Markus Scherer
Hello, the IANA charset list is nicely refurbished, updated, and now lists SCSU. See below. markus Original Message From: "IANA" [EMAIL PROTECTED] Subject: Character Set registration To: [EMAIL PROTECTED] Dear Markus, We sincerely apologize for the delay. We have registered

Re: Plane 14 redux (was: Same language, two locales)

2000-09-05 Thread Peter_Constable
On 09/05/2000 09:38:00 AM Michael Everson wrote: Well, Michael, *something* has to happen. There are a significant number of users for whom the status quo isn't adequate. Come on, Peter. Which users are they and what languages do they need? Be specific. The languages are all those listed in