RE: Help in a HURRY !!!!!!!!!!!!!!!!!!!!!!!

2001-05-15 Thread Yves Arrouye
To go with Lukas's Perl code, I'll provide a C version, not really tested either, with ICU, to give him a choice. No error checking etc., just to give the idea. If you want UTF-16 you'll need to use the macros in unicode/utf16.h to generate surrogate pairs properly. #include stdio.h #include

RE: UCD in XML

2001-05-15 Thread Yves Arrouye
I then tried my usual remedy: Bow in precisely the correct direction (359° 16' 32 N*) Adjust the bearing for declination (15° 26' E according to my chart of the bay), and try again compass in hand, maybe? ;-) YA

Re: A new web page relating to Khmer in Unicode

2001-05-15 Thread Michael Everson
At 21:39 +0100 2001-05-14, Maurice Bauhahn wrote: Hello Friends, I would appreciate any corrections, additional resources, suggestions, and critiques to fill out a new Web page which I have compiled relating to the Khmer script in Unicode: http://www.bauhahnm.clara.net/Khmer/Welcome.html Are

RE: UTF-8 signature in web and email

2001-05-15 Thread Roozbeh Pournader
On Tue, 15 May 2001, Richard, Francois M wrote: UTF-8 is considered as a character encoding form as any other... For UTF-16 only, the BOM is recommended. See http://www.w3.org/TR/REC-html40/charset.html#h-5.2.1 So BOM for UTF-8 HTML is neither recommended nor discouraged? Does anyone agree

Ancient writing found in Turkmenistan

2001-05-15 Thread Misha Wolf
quote A previously unknown civilisation was using writing in Central Asia 4,000 years ago, hundreds of years before Chinese writing developed, archaeologists have discovered. /quote http://news.bbc.co.uk/hi/english/world/asia-pacific/newsid_133/1330705.stm Misha

Re: Ancient writing found in Turkmenistan

2001-05-15 Thread Michael Everson
At 13:09 +0100 2001-05-15, Misha Wolf wrote: quote A previously unknown civilisation was using writing in Central Asia 4,000 years ago, hundreds of years before Chinese writing developed, archaeologists have discovered. /quote

RE: Ancient writing found in Turkmenistan

2001-05-15 Thread jarkko . hietaniemi
quote A previously unknown civilisation was using writing in Central Asia 4,000 years ago, hundreds of years before Chinese writing developed, archaeologists have discovered. /quote http://news.bbc.co.uk/hi/english/world/asia-pacific/newsid_133/1330705.s tm I noticed that too and was

Re: UTF-8 signature in web and email

2001-05-15 Thread Misha Wolf
This mail, addressed to [EMAIL PROTECTED], was, presumably, intended for [EMAIL PROTECTED]. Misha On 15/05/2001 00:32:24 Roozbeh Pournader wrote: Well, I received a UTF-8 email from Microsoft's Dr International today. It was a multipart/alternative, with both the text/plain and text/html

Re: UTC Agenda Item : UTF-8S

2001-05-15 Thread Markus Scherer
3 comments: 1. Binary order of UTF-16 strings compatible with binary order of UTF-8/32 is easily achieved using the fix-up described in my article on developerWorks (there is currently a problem with that site). Essentially, one rotates the 16-bit values so that the surrogates get to the top