To go with Lukas's Perl code, I'll provide a C version, not really tested
either, with ICU, to give him a choice. No error checking etc., just to give
the idea. If you want UTF-16 you'll need to use the macros in
unicode/utf16.h to generate surrogate pairs properly.
#include stdio.h
#include
I then tried my usual remedy: Bow in precisely the correct
direction (359° 16' 32 N*)
Adjust the bearing for declination (15° 26' E according to my chart of the
bay), and try again compass in hand, maybe? ;-)
YA
At 21:39 +0100 2001-05-14, Maurice Bauhahn wrote:
Hello Friends,
I would appreciate any corrections, additional resources, suggestions,
and critiques to fill out a new Web page which I have compiled relating
to the Khmer script in Unicode:
http://www.bauhahnm.clara.net/Khmer/Welcome.html
Are
On Tue, 15 May 2001, Richard, Francois M wrote:
UTF-8 is considered as a character encoding form as any other...
For UTF-16 only, the BOM is recommended.
See http://www.w3.org/TR/REC-html40/charset.html#h-5.2.1
So BOM for UTF-8 HTML is neither recommended nor discouraged? Does anyone
agree
quote
A previously unknown civilisation was using writing in Central Asia
4,000 years ago, hundreds of years before Chinese writing developed,
archaeologists have discovered.
/quote
http://news.bbc.co.uk/hi/english/world/asia-pacific/newsid_133/1330705.stm
Misha
At 13:09 +0100 2001-05-15, Misha Wolf wrote:
quote
A previously unknown civilisation was using writing in Central Asia
4,000 years ago, hundreds of years before Chinese writing developed,
archaeologists have discovered.
/quote
quote
A previously unknown civilisation was using writing in Central Asia
4,000 years ago, hundreds of years before Chinese writing developed,
archaeologists have discovered.
/quote
http://news.bbc.co.uk/hi/english/world/asia-pacific/newsid_133/1330705.s
tm
I noticed that too and was
This mail, addressed to [EMAIL PROTECTED], was, presumably, intended
for [EMAIL PROTECTED].
Misha
On 15/05/2001 00:32:24 Roozbeh Pournader wrote:
Well, I received a UTF-8 email from Microsoft's Dr International today. It
was a multipart/alternative, with both the text/plain and text/html
3 comments:
1. Binary order of UTF-16 strings compatible with binary order of UTF-8/32 is easily
achieved using the fix-up described in my article on developerWorks (there is
currently a problem with that site).
Essentially, one rotates the 16-bit values so that the surrogates get to the top
9 matches
Mail list logo