Re: Need encoding conversion routines

askq1 askq1 Fri, 14 Mar 2003 04:59:07 -0800

From: "Pim Blokland" <[EMAIL PROTECTED]>
To: "Unicode mailing list" <[EMAIL PROTECTED]>
Subject: Re: Need encoding conversion routines
Date: Fri, 14 Mar 2003 12:30:44 +0100

askq1 askq1 schreef:

> In particular I need source code (or some way) for following
requirements:
> - Convert Unicode code-point to UTF8 encoding and vice-versa.
> - Convert Unicode code-point to UCS2 encoding and vice-versa.
> - Convert Unicode code-point to UTF16 encoding and vice-versa.

Ahem. Unicode *IS* UTF-8, UTF-16 and UCS-2. For instance, codepoint
U+4321 has the value (hex) 4321, which is defined as its Unicode
value. This is the same in any encoding. So I'm not sure what you
want. If the C routines at
http://www.unicode.org/Public/PROGRAMS/CVTUTF/ don't do it for you,
which conversion do you need? LE byte order to BE and back?
Canonical decomposing? Fallback character substitutions? BOM
insertion? What?

Yes I agree to what you are saying above. Let em explain what I want. Character U+4321 is the unicode code-point but to store this character into a file we need to use a certain encoding format. e.g. There must be some algorithm to find *the sequence of bytes* that represent this character into *UTF8 encoding*. Similar algorithms must be there for UTF16 and UCS2 encodings, I want C implementation of such algorithms.

Thanks,
~ K.

Pim Blokland

_________________________________________________________________ Cricket - World Cup 2003 http://server1.msn.co.in/msnspecials/worldcup03/ News, Views and Match Reports.

Re: Need encoding conversion routines

Reply via email to