Re: Unicode Decoding to ASCII

Ray Wurlod Mon, 01 Mar 2004 11:49:27 -0800

Unicode's original 16-bit design covers (potentially) 65,536 different characters, and 
the standard has since grown well beyond that.  ASCII covers 128.  Translation will only 
work if the Unicode document contains those same 128 characters and no others.

To compound your problem further, there are different encodings of Unicode.  The 16-bit 
encodings (UCS-2, UTF-16) store most characters as two bytes each.  However, there are 
also several UTF-8 variants, where a byte stream represents the Unicode code points.  In 
these you get one to four bytes per character.  The UniVerse UTF-8 encoding is an example; 
it uses special handling for the NULL and dynamic array delimiter characters.
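
For what it's worth, the table-driven conversion Dave describes can be sketched in a 
few lines of Python, which runs on Windows.  This is only a sketch under assumptions: 
the table entries, the function name unicode_to_ascii and the default encoding are all 
made up for illustration, and you must know (or guess correctly) how the source 
document is encoded before the table can be applied.

```python
# Hypothetical conversion table: Unicode code point -> ASCII replacement.
# These entries are illustrative only; a real table would list every
# non-ASCII character expected in the source documents.
CONVERSION_TABLE = {
    0x00E9: "e",    # e with acute accent
    0x00FC: "u",    # u with diaeresis
    0x00DF: "ss",   # German sharp s
    0x20AC: "EUR",  # euro sign
}


def unicode_to_ascii(raw: bytes, encoding: str = "utf-8") -> bytes:
    """Decode raw bytes with a known encoding, then map each character to ASCII."""
    # Step 1: turn the byte stream into code points. The encoding must be
    # supplied by the caller; the same text has different bytes in UTF-8
    # and UTF-16.
    text = raw.decode(encoding)
    # Step 2: apply the table. Characters without an entry pass through as-is.
    mapped = text.translate(CONVERSION_TABLE)
    # Step 3: strict ASCII encode. Any character still outside the 128 ASCII
    # values raises UnicodeEncodeError instead of being silently dropped.
    return mapped.encode("ascii")


if __name__ == "__main__":
    sample = "caf\u00e9"
    print(len(sample.encode("utf-8")), len(sample.encode("utf-16-le")))
    print(unicode_to_ascii(sample.encode("utf-8")))
```

The strict encode at the end is deliberate: with a pre-definable character set, a 
character missing from the table is more likely a table error than something to 
discard quietly.  Note that this uses standard UTF-8; it does not account for the 
UniVerse-specific handling mentioned above.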


----- Original Message -----
From: "Dave Taylor" <[EMAIL PROTECTED]>
Date: Sun, 29 Feb 2004 15:51:30 -0800
To: "u2-users" <[EMAIL PROTECTED]>
Subject: Unicode Decoding to ASCII

> Does anyone know of any tools that will run under Windows for decoding a Unicode 
> document to ASCII using a conversion table containing the source language character 
> hex codes and the corresponding ASCII character hex codes?
> 
> This will be used only for decoding documents containing a pre-definable character 
> set (not a translation of free-flowing text).
> 
> This will enable conversion independently of whether or not the mv database in use 
> supports Unicode internally - which many do not.
> 
> Thanks for your help,
> 
> Dave
> 
> Sysmark Information Systems, Inc.
> 800-SYSMARK (800-797-6275)
> www.sysmarkinfo.com
> --
> u2-users mailing list
> [EMAIL PROTECTED]
> http://www.oliver.com/mailman/listinfo/u2-users
> 

