Do these help?

Unicode UTF-8 encoding
http://www1.tip.nl/~t876506/utf8tbl.html

http://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&item_id=IWS-AppendixA
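
For the reverse direction asked about below, the arithmetic is the encoder run backwards: the lead byte says how many bytes to expect and carries the top bits, and each continuation byte contributes six more bits. Here is a rough sketch in Python rather than in the scripting language used below, so treat it as pseudocode to port. The function name utf8_to_code_point and the hex-text input format ("F0 90 90 80") are my own choices for illustration. It checks the byte count and the 10xxxxxx continuation pattern, but it does not reject overlong forms or surrogates.

```python
def utf8_to_code_point(hex_bytes):
    """Decode the UTF-8 bytes of ONE character, given as hex text
    such as "F0 90 90 80", and return the code point as hex text."""
    data = bytes.fromhex(hex_bytes)  # spaces between byte pairs are allowed
    if not data:
        raise ValueError("no bytes given")

    lead = data[0]
    if lead < 0x80:                  # 0xxxxxxx: one byte, 7 payload bits
        length, cp = 1, lead
    elif 0xC0 <= lead < 0xE0:        # 110xxxxx: two bytes, 5 payload bits
        length, cp = 2, lead & 0x1F
    elif 0xE0 <= lead < 0xF0:        # 1110xxxx: three bytes, 4 payload bits
        length, cp = 3, lead & 0x0F
    elif 0xF0 <= lead < 0xF8:        # 11110xxx: four bytes, 3 payload bits
        length, cp = 4, lead & 0x07
    else:
        raise ValueError("invalid lead byte: %02X" % lead)

    if len(data) != length:
        raise ValueError("expected %d bytes, got %d" % (length, len(data)))

    for b in data[1:]:
        if b & 0xC0 != 0x80:         # continuation bytes must be 10xxxxxx
            raise ValueError("invalid continuation byte: %02X" % b)
        cp = (cp << 6) | (b & 0x3F)  # append the next six payload bits

    return format(cp, "X")


print(utf8_to_code_point("F0 90 90 80"))  # 10400
print(utf8_to_code_point("EF BF BF"))     # FFFF
```

The shifts and masks should carry over to a script one-for-one; only the hex parsing and formatting would need that language's own equivalents.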

--- In [email protected], "silvermoonwoman2001" <sheri...@...> wrote:
>
> --- In [email protected], "silvermoonwoman2001" <sherip99@> wrote:
> > 
> > With the help of this page <http://czyborra.com/utf/> I was able
> > to write a script that generates UTF-8 strings directly from
> > Unicode code points. It works well, and it can be called from a
> > for loop that appends each result to a long string of UTF-8
> > characters.
> 
> > Function ReturnUTF8(c)
> > ;c is a code point given as hex digits, e.g., 10400 or FFFF
> > local u
> > local point="U"++c
> > c=eval("0x"++c)
> > if (c < 0x80) do
> >   u = ?"\x"++win.hex(c)
> > elseif (c < 0x800)
> >   u = ?"\x"++ win.hex (0xC0 | c>>6)
> >   u++=?"\x"++win.hex(0x80 | c & 0x3F)
> > elseif (c < 0x10000)
> >   u= ?"\x"++win.hex(0xE0 | c>>12)
> >   u++=?"\x"++win.hex(0x80 | c>>6 & 0x3F)
> >   u++=?"\x"++win.hex(0x80 | c & 0x3F)
> > elseif (c < 0x200000)
> >   u = ?"\x"++win.hex(0xF0 | c>>18)
> >   u++=?"\x"++win.hex(0x80 | c>>12 & 0x3F)
> >   u++=?"\x"++win.hex(0x80 | c>>6 & 0x3F)
> >   u++=?"\x"++win.hex(0x80 | c & 0x3F)
> > endif
> > win.debug(point++": "++u)
> > quit(esc(u,?+\+))
> 
> I would like to reverse the above: given the series of UTF-8 bytes
> that represents a single Unicode code point, return the code
> point. Help?
> 
> Thanks,
> Sheri
>

