Re: Unicode and Friends (Was: JSON)

Andrew Lentvorski Tue, 25 Oct 2005 23:53:33 -0700

Stewart Stremler wrote:

UTF8 does give you that. UTF-16 (or is it UCS-16?) doesn't.


Actually, UTF-16 does, too.

http://www.terena.nl/library/multiling/unicode/utf16.html

"Basically, UTF-16 allows the inclusion of certain UCS-4 codes in aUCS-2 encoded string. It does this by reserving 1024 high-half zonecodes from the BMP (D800 - DBFF) and 1024 low-half zone codes from theBMP (DC00 - DFFF). Using a contiguous pair of codes as:


    <high-half zone code> <low-half zone code>"

This is now supported in Java 1.5.

I prefer to use UTF-8, but UTF-16 is the default in Java.

And Unicode introduces *another* problem -- the problem of too-similiar
glyphs *explodes*.  This is a security issue -- a boon to phishers all
over the world.  If I can't set my locale (or toggle my display) so that
the extended character sequences show up as non-ambiguous character
sequences, I have a problem from the whole mess from the standpoint as
a user.


That's a similar issue to typosquatting.

The issue is a lack of being able to authenticate identity of a web siteunambiguously. Unicode makes that neither better nor worse.


-a


--
[email protected]
http://www.kernel-panic.org/cgi-bin/mailman/listinfo/kplug-lpsg

Re: Unicode and Friends (Was: JSON)

Reply via email to