On Wed, Mar 28, 2007 at 11:05:56PM -0400, Daniel B. wrote: > ???????? wrote: > > > > 2007/3/28, Egmont Koblinger <[EMAIL PROTECTED]>: > > > > > ...f you only handle _texts_ then > > > probably the best approach is to convert every string as soon as they > > > arrive > > > at your application to some Unicode representation (UTF-8 for Perl, > > > "String" > > > (which uses UTF-16) for Java and so on) > > > > Hrm, I think Java needs to be fixed. Their internal utf-16 mandate was > > a mistake, imo. > > Are you aware that Java was created (or frozen) when Unicode required > 16 bits? (It wasn't a mistake at the time.)
Java was introduced in May 1995. UTF-8 existed since September 1992. There was never any excuse for UCS-2/UTF-16 existing at all. Read Thompson & Pike’s UTF-8 paper for details. 〜Rich -- Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/linux-utf8/
