On Wed, Mar 28, 2007 at 11:05:56PM -0400, Daniel B. wrote:
> ???????? wrote:
> > 
> > 2007/3/28, Egmont Koblinger <[EMAIL PROTECTED]>:
> > 
> > > ...f you only handle _texts_ then
> > > probably the best approach is to convert every string as soon as they 
> > > arrive
> > > at your application to some Unicode representation (UTF-8 for Perl, 
> > > "String"
> > > (which uses UTF-16) for Java and so on)
> > 
> > Hrm, I think Java needs to be fixed. Their internal utf-16 mandate was
> > a mistake, imo.
> 
> Are you aware that Java was created (or frozen) when Unicode required
> 16 bits?  (It wasn't a mistake at the time.)

Java was introduced in May 1995. UTF-8 existed since September 1992.
There was never any excuse for UCS-2/UTF-16 existing at all.

Read Thompson & Pike’s UTF-8 paper for details.

〜Rich

--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to