???????? wrote:
>
> 2007/3/28, Egmont Koblinger <[EMAIL PROTECTED]>:
>
> > ...f you only handle _texts_ then
> > probably the best approach is to convert every string as soon as they arrive
> > at your application to some Unicode representation (UTF-8 for Perl, "String"
> > (which uses UTF-16) for Java and so on)
>
> Hrm, I think Java needs to be fixed. Their internal utf-16 mandate was
> a mistake, imo.

Are you aware that Java was created (or frozen) when Unicode required
16 bits? (It wasn't a mistake at the time.)

> Normally, you should not have to ever convert strings between
> encodings.

Then how do you process, say, a multi-part MIME body that has parts
in different character encodings?

> Its just
> not your problem, plus it indroces a ton of potential headaches.
> Just assume your input is in the encoding its supposed to be in.

You never deal with multiple inputs?

Daniel
--
Daniel Barclay
[EMAIL PROTECTED]

--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/
