Peter Firminger skrev:
Unicode isn't a simple fix-all solution. It makes it easy for simple
things like European keyboard inputs (French, German, Spanish etc.)
but once you get to the non-latin charsets it gets difficult. I don't
believe (though I haven't read the docs for a while now) that all the
characters required for a universal solution are included in UTF-8.
From (distant) memory you have to go to something like UTF-16 or
UTF-32 to get anywhere near the number of characters required for all
languages and I don't know that browser support is very good with
those and I don't think they were even intended for web use.
UTF-8, UTF-16 and UTF-32 is only different ways to represent the same data.
http://www.unicode.org/faq/utf_bom.html#5
"UTF-8 is most common on the web. UTF-16 is used by Java and Windows.
UTF-32 is used by various Unix systems. The conversions between all of
them are algorithmically based, fast and lossless."
/Anders
******************************************************
The discussion list for http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list & getting help
******************************************************