The_Monk:
--------------------------------------------------------------------------------
... differences between unicode and UTF-8
--------------------------------------------------------------------------------


Hi, you may want to read e.g. the following article on the difference between
"Unicode" and "UTF-8":
http://www.joelonsoftware.com/articles/Unicode.html

Basically, Unicode isn't an encoding; it's a standardized mapping of "letters"
to unique identifiers, numbers called "code points" (usually written in hex).
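
To illustrate the "letter to code point" idea, here is a minimal Python sketch
(Python is my choice for illustration only; the sample string is arbitrary).
It only looks up the numbers assigned to a few characters; no bytes are
involved at this stage.

```python
# Each character maps to one integer, its code point.
# Escapes are used so the example does not depend on the file's own encoding.
text = "A\u00e9\u20ac"  # LATIN CAPITAL A, E WITH ACUTE, EURO SIGN

# ord() returns the code point; format it in the usual U+XXXX notation.
code_points = [f"U+{ord(ch):04X}" for ch in text]
print(code_points)  # ['U+0041', 'U+00E9', 'U+20AC']

# chr() goes the other way: from code point back to the character.
euro = chr(0x20AC)
print(euro == "\u20ac")  # True
```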

There are several "encodings" that represent these "letters" (or "code points"
in Unicode terms) as bytes, UTF-8 being one of them (PSPad also supports
UTF-16; others should be available in the future).
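
As a rough sketch of what "encoding" means here, the same code point can be
turned into different byte sequences depending on the encoding chosen. The
example below uses Python's built-in codecs; the choice of the euro sign and
of the little-endian UTF-16 variant is just an assumption for the demo.

```python
# One code point, U+20AC (EURO SIGN), encoded in two different ways.
text = "\u20ac"

utf8_bytes = text.encode("utf-8")       # three bytes in UTF-8
utf16_bytes = text.encode("utf-16-le")  # two bytes in UTF-16 (little-endian, no BOM)

print(utf8_bytes.hex())   # e282ac
print(utf16_bytes.hex())  # ac20

# Decoding with the matching encoding gives back the same character.
round_trip_ok = (utf8_bytes.decode("utf-8") == text
                 and utf16_bytes.decode("utf-16-le") == text)
print(round_trip_ok)  # True
```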

I don't have any experience with Java, but I'd find it rather surprising if
there were any problems with using UTF-8, as it is becoming a kind of standard
(also because of its relative space efficiency, esp. for Latin characters, and
its full coverage of all code points defined by the Unicode standard).
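
The space-efficiency point can be checked with a small sketch like the one
below. The sample strings are arbitrary picks of mine, and the sizes apply only
to these samples; UTF-16 is measured without a byte order mark.

```python
# Compare the encoded size of a few sample strings in UTF-8 and UTF-16.
samples = {
    "ascii": "Hello, world",               # plain ASCII letters
    "czech": "P\u0159\u00edli\u0161",      # Latin letters with diacritics
    "cjk": "\u65e5\u672c\u8a9e",           # three CJK ideographs
    "emoji": "\U0001F600",                 # a code point above U+FFFF
}

sizes = {}
for name, s in samples.items():
    # (number of code points, UTF-8 bytes, UTF-16 bytes)
    sizes[name] = (len(s), len(s.encode("utf-8")), len(s.encode("utf-16-le")))

for name, (chars, u8, u16) in sizes.items():
    print(f"{name}: {chars} code points, {u8} bytes UTF-8, {u16} bytes UTF-16")

# UTF-8 can also encode the highest code point, U+10FFFF (four bytes).
last_code_point_len = len(chr(0x10FFFF).encode("utf-8"))
```

For these samples UTF-8 is smaller for the ASCII and Czech text, while the CJK
sample is smaller in UTF-16, so "relative" efficiency depends on the script.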

Unicode is indeed a complex topic; you may find the web site of the Unicode
Consortium helpful, e.g.

What is Unicode
http://www.unicode.org/standard/WhatIsUnicode.html

Code Charts - PDF tables of the supported characters
http://www.unicode.org/charts/

Unicode 5.1 names list (1 MB txt file)
http://www.unicode.org/Public/UNIDATA/NamesList.txt

  vbr

-- 
<http://forum.pspad.com/read.php?2,46017,46018>
PSPad freeware editor http://www.pspad.com
