Hello,
The utf-8 encoding scheme goes like this
for
1-byte characters 0xxxxxxx
2-byte characters 110xxxxx 10xxxxxx
3-byte characters 1110xxxx 10xxxxxx
here the bits marked x are used up for the actuall encoding of characters
i would like to know the way these bits are used to code a particular
charter, also is this dependent on the operating system, can u provide a
program which checks finds this or any link that provides information
about this
Thanks
Jeu George
-
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/lists/