Zvi Har'El wrote:
> > umm, isn't UTF-8 8 bit with occasional 16? :)
> 
> UTF-8 is one, two or three bytes per character. In the Hebrew case, a Hebrew
> character is two bytes.

Of course.
But there are some special Hebrew characters (such as RLM/LRM, etc.)
that are 3.
And theoretically, UTF8 can handle up to 5 bytes.

-- 
Eli Marmor
[EMAIL PROTECTED]
CTO, Founder
Netmask (El-Mar) Internet Technologies Ltd.
__________________________________________________________
Tel.:   +972-9-766-1020          8 Yad-Harutzim St.
Fax.:   +972-9-766-1314          P.O.B. 7004
Mobile: +972-50-23-7338          Kfar-Saba 44641, Israel

=================================================================
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]

Reply via email to