Zvi Har'El wrote: > > umm, isn't UTF-8 8 bit with occasional 16? :) > > UTF-8 is one, two or three bytes per character. In the Hebrew case, a Hebrew > character is two bytes.
Of course. But there are some special Hebrew characters (such as RLM/LRM, etc.) that are 3. And theoretically, UTF8 can handle up to 5 bytes. -- Eli Marmor [EMAIL PROTECTED] CTO, Founder Netmask (El-Mar) Internet Technologies Ltd. __________________________________________________________ Tel.: +972-9-766-1020 8 Yad-Harutzim St. Fax.: +972-9-766-1314 P.O.B. 7004 Mobile: +972-50-23-7338 Kfar-Saba 44641, Israel ================================================================= To unsubscribe, send mail to [EMAIL PROTECTED] with the word "unsubscribe" in the message body, e.g., run the command echo unsubscribe | mail [EMAIL PROTECTED]
