2015-09-27 21:26 GMT+00:00 Páll Haraldsson <pall.haralds...@gmail.com>:

> 2015-09-27 20:29 GMT+00:00 Scott Jones <scott.paul.jo...@gmail.com>:
>
>> If it is mainly in North/South America, Western Europe, or Australia/NZ,
>> UTF-8 does OK.
>> UTF-8 is great for data interchange, but can really slow things down if
>> you have many non-ASCII characters
>>
>
> Did you mean non-BMP? Non-ASCII, but BMP ("European") will take 16 bits,
> same as in UTF-16.
>

Sorry, I was thinking of my own language/Latin1. You can expect three bytes
also, but I would guess balanced out by 1 byte chars..

Anyway, in general UTF-8 can go up to four bytes (5 or 6 are no longer
allowed..). Probably allocating 50% not double for buffers is something
someone thought ok.. I'm not sure standards allow strictly (for SQL)..

-- 
Palli.

Reply via email to