2015-09-27 21:26 GMT+00:00 Páll Haraldsson <pall.haralds...@gmail.com>:
> 2015-09-27 20:29 GMT+00:00 Scott Jones <scott.paul.jo...@gmail.com>: > >> If it is mainly in North/South America, Western Europe, or Australia/NZ, >> UTF-8 does OK. >> UTF-8 is great for data interchange, but can really slow things down if >> you have many non-ASCII characters >> > > Did you mean non-BMP? Non-ASCII, but BMP ("European") will take 16 bits, > same as in UTF-16. > Sorry, I was thinking of my own language/Latin1. You can expect three bytes also, but I would guess balanced out by 1 byte chars.. Anyway, in general UTF-8 can go up to four bytes (5 or 6 are no longer allowed..). Probably allocating 50% not double for buffers is something someone thought ok.. I'm not sure standards allow strictly (for SQL).. -- Palli.