Re: Why UTF-8/16 character encodings?

anonymous Fri, 24 May 2013 10:55:33 -0700

On Friday, 24 May 2013 at 17:05:57 UTC, Joakim wrote:

On Friday, 24 May 2013 at 09:49:40 UTC, Jacob Carlborg wrote:
toUpper/lower cannot be made in place if it should handle all Unicode. Some characters will change their length when convert to/from uppercase. Examples of these are the German double S and some Turkish I.
This triggered a long-standing bugbear of mine: why are we using these variable-length encodings at all? Does anybody really care about UTF-8 being "self-synchronizing," ie does anybody actually use that in this day and age? Sure, it's backwards-compatible with ASCII and the vast majority of usage is probably just ASCII, but that means the other languages don't matter anyway. Not to mention taking the valuable 8-bit real estate for English and dumping the longer encodings on everyone else.


The German ß becomes SS when capitalised, so the string grows from one character to two whatever the encoding. It's not an encoding issue.
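
To make that concrete, here is a rough sketch in Python (not D, purely for illustration; the helper name sizes is made up for this example). It relies on Python's str.upper applying the full Unicode case mapping, and it counts code points and encoded sizes before and after upper-casing:

```python
# Sketch: compare sizes of "ß" and its upper-case form in several encodings.
# `sizes` is a hypothetical helper written for this example only.
def sizes(text):
    """Return (code points, UTF-8 bytes, UTF-16 code units, UTF-32 bytes)."""
    return (
        len(text),                           # number of code points
        len(text.encode("utf-8")),           # bytes in UTF-8
        len(text.encode("utf-16-le")) // 2,  # 16-bit code units in UTF-16
        len(text.encode("utf-32-le")),       # bytes in UTF-32
    )

lower = "ß"
upper = lower.upper()  # full case mapping: one code point becomes two

before = sizes(lower)
after = sizes(upper)
print(upper, before, after)
```

The code point count goes from 1 to 2, so even a fixed-width encoding such as UTF-32 needs twice the space (4 bytes to 8). UTF-8 happens to need 2 bytes in both cases here, which only underlines that the growth comes from the case mapping and not from the encoding.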
