> Date: Mon, 27 Feb 2017 20:24:19 +0000 (GMT) > From: Jan Wedekind <[email protected]> > cc: Eli Zaretskii <[email protected]>, [email protected] > > The encoding support of the Ruby programming language [1] is IMHO pretty > good. It can handle different encodings for source code, input/output, > string variables, and regular expressions. UTF-8 is the preferred encoding > but other encodings are required. E.g. Ruby is used a lot in Japan and > there are many "Kanji" which are currently not covered by UTF-8.
Emacs solves the latter problem as well, by using codepoints beyond the end of the Unicode range. (Don't forget that the Emacs m17n features were designed and implemented by people who came from Japan.) The advantage of the Emacs solution is that the internal representation is still (a superset of) UTF-8, even though the byte sequences for these codepoints could be longer than the maximum of the standard UTF-8.
