On Sat, 21 Mar 2020 13:33:18 -0600
Doug Ewell via Unicode wrote:
> Eli Zaretskii wrote:
> > Emacs uses some of that for supporting charsets that cannot be
> > mapped into Unicode. GB18030 is one example of such charsets. The
> > internal representation of characters in Emacs is UTF-8, so it
Eli Zaretskii wrote:
>> When 137,468 private-use characters aren't enough?
>
> Why is that relevant to the issue at hand?
You're right. I did ask what the uses of non-standard UTF-8 were, and you gave
me an example.
> I don't remember off hand, but last time I looked at GB18030, there
> were a
On 2020-03-21, Eli Zaretskii via Unicode wrote:
>> Date: Sat, 21 Mar 2020 11:13:40 -0600
>> From: Doug Ewell via Unicode
>>
>> Adam Borowski wrote:
>>
>> > Also, UTF-8 can carry more than Unicode -- for example, U+D800..U+DFFF
>> > or U+11000..U+7FFF (or possibly even up to 2³⁶ or 2⁴²),
> From: "Doug Ewell"
> Cc:
> Date: Sat, 21 Mar 2020 13:33:18 -0600
>
> > Emacs uses some of that for supporting charsets that cannot be mapped
> > into Unicode. GB18030 is one example of such charsets. The internal
> > representation of characters in Emacs is UTF-8, so it uses 5-byte
> >
Eli Zaretskii wrote:
>>> Also, UTF-8 can carry more than Unicode -- for example,
>>> U+D800..U+DFFF or U+11000..U+7FFF (or possibly even up to 2³⁶ or
>>> 2⁴²), which has its uses but is not well-formed Unicode.
>>
>> I'd be interested in your elaboration on what these uses are.
>
> Emacs uses
> Date: Sat, 21 Mar 2020 11:13:40 -0600
> From: Doug Ewell via Unicode
>
> Adam Borowski wrote:
>
> > Also, UTF-8 can carry more than Unicode -- for example, U+D800..U+DFFF
> > or U+11000..U+7FFF (or possibly even up to 2³⁶ or 2⁴²), which has
> > its uses but is not well-formed Unicode.
>
Adam Borowski wrote:
> Also, UTF-8 can carry more than Unicode -- for example, U+D800..U+DFFF
> or U+11000..U+7FFF (or possibly even up to 2³⁶ or 2⁴²), which has
> its uses but is not well-formed Unicode.
I'd be interested in your elaboration on what these uses are.
--
Doug Ewell |
7 matches
Mail list logo