On Jul 24 09:28, Brian Inglis via Cygwin wrote:
> On 2025-07-24 04:30, Corinna Vinschen via Cygwin wrote:
> > Or shall we simply go along with CESU-8 when converting back to multibyte
> > to keep the string the same as with wcstombs?
> 
> There are 15 times as many SMP as BMP characters, so many non-Western and emoji
> characters will be expanded from 4 UTF-8 bytes to 6 CESU-8 bytes, and this
> is not supported anywhere as a string representation, designed for internal
> use only per the TR.

We're only talking about invalid sequences, not using CESU-8 throughout.


Corinna

-- 
Problem reports:      https://cygwin.com/problems.html
FAQ:                  https://cygwin.com/faq/
Documentation:        https://cygwin.com/docs.html
Unsubscribe info:     https://cygwin.com/ml/#unsubscribe-simple
